TheDrummer/Tiger-Gemma-9B-v3

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:9BQuant:FP8Ctx Length:16kPublished:Oct 4, 2024Architecture:Transformer0.1K Warm

TheDrummer/Tiger-Gemma-9B-v3 is a 9 billion parameter language model based on the Gemma architecture, featuring a 16384-token context length. This model is specifically fine-tuned using SPPO with a unique dataset designed to remove unwanted conversational 'yapping' and 'evil' content. It aims to provide a 'decensored' experience, making it suitable for applications requiring unfiltered or less restricted text generation.

Loading preview...

Overview

TheDrummer/Tiger-Gemma-9B-v3 is a 9 billion parameter language model built upon the Gemma architecture, offering a substantial 16384-token context window. This iteration, version 3, has undergone specific fine-tuning using the SPPO (Supervised Preference Optimization) method.

Key Capabilities

  • Decensored Output: The model is explicitly designed to produce 'decensored' content, having been trained on a dataset curated to remove what the developer describes as 'yapping and evil'.
  • Gemma Architecture: Leverages the foundational capabilities of the Gemma model family.
  • Extended Context Length: Supports a 16384-token context, allowing for processing and generating longer sequences of text.

Good For

  • Use cases requiring a less restricted or unfiltered language model output.
  • Applications where the removal of specific conversational patterns or 'evil' content, as defined by the developer's dataset, is desired.
  • Developers seeking a Gemma-based model with a focus on direct and unconstrained text generation.