lunahr/SystemGemma2-9b-it

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:9BQuant:FP8Ctx Length:16kLicense:gemmaArchitecture:Transformer0.0K Warm

The lunahr/SystemGemma2-9b-it is a 9 billion parameter instruction-tuned decoder-only large language model from Google's Gemma 2 family, optimized for text generation tasks such as question answering, summarization, and reasoning. This version specifically enables system prompts, building upon the base Gemma 2 9B model. Trained on 8 trillion tokens, it offers state-of-the-art performance for its size, suitable for deployment in resource-limited environments.

Loading preview...

lunahr/SystemGemma2-9b-it: An Instruction-Tuned Gemma 2 Model

This model is a 9 billion parameter instruction-tuned variant of Google's Gemma 2 family, specifically configured to enable system prompts. Gemma models are lightweight, state-of-the-art open models built from the same research and technology as the Gemini models, designed for text-to-text generation.

Key Capabilities

  • Text Generation: Excels at various text generation tasks, including question answering, summarization, and reasoning.
  • System Prompt Support: This particular version is adapted to work effectively with system prompts, enhancing conversational use cases.
  • Efficiency: Its relatively small size (9B parameters) allows for deployment in environments with limited resources, such as laptops, desktops, or private cloud infrastructure.
  • Robust Training: Trained on 8 trillion tokens, including web documents, code, and mathematical texts, ensuring broad linguistic understanding and task versatility.

Performance Highlights

Evaluated across a range of benchmarks, the Gemma 2 9B model demonstrates strong performance:

  • MMLU: 71.3 (5-shot, top-1)
  • HumanEval: 40.2 (pass@1)
  • GSM8K: 68.6 (5-shot, maj@1)

Intended Usage

  • Content Creation: Generating creative text formats, marketing copy, and email drafts.
  • Conversational AI: Powering chatbots and virtual assistants.
  • Research & Education: Supporting NLP research, language learning tools, and knowledge exploration.

Limitations

Users should be aware of potential limitations related to training data biases, factual accuracy (LLMs are not knowledge bases), and challenges with highly complex tasks or subtle language nuances.