google/gemma-2-9b-it

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:9BQuant:FP8Ctx Length:16kPublished:Jun 24, 2024License:gemmaArchitecture:Transformer0.8K Gated Warm

Gemma 2 9B IT is a 9 billion parameter instruction-tuned decoder-only large language model developed by Google. Built from the same research as Gemini models, it is optimized for a variety of text generation tasks including question answering, summarization, and reasoning. Its relatively small size and open weights make it suitable for deployment in resource-limited environments like desktops or personal cloud infrastructure.

Loading preview...

Overview

Gemma 2 9B IT is a 9 billion parameter instruction-tuned model from Google's Gemma family of lightweight, open large language models. These models are built using the same research and technology as the Gemini models, offering state-of-the-art capabilities in a more accessible package. Gemma models are text-to-text, decoder-only architectures, available with open weights for both pre-trained and instruction-tuned variants.

Key Capabilities

  • Text Generation: Excels at generating various text formats, including creative content, code, and email drafts.
  • Conversational AI: Suitable for powering chatbots, virtual assistants, and interactive applications.
  • Text Summarization: Can create concise summaries of documents, research papers, and reports.
  • Reasoning: Demonstrates strong performance in reasoning tasks, as evidenced by benchmarks like MMLU (71.3% for the pre-trained 9B model).
  • Code Generation: Achieves a pass@1 score of 40.2% on HumanEval for the pre-trained 9B model, indicating proficiency in code-related tasks.
  • Resource-Efficient Deployment: Its smaller size allows for deployment on devices with limited resources, such as laptops, desktops, or personal cloud infrastructure.

Training and Evaluation

The 9B model was trained on 8 trillion tokens, encompassing a diverse dataset of web documents, code, and mathematical text. Training utilized Google's latest Tensor Processing Unit (TPUv5p) hardware and JAX with ML Pathways software. The model underwent rigorous ethics and safety evaluations, including assessments for content safety, representational harms, memorization, and large-scale harms, meeting internal policy thresholds.

Intended Usage

Gemma 2 9B IT is designed for a wide range of applications, including content creation, conversational AI, text summarization, NLP research, language learning tools, and knowledge exploration. Its open nature fosters innovation and democratizes access to advanced AI capabilities.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p