google/gemma-2b
TEXT GENERATIONConcurrency Cost:1Model Size:2.6BQuant:BF16Ctx Length:8kPublished:Feb 8, 2024License:gemmaArchitecture:Transformer1.2K Gated Warm

Gemma-2b is a 2.6 billion parameter, decoder-only large language model developed by Google, built from the same research and technology as the Gemini models. It is designed for text-to-text generation tasks such as question answering, summarization, and reasoning, with a context length of 8192 tokens. Its compact size makes it suitable for deployment in resource-limited environments like laptops, desktops, or personal cloud infrastructure, democratizing access to advanced AI capabilities.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p