google/gemma-3-1b-it-qat-q4_0-unquantized
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kPublished:Apr 8, 2025License:gemmaArchitecture:Transformer0.0K Gated Warm

The google/gemma-3-1b-it-qat-q4_0-unquantized model is a 1 billion parameter instruction-tuned variant from the Gemma 3 family, developed by Google DeepMind. This multimodal model handles text and image inputs, generating text outputs, and is specifically designed for efficient deployment in resource-constrained environments. Utilizing Quantization Aware Training (QAT), this unquantized checkpoint is intended for Q4_0 quantization to significantly reduce memory requirements while preserving quality. It excels in text generation, image understanding, question answering, summarization, and reasoning across over 140 languages.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p