google/gemma-3-4b-it-qat-q4_0-unquantized
TEXT GENERATIONConcurrency Cost:1Model Size:4.3BQuant:BF16Ctx Length:32kPublished:Apr 8, 2025License:gemma Vision Architecture:Transformer0.0K Gated Warm

The google/gemma-3-4b-it-qat-q4_0-unquantized model is a 4.3 billion parameter instruction-tuned variant from Google's Gemma 3 family of lightweight, open multimodal models. Built with technology from Gemini models, it handles text and image inputs, generating text outputs, and features a 32K token context window. This specific version is optimized using Quantization Aware Training (QAT) to maintain quality while reducing memory footprint, making it suitable for resource-constrained environments and a variety of text generation and image understanding tasks.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p