unsloth/gemma-3-12b-it-qat-int4
VISION | Concurrency Cost: 1 | Model Size: 12B | Quant: FP8 | Ctx Length: 32k | License: gemma | Architecture: Transformer | 0.0K | Cold

The unsloth/gemma-3-12b-it-qat-int4 model is a 12-billion-parameter instruction-tuned variant of Google DeepMind's Gemma 3 family. It was trained with quantization-aware training (QAT), so the weights can be stored at low precision (int4) with a smaller memory footprint and little loss in quality. The model is multimodal: it accepts text and image inputs and generates text outputs. Gemma 3 supports a 128K-token context window and more than 140 languages; this listing specifies a 32k context length. Typical tasks include question answering, summarization, reasoning, and image understanding, and the reduced footprint makes the model suitable for deployment in resource-constrained environments.
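
To give a rough sense of what the reduced memory footprint means, the sketch below estimates the memory needed for the weights alone at several precisions. It is back-of-the-envelope arithmetic, not a measurement: the function name `weight_memory_gib` is made up for illustration, the parameter count is taken as the nominal 12B from the listing, and activations, KV cache, the vision encoder's runtime buffers, and quantization metadata (scales, zero points) are all ignored.

```python
def weight_memory_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights only, in GiB.

    Ignores activations, KV cache, and per-group quantization
    metadata, so real usage will be higher.
    """
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / (1024 ** 3)


# Assumption: nominal 12B parameters, as stated in the listing.
NUM_PARAMS = 12e9

for label, bits in [("bf16", 16), ("fp8", 8), ("int4", 4)]:
    print(f"{label}: ~{weight_memory_gib(NUM_PARAMS, bits):.1f} GiB")
```

Under these assumptions the weights take about 22.4 GiB at bf16, 11.2 GiB at 8 bits, and 5.6 GiB at 4 bits, which is why a 4-bit QAT checkpoint can fit on hardware that a 16-bit checkpoint cannot.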
