dwnmf/gemma_3_4b_opus_distilled
Vision | Concurrency Cost: 1 | Model Size: 4.3B | Quant: BF16 | Ctx Length: 32k | Published: Mar 20, 2026 | Architecture: Transformer

The dwnmf/gemma_3_4b_opus_distilled model is a 4.3 billion parameter language model that dwnmf fine-tuned and converted to GGUF format using Unsloth. It is based on the Gemma architecture and supports a context length of 32,768 tokens. The GGUF release includes configurations for both text-only and multimodal use, with a particular focus on vision model integration for platforms like Ollama.
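As a rough illustration of the text-only and vision paths mentioned above, the following sketch builds requests for a locally running Ollama server through its `/api/generate` endpoint. It assumes the GGUF files have already been imported into Ollama; the tag `gemma_3_4b_opus_distilled` and the helper names `build_payload` and `generate` are hypothetical and not taken from the model card. Only the 32,768 token context length comes from the description.

```python
import base64
import json
import urllib.request

# Ollama's default local endpoint for single-turn generation.
OLLAMA_URL = "http://localhost:11434/api/generate"
# Hypothetical local tag: replace with the name used when importing the GGUF.
MODEL_TAG = "gemma_3_4b_opus_distilled"
# Context length stated in the model description.
CONTEXT_LENGTH = 32768


def build_payload(prompt, image_bytes=None, model=MODEL_TAG, num_ctx=CONTEXT_LENGTH):
    """Build a request body; attach a base64 image only for vision prompts."""
    payload = {
        "model": model,
        "prompt": prompt,
        "stream": False,  # ask for one JSON object instead of a stream
        "options": {"num_ctx": num_ctx},
    }
    if image_bytes is not None:
        # Ollama expects images as a list of base64-encoded strings.
        payload["images"] = [base64.b64encode(image_bytes).decode("ascii")]
    return payload


def generate(payload, url=OLLAMA_URL):
    """Send the payload to a running Ollama server and return the reply text."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

A text-only call would pass just a prompt, for example `generate(build_payload("Summarize GGUF in one sentence."))`, while a vision call would also pass the raw bytes of an image file as `image_bytes`. Both calls require the server to be running and the model to be available under the chosen tag.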
