dwnmf/gemma_3_4b_opus_distilled

Hugging Face
Vision · Concurrency cost: 1 · Model size: 4.3B · Quant: BF16 · Context length: 32k · Published: Mar 20, 2026 · Architecture: Transformer

The dwnmf/gemma_3_4b_opus_distilled model is a 4.3-billion-parameter language model, fine-tuned and converted to GGUF format by dwnmf using Unsloth. Based on the Gemma architecture, it supports a 32,768-token context length. The release is notable for its GGUF compatibility and ships configurations for both text-only and multimodal use, with a particular focus on vision model integration for platforms like Ollama.


Overview

dwnmf/gemma_3_4b_opus_distilled is a 4.3-billion-parameter model, fine-tuned and converted into the GGUF format by dwnmf using the Unsloth framework for faster training. It is designed for efficient local deployment with llama-cli for text-only tasks and llama-mtmd-cli for multimodal applications.
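As a sketch of the two invocation paths, the commands below use llama.cpp's `llama-cli` and `llama-mtmd-cli`; the GGUF filenames and the image path are assumptions for illustration, not names taken from this repository:

```shell
# Text-only inference (hypothetical filename for the BF16 GGUF)
llama-cli -m gemma_3_4b_opus_distilled-BF16.gguf \
  -c 32768 \
  -p "Summarize the Gemma 3 architecture in two sentences."

# Multimodal inference: the vision projector is passed separately
# via --mmproj (filenames assumed; see the repo's actual file list)
llama-mtmd-cli -m gemma_3_4b_opus_distilled-BF16.gguf \
  --mmproj BF16-mmproj.gguf \
  --image photo.png \
  -p "Describe this image."
```

Note that `llama-mtmd-cli` keeps the language model and the vision projector as separate files, which is why the Ollama workflow below differs.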

Key Capabilities

  • GGUF Compatibility: Provided in GGUF format, making it suitable for local inference with tools like llama-cli.
  • Multimodal Support: Includes configurations for multimodal use, specifically with BF16-mmproj.gguf for vision tasks.
  • Ollama Integration: Specific instructions are provided for creating unified BF16 models for use with Ollama, addressing its current lack of separate mmproj file support.
  • Optimized Training: Benefits from Unsloth's optimizations, enabling 2x faster training.
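For the Ollama path, the model card's conversion steps produce a single unified BF16 GGUF (since Ollama does not currently accept a separate mmproj file). A minimal sketch of registering such a unified file with Ollama follows; the filename and model name are assumptions, and the unified GGUF itself is assumed to have been produced per the repository's instructions:

```shell
# Minimal Modelfile pointing at the unified BF16 GGUF
# (filename is hypothetical)
cat > Modelfile <<'EOF'
FROM ./gemma_3_4b_opus_distilled-unified-BF16.gguf
EOF

# Register and run the model under a local name
ollama create gemma3-opus-distilled -f Modelfile
ollama run gemma3-opus-distilled "Describe this image." 
```

With a unified GGUF, `ollama run` can accept image inputs directly in the prompt session; with a text-only GGUF the same commands work but vision prompts are ignored.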

Good for

  • Developers seeking a Gemma-based model in GGUF format for local deployment.
  • Applications requiring a 4.3B-parameter model with a 32,768-token context for both text and vision tasks.
  • Users looking to integrate a vision-capable model with Ollama, following the provided conversion steps.