kairawal/Gemma-3-4B-IT-PT-SynthDolly-r16alpha128-E5-S73

Hugging Face
VISIONConcurrency Cost:1Model Size:4.3BQuant:BF16Ctx Length:32kPublished:May 22, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

kairawal/Gemma-3-4B-IT-PT-SynthDolly-r16alpha128-E5-S73 is a 4.3 billion parameter Gemma-3 instruction-tuned causal language model developed by kairawal. This model was fine-tuned using Unsloth and Huggingface's TRL library, resulting in a 2x faster training process. It is designed for general instruction-following tasks, leveraging its efficient training methodology.

Loading preview...

Model Overview

This model, kairawal/Gemma-3-4B-IT-PT-SynthDolly-r16alpha128-E5-S73, is a 4.3 billion parameter instruction-tuned variant of the Gemma-3 architecture. Developed by kairawal, it was fine-tuned from the unsloth/gemma-3-4b-it base model.

Key Characteristics

  • Architecture: Based on the Gemma-3 model family.
  • Parameter Count: 4.3 billion parameters.
  • Context Length: Supports a context length of 32768 tokens.
  • Training Efficiency: Fine-tuned using Unsloth and Huggingface's TRL library, which enabled a 2x faster training process compared to standard methods.

Intended Use Cases

This model is suitable for general instruction-following tasks where a Gemma-3 based model with efficient fine-tuning is beneficial. Its optimized training process suggests potential for applications requiring rapid iteration or deployment of instruction-tuned models.