kairawal/Gemma-3-4B-IT-ZH-SynthDolly-r16alpha128-E5-S73

Hugging Face
VISIONConcurrency Cost:1Model Size:4.3BQuant:BF16Ctx Length:32kPublished:May 22, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The kairawal/Gemma-3-4B-IT-ZH-SynthDolly-r16alpha128-E5-S73 is a 4.3 billion parameter instruction-tuned language model, fine-tuned from unsloth/gemma-3-4b-it. Developed by kairawal, this model was trained using Unsloth and Huggingface's TRL library, enabling faster training. It is designed for general language generation tasks, leveraging its Gemma architecture and instruction-tuning for improved performance.

Loading preview...

Model Overview

The kairawal/Gemma-3-4B-IT-ZH-SynthDolly-r16alpha128-E5-S73 is an instruction-tuned language model with approximately 4.3 billion parameters and a context length of 32768 tokens. It was developed by kairawal and fine-tuned from the unsloth/gemma-3-4b-it base model.

Key Characteristics

  • Base Architecture: Fine-tuned from the Gemma 3.4B instruction-tuned model.
  • Training Efficiency: Utilizes Unsloth and Huggingface's TRL library, which facilitated a 2x faster training process.
  • License: Distributed under the Apache-2.0 license.

Potential Use Cases

This model is suitable for various natural language processing applications that benefit from instruction-following capabilities. Its efficient training methodology suggests a focus on practical deployment and performance. Developers looking for a Gemma-based model with optimized training and instruction-tuning for general language tasks may find this model useful.