kairawal/Gemma-3-4B-IT-ZH-SynthDolly-r16alpha32-E1-S73

VISIONConcurrency Cost:1Model Size:4.3BQuant:BF16Ctx Length:32kPublished:May 11, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

kairawal/Gemma-3-4B-IT-ZH-SynthDolly-r16alpha32-E1-S73 is a 4.3 billion parameter Gemma-3 instruction-tuned model developed by kairawal. This model was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. It is designed for general instruction-following tasks, leveraging the Gemma architecture's capabilities.

Loading preview...

Model Overview

kairawal/Gemma-3-4B-IT-ZH-SynthDolly-r16alpha32-E1-S73 is an instruction-tuned language model based on the Gemma-3 architecture, featuring 4.3 billion parameters and a 32768-token context length. Developed by kairawal, this model was fine-tuned from unsloth/gemma-3-4b-it.

Key Characteristics

  • Architecture: Based on the Gemma-3 model family.
  • Parameter Count: 4.3 billion parameters.
  • Context Length: Supports a context window of 32768 tokens.
  • Training Efficiency: Fine-tuned using Unsloth and Huggingface's TRL library, which facilitated a 2x faster training process.
  • License: Distributed under the Apache-2.0 license.

Use Cases

This model is suitable for various instruction-following applications, benefiting from its efficient fine-tuning process and the underlying Gemma-3 architecture. Its capabilities are geared towards general language understanding and generation tasks where instruction adherence is crucial.