kairawal/Gemma-3-4B-IT-ZH-SynthDolly-r16alpha32-E1-S73
kairawal/Gemma-3-4B-IT-ZH-SynthDolly-r16alpha32-E1-S73 is a 4.3 billion parameter Gemma-3 instruction-tuned model developed by kairawal. This model was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. It is designed for general instruction-following tasks, leveraging the Gemma architecture's capabilities.
Loading preview...
Model Overview
kairawal/Gemma-3-4B-IT-ZH-SynthDolly-r16alpha32-E1-S73 is an instruction-tuned language model based on the Gemma-3 architecture, featuring 4.3 billion parameters and a 32768-token context length. Developed by kairawal, this model was fine-tuned from unsloth/gemma-3-4b-it.
Key Characteristics
- Architecture: Based on the Gemma-3 model family.
- Parameter Count: 4.3 billion parameters.
- Context Length: Supports a context window of 32768 tokens.
- Training Efficiency: Fine-tuned using Unsloth and Huggingface's TRL library, which facilitated a 2x faster training process.
- License: Distributed under the Apache-2.0 license.
Use Cases
This model is suitable for various instruction-following applications, benefiting from its efficient fine-tuning process and the underlying Gemma-3 architecture. Its capabilities are geared towards general language understanding and generation tasks where instruction adherence is crucial.