kairawal/Gemma-3-4B-IT-ES-SynthDolly-r16alpha128-E8-S73

Hugging Face
VISIONConcurrency Cost:1Model Size:4.3BQuant:BF16Ctx Length:32kPublished:May 24, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The kairawal/Gemma-3-4B-IT-ES-SynthDolly-r16alpha128-E8-S73 is a 4.3 billion parameter instruction-tuned Gemma model, fine-tuned by kairawal. This model was trained using Unsloth and Huggingface's TRL library, enabling faster training. It is designed for general instruction-following tasks, leveraging its Gemma architecture and a 32768 token context length. Its primary strength lies in efficient processing of conversational prompts due to its instruction-tuned nature.

Loading preview...

Model Overview

This model, kairawal/Gemma-3-4B-IT-ES-SynthDolly-r16alpha128-E8-S73, is a 4.3 billion parameter instruction-tuned variant of the Gemma architecture. Developed by kairawal, it was fine-tuned from the unsloth/gemma-3-4b-it base model.

Key Characteristics

  • Architecture: Based on the Gemma family of models.
  • Parameter Count: 4.3 billion parameters, offering a balance between performance and computational efficiency.
  • Context Length: Supports a substantial context window of 32768 tokens, allowing for processing longer inputs and maintaining conversational coherence.
  • Training Efficiency: The fine-tuning process utilized Unsloth and Huggingface's TRL library, which facilitated 2x faster training compared to standard methods.

Use Cases and Strengths

This model is particularly well-suited for:

  • Instruction Following: Excels at understanding and executing user instructions due to its instruction-tuned nature.
  • Conversational AI: Its large context window and instruction-following capabilities make it suitable for dialogue systems and chatbots.
  • Efficient Deployment: The use of Unsloth for training suggests potential for optimized inference, making it a good candidate for applications where speed and resource efficiency are important.