kairawal/Llama-3.2-3B-Instruct-ZH-SynthDolly-r16alpha128-E5-S73

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:May 21, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The kairawal/Llama-3.2-3B-Instruct-ZH-SynthDolly-r16alpha128-E5-S73 is a 3.2 billion parameter Llama-based instruction-tuned language model developed by kairawal, finetuned from unsloth/llama-3.2-3b-Instruct. This model was trained 2x faster using Unsloth and Huggingface's TRL library, offering efficient performance for its size. With a 32768 token context length, it is designed for general instruction-following tasks.

Loading preview...

Model Overview

The kairawal/Llama-3.2-3B-Instruct-ZH-SynthDolly-r16alpha128-E5-S73 is a 3.2 billion parameter instruction-tuned language model developed by kairawal. It is finetuned from the unsloth/llama-3.2-3b-Instruct base model and utilizes a substantial 32768 token context length, making it suitable for processing longer inputs and generating comprehensive responses.

Key Characteristics

  • Architecture: Based on the Llama family, providing a robust foundation for language understanding and generation.
  • Parameter Count: Features 3.2 billion parameters, balancing performance with computational efficiency.
  • Training Efficiency: This model was notably trained 2x faster by leveraging the Unsloth library in conjunction with Huggingface's TRL library, indicating an optimized training process.
  • Context Length: Supports a 32768 token context window, allowing for detailed and context-aware interactions.

Intended Use Cases

This model is primarily designed for general instruction-following tasks, benefiting from its instruction-tuned nature and extended context window. Its efficient training methodology suggests it could be a good candidate for applications where rapid deployment and resource optimization are important.