spar-project/Llama-3.2-3B-Instruct-attention-layers
Text Generation | Model size: 3.2B | Quantization: BF16 | Context length: 32k | Published: Mar 25, 2026 | License: apache-2.0 | Architecture: Transformer | Open weights | Concurrency cost: 1
spar-project/Llama-3.2-3B-Instruct-attention-layers is a 3.2-billion-parameter instruction-tuned Llama model developed by spar-project. It was finetuned from unsloth/Llama-3.2-3B-Instruct using Unsloth together with Hugging Face's TRL library for faster training, and is intended for general instruction-following tasks.
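Below is a minimal inference sketch using the standard Hugging Face transformers API. The repo id comes from this card; the prompt and generation settings are illustrative assumptions, not values specified by the model authors.

```python
# Minimal sketch: load the model in BF16 (as listed on this card) and run chat-style generation.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "spar-project/Llama-3.2-3B-Instruct-attention-layers"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # the card lists BF16 weights
    device_map="auto",
)

# Llama 3.2 Instruct models ship a chat template; apply it via the tokenizer.
messages = [
    {"role": "user", "content": "Summarize what attention layers do in a transformer."}
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=256)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```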