spar-project/Qwen2.5-7B-Instruct-layers-16-24-smaller-lr
Type: Text generation
Concurrency cost: 1
Model size: 7.6B
Quant: FP8
Ctx length: 32k
Published: Apr 1, 2026
License: apache-2.0
Architecture: Transformer
Tags: Open Weights, Cold

spar-project/Qwen2.5-7B-Instruct-layers-16-24-smaller-lr is a 7.6 billion parameter instruction-tuned Qwen2.5 model developed by spar-project and fine-tuned from unsloth/Qwen2.5-7B-Instruct. The model was trained with Unsloth and Hugging Face's TRL library, with a focus on faster training. It supports a 32,768-token context length, making it suitable for applications that need to process longer sequences efficiently.
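
As an instruction-tuned Qwen2.5 variant, the model can be loaded with the standard Transformers chat workflow. The sketch below is a minimal usage example, assuming the weights are published on the Hugging Face Hub under the repo id above and that the tokenizer ships the usual Qwen2.5 chat template; the prompt text is purely illustrative.

```python
# Minimal inference sketch (assumes the repo id resolves on the Hugging Face Hub).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "spar-project/Qwen2.5-7B-Instruct-layers-16-24-smaller-lr"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # pick the checkpoint's native precision
    device_map="auto",    # place weights on available GPU(s)/CPU
)

# Format a single-turn conversation with the model's chat template.
messages = [{"role": "user", "content": "Summarize the benefits of long-context models."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(input_ids, max_new_tokens=256)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```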
