osieosie/Qwen2_5-7B-Instruct_qwen2_5-7b-s1k-sft-full-s42-e1-lr2e_5
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Jan 20, 2026Architecture:Transformer Cold

This is a 7.6 billion parameter instruction-tuned language model, fine-tuned from Qwen/Qwen2.5-7B-Instruct. Developed by osieosie, it leverages a 131,072-token context length, making it suitable for applications requiring extensive contextual understanding. The model is optimized for following instructions effectively, building upon the robust Qwen2.5 architecture.

Loading preview...