ARAVIND8179986644/model_sft_resta
Text Generation | Concurrency Cost: 1 | Model Size: 1.5B | Quant: BF16 | Ctx Length: 32k | Published: Apr 5, 2026 | Architecture: Transformer | Cold

ARAVIND8179986644/model_sft_resta is a 1.5-billion-parameter language model created by ARAVIND8179986644 with the Task Arithmetic merge method. It uses Qwen/Qwen2.5-1.5B-Instruct as the base model and merges two fine-tuned components into it. The model targets applications that need a compact model with a 32,768-token context length.
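
For readers unfamiliar with Task Arithmetic, the following is a minimal sketch of the general idea, not the author's actual merge recipe: each fine-tuned model contributes a "task vector" (its weights minus the base weights), and the scaled task vectors are added back onto the base. The function name `task_arithmetic_merge`, the toy parameter name `layer.weight`, the plain-list weight format, and the scaling weights of 1.0 are all assumptions made for illustration; the card does not state which components or coefficients were used.

```python
from typing import Dict, List

# Toy stand-in for a model state dict: parameter name -> flat list of floats.
StateDict = Dict[str, List[float]]


def task_arithmetic_merge(
    base: StateDict,
    finetuned: List[StateDict],
    weights: List[float],
) -> StateDict:
    """Return base + sum_i weights[i] * (finetuned[i] - base), per parameter."""
    if len(finetuned) != len(weights):
        raise ValueError("need exactly one weight per fine-tuned model")

    merged: StateDict = {}
    for name, base_values in base.items():
        out = list(base_values)  # start from a copy of the base weights
        for model, w in zip(finetuned, weights):
            ft_values = model[name]
            for i, (b, f) in enumerate(zip(base_values, ft_values)):
                out[i] += w * (f - b)  # add the scaled task vector
        merged[name] = out
    return merged


# Hypothetical toy weights: one base model and two fine-tuned variants.
base = {"layer.weight": [1.0, 2.0, 3.0]}
ft_a = {"layer.weight": [1.5, 2.0, 2.0]}
ft_b = {"layer.weight": [1.0, 3.0, 3.0]}

merged = task_arithmetic_merge(base, [ft_a, ft_b], [1.0, 1.0])
print(merged["layer.weight"])  # [1.5, 3.0, 2.0]
```

In this toy case each fine-tuned variant changes different entries, so the merged weights pick up both changes. With real checkpoints the same arithmetic would be applied tensor by tensor, and the scaling weights are typically tuned rather than fixed at 1.0.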
