sumith2425/model_sft_resta
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Mar 18, 2026Architecture:Transformer Cold

sumith2425/model_sft_resta is a 1.5 billion parameter language model created by sumith2425 using the Task Arithmetic merge method. It is based on Qwen/Qwen2.5-1.5B-Instruct and combines two merged models: 'harmful_merged_model' and 'sft_merged_model'. This model is designed for specific applications derived from its merged components, offering a 32768 token context length.

Loading preview...