ARAVIND8179986644/model_sft_dare_resta
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Apr 5, 2026Architecture:Transformer Cold

ARAVIND8179986644/model_sft_dare_resta is a 1.5 billion parameter language model with a 32768 token context length, created by ARAVIND8179986644 using the Task Arithmetic merge method. It is based on Qwen/Qwen2.5-1.5B-Instruct and incorporates components from ARAVIND8179986644/model_sft_dare and a local model named 'harmful_full'. This model is specifically designed through a merging process to combine and potentially adjust characteristics from its constituent models.

Loading preview...