Athkal/model-sft-dare
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Mar 22, 2026Architecture:Transformer Warm

Athkal/model-sft-dare is a merged language model created by Athkal using the Linear DARE method, based on Qwen/Qwen2.5-1.5B-Instruct. This model integrates a fine-tuned component from '/kaggle/working/model_sft_lora' to enhance specific capabilities. It is designed for tasks benefiting from a merged architecture, leveraging the strengths of its constituent models.

Loading preview...