aolans/Qwen2.5-7B-Instruct-SDFT-fp16
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Feb 27, 2026License:apache-2.0Architecture:Transformer Open Weights Cold
aolans/Qwen2.5-7B-Instruct-SDFT-fp16 is a 7.6 billion parameter instruction-tuned model based on Qwen/Qwen2.5-7B-Instruct, fine-tuned to enhance multi-turn agent task performance. It is specifically optimized for complex tasks like household automation (ALFWorld) and database operations (DBBench), learning from environment observations, action selection, and tool use. This model incorporates experimental training techniques, SDFT and Epiplexity, aimed at improving reasoning capabilities, and is provided in fp16 format for direct loading.
Loading preview...