tripathysagar/Qwen2.5-0.5B-GSM8K-SFT
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Feb 23, 2026Architecture:Transformer Warm

tripathysagar/Qwen2.5-0.5B-GSM8K-SFT is a 0.5 billion parameter Qwen2.5 model fine-tuned by tripathysagar for mathematical reasoning. This model specializes in solving GSM8K-style math problems, providing step-by-step solutions and a structured numerical answer format. It is optimized for tasks requiring precise arithmetic and logical deduction.

Loading preview...