Ilia2003Mah/qwen2.5_1.5b-gsm8k-test-step1000
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Mar 16, 2026Architecture:Transformer Loading

The Ilia2003Mah/qwen2.5_1.5b-gsm8k-test-step1000 model is a 1.5 billion parameter language model, likely based on the Qwen2.5 architecture, developed by Ilia2003Mah. This model is specifically fine-tuned for mathematical reasoning tasks, indicated by its GSM8K dataset focus. Its primary strength lies in numerical problem-solving, making it suitable for applications requiring arithmetic and logical deduction.

Loading preview...