Ilia2003Mah/qwen2.5-1.5b-gsm8k-test-step500
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Mar 18, 2026Architecture:Transformer Warm

The Ilia2003Mah/qwen2.5-1.5b-gsm8k-test-step500 model is a 1.5 billion parameter language model based on the Qwen2.5 architecture. This model is a fine-tuned version, specifically tested at step 500 on the GSM8K dataset, indicating a focus on mathematical reasoning and problem-solving capabilities. It is designed for tasks requiring numerical and logical inference, leveraging its compact size for efficient deployment in specialized applications.

Loading preview...