Ilia2003Mah/qwen2.5-1.5b-gsm8k-test-step1000
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Mar 18, 2026Architecture:Transformer Warm
The Ilia2003Mah/qwen2.5-1.5b-gsm8k-test-step1000 is a 1.5 billion parameter language model based on the Qwen2.5 architecture. This model is specifically designed for testing purposes, likely focusing on mathematical reasoning tasks as indicated by 'gsm8k-test'. Its compact size and specialized fine-tuning suggest it could be suitable for evaluating performance on specific arithmetic or problem-solving benchmarks.
Loading preview...