Ilia2003Mah/qwen2.5_1.5b-gsm8k-test-step500
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Mar 16, 2026Architecture:Transformer Loading
The Ilia2003Mah/qwen2.5_1.5b-gsm8k-test-step500 is a 1.5 billion parameter language model, likely based on the Qwen2.5 architecture, with a context length of 32768 tokens. This model appears to be an experimental or test version, potentially fine-tuned or evaluated on the GSM8K dataset, which focuses on mathematical reasoning and problem-solving. Its primary differentiator is its compact size combined with a large context window, making it suitable for tasks requiring processing extensive input while maintaining efficiency.
Loading preview...