ryzax/1.5B-v18
Text Generation | Concurrency Cost: 1 | Model Size: 2B | Quant: BF16 | Ctx Length: 32k | Published: May 24, 2025 | Architecture: Transformer | Cold

The ryzax/1.5B-v18 model is a 2 billion parameter language model fine-tuned from ryzax/qwen3_1.7B_sft_correct_v3_1e-5_4. It was trained with the TRL framework on the agentica-org/DeepScaleR-Preview-Dataset using GRPO (Group Relative Policy Optimization), a reinforcement learning method applied here to improve mathematical reasoning. The model is intended for tasks that require multi-step mathematical problem solving.
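
To make the GRPO step more concrete, here is a minimal sketch of the group-relative advantage that gives the method its name: several answers are sampled for one prompt, each is scored, and every score is normalized against the mean and spread of its own group. This is an illustration only, not the training code used for this model. The function name `group_relative_advantages`, the `eps` value, the use of the population standard deviation, and the example rewards are all assumptions made for the sketch.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-4):
    """Normalize each reward against its own group's mean and spread.

    `rewards` holds one scalar score per sampled answer to the same prompt.
    `eps` is an assumed small constant that avoids division by zero when
    every answer in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four sampled answers to one math prompt
# (1.0 = correct final answer, 0.0 = incorrect).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
```

Answers that score above their group's average receive a positive advantage and are reinforced, while below-average answers receive a negative one. When every answer in a group gets the same reward, all advantages are zero and that prompt contributes no learning signal.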
