od2961/Qwen2.5-1.5B-OpenR1-GRAIL
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Aug 20, 2025Architecture:Transformer Warm

od2961/Qwen2.5-1.5B-OpenR1-GRAIL is a 1.5 billion parameter language model, fine-tuned from Qwen/Qwen2.5-1.5B-Instruct. It was trained using the TRL framework on the od2961/grail-wage dataset, incorporating the GRPO method for enhanced mathematical reasoning. This model is specialized for tasks requiring advanced mathematical problem-solving capabilities.

Loading preview...