kazuyamaa/Qwen3-8B-Math-GRPO
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Oct 19, 2025License:apache-2.0Architecture:Transformer Open Weights Cold
The kazuyamaa/Qwen3-8B-Math-GRPO is an 8 billion parameter Qwen3 model developed by kazuyamaa, fine-tuned for mathematical tasks. It was trained using Unsloth and Huggingface's TRL library, offering efficient performance. This model is designed for applications requiring strong mathematical reasoning capabilities.
Loading preview...