Keven16/Qwen3-4B-Non-Thinking-RL-Math-Step500
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 16, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

Keven16/Qwen3-4B-Non-Thinking-RL-Math-Step500 is a 4 billion parameter language model based on the Qwen3 architecture, developed by Keven16. This model features a 32768-token context length and is specifically fine-tuned for mathematical reasoning tasks. Its primary differentiator is its optimization for non-thinking reinforcement learning in mathematical problem-solving, making it suitable for applications requiring robust numerical and logical processing.

Loading preview...