sunblaze-ucb/Qwen3-14B-Intuitor-MATH-1EPOCH
TEXT GENERATIONConcurrency Cost:1Model Size:14BQuant:FP8Ctx Length:32kLicense:apache-2.0Architecture:Transformer Open Weights Cold

The sunblaze-ucb/Qwen3-14B-Intuitor-MATH-1EPOCH model is a 14 billion parameter Qwen3-based large language model fine-tuned using the Intuitor method on the MATH dataset. Developed by sunblaze-ucb, this model leverages Reinforcement Learning from Internal Feedback (RLIF) to learn reasoning skills using self-certainty as the sole reward, without external supervision. It is specifically optimized for mathematical problem-solving and reasoning tasks, offering a scalable approach for domains where labeled data is scarce. The model supports a context length of 32768 tokens.

Loading preview...