netease-youdao/Confucius-o1-14B
Confucius-o1-14B is a 14.8 billion parameter reasoning model developed by the NetEase Youdao Team, based on the Qwen2.5-14B-Instruct architecture. It employs a two-stage learning strategy to achieve o1-like thinking abilities, specifically optimized for generating a summarized step-by-step problem-solving process from its chain of thought. This model is designed for educational applications, particularly in K12 mathematics, and can be deployed on a single GPU without quantization, supporting a 131072 token context length.
Loading preview...
Confucius-o1-14B: Specialized Reasoning for Education
Confucius-o1-14B is a 14.8 billion parameter model from the NetEase Youdao Team, built upon the Qwen2.5-14B-Instruct base. Its core innovation lies in its o1-like reasoning capabilities and a unique output format that provides both a detailed chain of thought and a concise, step-by-step problem-solving summary.
Key Capabilities & Features
- Two-Stage Learning: Utilizes a teacher model for initial learning of o1 thinking patterns, followed by self-iterative learning to enhance reasoning.
- Summarized Problem-Solving: Generates a
<thinking>block for detailed reasoning and a<summary>block for a 3-5 step problem-solving process, making complex solutions accessible. - Educational Focus: Specifically optimized for K12 mathematics scenarios, with training data sampled from non-graphical math problems.
- Efficient Deployment: Designed to run on a single GPU without quantization, making it accessible for various applications.
- Rigorous Data Filtering: Employs automated evaluation to ensure the correctness of both the final answer and the explanation process, preventing false positives.
Use Cases & Limitations
This model is particularly well-suited for educational applications requiring structured mathematical problem-solving explanations. Its primary strength is in K12 mathematics, where it can provide clear, step-by-step solutions. However, users should be aware of its limitations:
- Scenario Specificity: Performance is verified only in K12 mathematics; quality in other domains is not guaranteed.
- Language Tendencies: The
summaryblock may lean towards Chinese content, and thethinkingblock might exhibit mixed language use. - Potential for Invalid Results: The model can occasionally fall into circular reasoning, leading to unparseable outputs.
- No Safety Alignment: The model has not undergone specific safety and ethical alignment testing.