Confucius-o1-14B: Specialized Reasoning for Education

Confucius-o1-14B is a 14.8 billion parameter model from the NetEase Youdao Team, built upon the Qwen2.5-14B-Instruct base. Its core innovation lies in its o1-like reasoning capabilities and a unique output format that provides both a detailed chain of thought and a concise, step-by-step problem-solving summary.

Key Capabilities & Features

Two-Stage Learning: Utilizes a teacher model for initial learning of o1 thinking patterns, followed by self-iterative learning to enhance reasoning.
Summarized Problem-Solving: Generates a <thinking> block for detailed reasoning and a <summary> block for a 3-5 step problem-solving process, making complex solutions accessible.
Educational Focus: Specifically optimized for K12 mathematics scenarios, with training data sampled from non-graphical math problems.
Efficient Deployment: Designed to run on a single GPU without quantization, making it accessible for various applications.
Rigorous Data Filtering: Employs automated evaluation to ensure the correctness of both the final answer and the explanation process, preventing false positives.

Use Cases & Limitations

This model is particularly well-suited for educational applications requiring structured mathematical problem-solving explanations. Its primary strength is in K12 mathematics, where it can provide clear, step-by-step solutions. However, users should be aware of its limitations:

Scenario Specificity: Performance is verified only in K12 mathematics; quality in other domains is not guaranteed.
Language Tendencies: The summary block may lean towards Chinese content, and the thinking block might exhibit mixed language use.
Potential for Invalid Results: The model can occasionally fall into circular reasoning, leading to unparseable outputs.
No Safety Alignment: The model has not undergone specific safety and ethical alignment testing.

Overview

Confucius-o1-14B: Specialized Reasoning for Education

Key Capabilities & Features

Use Cases & Limitations

Full Model Card (README)