TitleOS/Phi-4-mini-reasoning-heretic
TitleOS/Phi-4-mini-reasoning-heretic is a 3.8 billion parameter, 32K context length decoder-only Transformer model, based on Microsoft's Phi-4-mini-reasoning. This version is a 'decensored' variant, optimized for multi-step, logic-intensive mathematical problem-solving tasks. It excels in formal proof generation, symbolic computation, and advanced word problems, particularly in memory/compute constrained environments.
Loading preview...
What is TitleOS/Phi-4-mini-reasoning-heretic?
This model is a 'decensored' version of Microsoft's Phi-4-mini-reasoning, created using the Heretic v1.2.0 tool. It is a compact 3.8 billion parameter, decoder-only Transformer model with a 32K token context length, specifically fine-tuned for advanced mathematical reasoning. The original Phi-4-mini-reasoning was built upon synthetic data with a focus on high-quality, reasoning-dense content, and this 'heretic' variant aims to provide an uncensored experience.
Key Capabilities
- Advanced Mathematical Reasoning: Designed for multi-step, logic-intensive mathematical problem-solving.
- Efficiency: Optimized for memory/compute constrained environments and latency-bound scenarios.
- Diverse Math Problems: Excels in formal proof generation, symbolic computation, and advanced word problems.
- Decensored Output: Modified to reduce refusals compared to the original model, with 59/100 refusals versus 62/100.
Performance Highlights
Despite its compact size, the Phi-4-mini-reasoning model (on which this variant is based) demonstrates strong performance in mathematical benchmarks, achieving 57.5 on AIME, 94.6 on MATH-500, and 52.0 on GPQA Diamond. It aims to balance reasoning ability with efficiency, making it suitable for educational applications and lightweight deployment.
Good For
- Applications requiring robust mathematical problem-solving.
- Edge devices or mobile systems where computational resources are limited.
- Educational tools for tutoring and complex math assistance.
- Use cases where an uncensored response to reasoning tasks is preferred.