cs-552-2026-thinkinsidethebox/math_model
TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kPublished:May 7, 2026Architecture:Transformer Warm
The cs-552-2026-thinkinsidethebox/math_model is a specialized language model developed by cs-552-2026-thinkinsidethebox, fine-tuned for advanced mathematical reasoning. This model demonstrates strong performance on the MATH-500 benchmark, achieving a pass@8 score of 0.9300 and a pass@1 score of 0.7845. It is specifically optimized for solving complex math problems, including those involving diagrams and requiring detailed reasoning steps.
Loading preview...
Overview
This model, developed by cs-552-2026-thinkinsidethebox, is a highly specialized language model designed for advanced mathematical problem-solving. It is a continuation of the previous best OMR-v2 math model, further refined through Supervised Fine-Tuning (SFT).
Key Capabilities
- Mathematical Reasoning: Achieves a pass@8 score of 0.9300 and a pass@1 score of 0.7845 on the local MATH-500 evaluation, indicating strong problem-solving abilities.
- Diagram Interpretation: Enhanced with ASY / diagram-only short-medium OpenMathReasoning repair data, improving its capacity to handle problems with visual components.
- Detailed Explanations: While trained with a "no_think" template, the final exported model utilizes a "thinking" template, suggesting its ability to generate detailed reasoning steps.
- High Extraction Rate: Boasts a boxed_extraction_rate of 0.9305, indicating effective identification of final answers within its responses.
Good For
- Complex Math Problems: Ideal for applications requiring accurate solutions to challenging mathematical questions.
- Educational Tools: Can be integrated into systems that provide step-by-step solutions or explanations for math problems.
- Research in Mathematical AI: Useful for exploring and developing advanced mathematical reasoning capabilities in LLMs.