FuseAI/FuseO1-QwQ-DeepSeekR1-LightR1-32B

TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Mar 7, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

FuseAI/FuseO1-QwQ-DeepSeekR1-LightR1-32B is a 32.8 billion parameter language model developed by the FuseAI Team, created by merging Qwen/QwQ-32B, deepseek-ai/DeepSeek-R1-Distill-Qwen-32B, and qihoo360/Light-R1-32B using the SCE merging method. This model is specifically optimized for System-II reasoning capabilities, excelling in complex mathematical, coding, and scientific tasks. It demonstrates significant performance improvements on benchmarks like AIME24, LiveCodeBench, and GPQA-Diamond compared to its constituent models.

Loading preview...

FuseAI/FuseO1-QwQ-DeepSeekR1-LightR1-32B: Enhanced System-II Reasoning

This 32.8 billion parameter model, developed by the FuseAI Team, is a product of advanced model fusion techniques, specifically the SCE merging method. It integrates the strengths of Qwen/QwQ-32B, deepseek-ai/DeepSeek-R1-Distill-Qwen-32B, and qihoo360/Light-R1-32B to create a unified model with superior System-II reasoning abilities.

Key Capabilities & Differentiators

  • Optimized for Complex Reasoning: Designed to enhance reasoning in mathematics, coding, and science domains.
  • Superior Benchmark Performance: Achieves notable improvements on AIME24 (e.g., +1.4 Pass@1, +6.7 Cons@16 over Qwen/QwQ-32B), LiveCodeBench, and GPQA-Diamond.
  • Model Fusion: Utilizes the innovative SCE merging methodology to combine distinct knowledge and strengths from multiple LLMs.
  • Long-Context Reasoning: Benefits from the long-CoT reasoning capabilities of its merged components, supporting complex problem-solving.

When to Use This Model

  • Mathematical Problem Solving: Ideal for tasks requiring step-by-step mathematical reasoning, as evidenced by strong AIME24 and MATH500 scores.
  • Code Generation & Reasoning: Suitable for complex coding challenges, showing improved performance on LiveCodeBench.
  • Scientific Inquiry: Effective for scientific reasoning tasks, with enhanced scores on GPQA-Diamond and MMLU-Pro.
  • Applications Requiring Robust Reasoning: Choose this model when your use case demands high accuracy and consistency in complex, multi-step reasoning processes.