Ujjwal-Tyagi/Baichuan-M2-32B

Text generation · Concurrency cost: 2 · Model size: 32.8B · Quant: FP8 · Context length: 32k · Published: Mar 30, 2026 · License: apache-2.0 · Architecture: Transformer · Open weights

Baichuan-M2-32B is a 32.8 billion parameter medical-enhanced reasoning model developed by Baichuan AI, built upon Qwen2.5-32B. It features an innovative Large Verifier System and domain-specific fine-tuning on real-world medical questions. This model excels in medical reasoning tasks, achieving leading performance among open-source models on HealthBench, while maintaining strong general capabilities and supporting efficient 4-bit quantization for deployment.


Baichuan-M2-32B: A Leading Medical-Enhanced Reasoning Model

Baichuan-M2-32B, developed by Baichuan AI, is a 32.8 billion parameter model designed specifically for medical reasoning tasks. Built on Qwen2.5-32B, it incorporates a novel Large Verifier System that combines patient simulators with multi-dimensional verification mechanisms to improve medical accuracy and interaction quality. Training pairs medical domain adaptation via Mid-Training with a multi-stage reinforcement learning strategy that progressively strengthens medical knowledge, reasoning, and patient-interaction capabilities.

Key Capabilities & Features

  • World's Leading Open-Source Medical Model: Achieves top performance on HealthBench, outperforming other open-source and many proprietary models, with medical capabilities approaching GPT-5.
  • Doctor-Thinking Alignment: Trained on real clinical cases and patient simulators, demonstrating clinical diagnostic thinking and robust patient interaction.
  • Efficient Deployment: Supports 4-bit quantization, enabling deployment on a single RTX 4090, while the MTP (multi-token prediction) version delivers 58.5% higher token throughput in single-user scenarios.
  • Technical Innovations: Features a Large Verifier System with dynamic scoring, medical domain adaptation via Mid-Training, and multi-stage reinforcement learning.
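A rough sanity check on the single-RTX 4090 claim above: at 4-bit precision each parameter occupies about half a byte, so the weights alone come to roughly 16.4 GB, comfortably under the card's 24 GB of VRAM. This is only a lower bound; KV cache, activations, and quantization overhead add to the real footprint.

```python
# Back-of-envelope VRAM estimate for 4-bit deployment.
# Assumption: ~0.5 bytes per parameter for 4-bit weights; KV cache,
# activations, and quantization overhead are ignored here.

PARAMS = 32.8e9          # parameter count from the model card
BYTES_PER_PARAM = 0.5    # 4 bits = 0.5 bytes

weights_gb = PARAMS * BYTES_PER_PARAM / 1e9
print(f"Approx. weight memory at 4-bit: {weights_gb:.1f} GB")  # -> 16.4 GB

RTX_4090_VRAM_GB = 24
print("Weights fit on a single RTX 4090:", weights_gb < RTX_4090_VRAM_GB)
```

The same arithmetic explains why the unquantized model (about 65 GB at FP16) needs multi-GPU or datacenter hardware, while 4-bit quantization brings it within consumer range.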

Performance Highlights

Baichuan-M2-32B leads open-source models on medical benchmarks, scoring 60.1 on HealthBench, while retaining strong general capabilities: 83.4 on AIME24 and 45.8 on Arena-Hard-v2.0.

Intended Use Cases

This model is suitable for medical education, health consultation, and clinical decision support, intended for research and reference under the guidance of medical professionals.