AQ-MedAI/MedResearcher-R1-32B

TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Aug 19, 2025License:mitArchitecture:Transformer0.0K Open Weights Cold

MedResearcher-R1-32B by AQ-MedAI is a specialized 32 billion parameter medical deep research agent designed to overcome limitations of general-purpose LLMs in complex medical queries. It utilizes a novel knowledge-informed trajectory synthesis framework, including medical knowledge graphs for generating multi-hop QA pairs and a custom medical retrieval engine. This model achieves state-of-the-art performance on the MedBrowseComp benchmark, surpassing larger proprietary systems in medical information synthesis and clinical reasoning.

Loading preview...

MedResearcher-R1-32B: Expert Medical Deep Research Agent

MedResearcher-R1-32B, developed by AQ-MedAI, is a 32 billion parameter model specifically engineered to excel in complex medical information-seeking and synthesis tasks. It addresses the shortcomings of general-purpose deep research agents in the medical domain through two primary innovations: a novel data synthesis framework leveraging medical knowledge graphs to generate complex multi-hop QA pairs, and the integration of a custom-built private medical retrieval engine.

Key Capabilities

  • Knowledge-Informed Trajectory Synthesis: Utilizes a unique framework for generating high-quality training data, including knowledge graph construction, trajectory generation, and evaluation pipelines.
  • Advanced Medical Reasoning: Extracts longest chains from subgraphs around rare medical entities to create complex multi-hop QA pairs, enhancing clinical reasoning.
  • Specialized Tool Integration: Incorporates a custom medical retrieval engine alongside general-purpose tools for accurate medical information synthesis.
  • Competitive Performance: Achieves 53.4 on GAIA and 54 on xBench, comparable to GPT-4o-mini, and sets a new state-of-the-art with 27.5% accuracy on MedBrowseComp, outperforming larger closed-source systems.
  • Open-Sourced Dataset: Provides a high-quality QA dataset with complex reasoning question-answer pairs and detailed step-by-step reasoning paths.

Good For

  • Medical Deep Research: Ideal for applications requiring expert-level medical information retrieval, synthesis, and complex query resolution.
  • Clinical Reasoning: Excels in tasks demanding dense medical knowledge and multi-hop reasoning for clinical scenarios.
  • Domain-Specific AI Development: Demonstrates how strategic domain-specific innovations can enable smaller open-source models to surpass larger proprietary systems in specialized fields.