Jarvis1111/DoctorAgent-RL

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Jun 16, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

Jarvis1111/DoctorAgent-RL is a 7.6 billion parameter reinforcement learning-based multi-agent system designed for multi-turn clinical dialogue. This model, detailed in the paper "DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue," optimizes medical consultations by enabling adaptive information gathering and clinical reasoning alignment. It excels at developing dynamic interaction strategies between doctor and patient agents, leading to improved diagnostic accuracy and interaction efficiency in medical dialogue.

Loading preview...

DoctorAgent-RL: Multi-Agent Clinical Dialogue System

DoctorAgent-RL is a 7.6 billion parameter reinforcement learning (RL)-based multi-agent framework developed by Jarvis1111, specifically engineered to enhance multi-turn clinical dialogue. It addresses the limitations of static dialogue systems by modeling medical consultations as dynamic decision-making processes under uncertainty. The system features distinct doctor and patient agents that collaborate through continuous interactions, with their strategies optimized via RL.

Key Capabilities

  • Multi-Agent Collaboration: Utilizes separate doctor and patient agents with defined roles and objectives to simulate realistic medical consultations.
  • Dynamic Strategy Optimization: Employs reinforcement learning to adapt dialogue paths and interaction strategies based on patient responses and medical logic.
  • Comprehensive Reward Design: Guides optimal consultation strategies through multi-dimensional evaluation metrics.
  • Medical Knowledge Integration: Embeds clinical reasoning directly into the decision-making processes of the agents.
  • MTMedDialog Dataset: Leverages a specialized English multi-turn medical consultation dataset designed for simulation capabilities.

Good For

  • Simulating Medical Consultations: Ideal for research and development of AI systems that can conduct adaptive, multi-turn clinical dialogues.
  • Improving Diagnostic Accuracy: Designed to enhance the precision of medical diagnoses through intelligent information gathering.
  • Developing Adaptive Dialogue Systems: Useful for creating systems that can adjust their interaction strategies dynamically in uncertain environments.
  • Research in Reinforcement Learning for Healthcare: Provides a framework for exploring RL applications in complex medical scenarios.