hongzhouyu/FineMedLM

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jan 23, 2025License:mitArchitecture:Transformer Open Weights Cold

FineMedLM is an 8 billion parameter medical chat Large Language Model developed by Hongzhou Yu and his team, based on the Llama-3.1-8B-Instruct architecture. It is specifically fine-tuned using meticulously crafted synthetic data and further enhanced with DPO for deep reasoning capabilities in medical contexts. This model excels at answering medical questions and providing professional medical advice, making it suitable for healthcare-related AI applications.

Loading preview...

FineMedLM: An 8B Medical Chat LLM

FineMedLM is an 8 billion parameter medical chat Large Language Model (LLM) developed by Hongzhou Yu, Tianhao Cheng, Ying Cheng, and Rui Feng. It is built upon the Llama-3.1-8B-Instruct architecture and is specifically designed for medical applications.

Key Capabilities

  • Medical Chat: Optimized for engaging in professional medical conversations.
  • Deep Reasoning: Enhanced through DPO (Direct Preference Optimization) to acquire advanced reasoning abilities in medical scenarios.
  • Supervised Fine-Tuning (SFT): Trained on a carefully curated synthetic dataset to specialize in medical knowledge and dialogue.

What Makes It Different

FineMedLM distinguishes itself through its dedicated focus on the medical domain. Unlike general-purpose LLMs, it undergoes specialized supervised fine-tuning and DPO using medical-specific data, leading to a model that can provide more accurate and contextually relevant responses to medical queries. The model's development is detailed in the paper "FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training" (arXiv:2501.09213).

Good For

  • Medical Question Answering: Providing detailed and professional answers to health-related questions.
  • Clinical Decision Support: Assisting healthcare professionals with information retrieval and reasoning.
  • Patient Education: Generating clear and understandable explanations of medical conditions and treatments.

Users should utilize the provided system prompt for optimal inference results, ensuring the model acts as a "helpful professional doctor."