FineMedLM: An 8B Medical Chat LLM

FineMedLM is an 8 billion parameter medical chat Large Language Model (LLM) developed by Hongzhou Yu, Tianhao Cheng, Ying Cheng, and Rui Feng. It is built upon the Llama-3.1-8B-Instruct architecture and is specifically designed for medical applications.

Key Capabilities

Medical Chat: Optimized for engaging in professional medical conversations.
Deep Reasoning: Enhanced through DPO (Direct Preference Optimization) to acquire advanced reasoning abilities in medical scenarios.
Supervised Fine-Tuning (SFT): Trained on a carefully curated synthetic dataset to specialize in medical knowledge and dialogue.

What Makes It Different

FineMedLM distinguishes itself through its dedicated focus on the medical domain. Unlike general-purpose LLMs, it undergoes specialized supervised fine-tuning and DPO using medical-specific data, leading to a model that can provide more accurate and contextually relevant responses to medical queries. The model's development is detailed in the paper "FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training" (arXiv:2501.09213).

Good For

Medical Question Answering: Providing detailed and professional answers to health-related questions.
Clinical Decision Support: Assisting healthcare professionals with information retrieval and reasoning.
Patient Education: Generating clear and understandable explanations of medical conditions and treatments.

Users should utilize the provided system prompt for optimal inference results, ensuring the model acts as a "helpful professional doctor."

Overview

FineMedLM: An 8B Medical Chat LLM

Key Capabilities

What Makes It Different

Good For

Full Model Card (README)