vmal/med-advisor-4b
vmal/med-advisor-4b is a 4-billion parameter chat model built on Qwen/Qwen3-4B-Base, specifically fine-tuned for medical and scientific education. It excels at explaining complex concepts, adapting explanations for diverse audiences, and maintaining strict policy boundaries against clinical advice. This model is optimized for general educational purposes, not for diagnosis, treatment, or personal medical decision-making.
Loading preview...
Overview
vmal/med-advisor-4b is a 4-billion parameter chat model derived from Qwen/Qwen3-4B-Base, meticulously fine-tuned for medical and scientific education. Its core design focuses on providing clear, adaptable explanations of medical and scientific concepts while strictly adhering to policy boundaries, preventing it from offering clinical advice.
Key Capabilities
- Concept Explanation: Explains medical and scientific topics in plain language.
- Audience Adaptation: Tailors explanations for various audiences, including patients, students, caregivers, and healthcare professionals.
- Policy Adherence: Answers educational questions while maintaining strict boundaries against diagnosis, treatment planning, or personal medical interpretation.
- Refusal Sharpness: More effectively refuses high-risk requests compared to earlier versions.
Training and Performance
The model underwent a three-phase training process, culminating in Phase 3 with full-model DPO for improved refusal sharpness, redirect correctness, and emergency escalation. Evaluations show a significant reduction in boundary violations (from 27.31% in Qwen3-4B-Instruct to 3.85%) and mode incorrectness, making it much safer for medical education than off-the-shelf base or instruct models. While excelling in depth, audience adaptation, and structure, minor regressions in warmth and multi-turn consistency were noted.
Intended Use Cases
This model is ideal for:
- General Medical Education: Providing accessible information on medical and scientific topics.
- Patient Education: Helping patients and their families understand health conditions.
- Student Learning: Supporting medical and scientific students with conceptual explanations.
Limitations and Safety
It is crucial to understand that med-advisor-4b is an educational tool, not a clinical system. It should not be used for diagnosis, treatment, medication dosing, or interpreting personal medical data. Like all language models, it can generate inaccurate or misleading information. Recommended decoding settings (do_sample=False, repetition_penalty=1.10-1.15, no_repeat_ngram_size=6) and a clear system prompt are advised for safer outputs.