Overview

MedGemma-4B ECGInstruct is a 4.3 billion parameter vision-language model developed by ConvAI Innovations, fine-tuned from Google's MedGemma-4B-it. It is specifically designed for automated interpretation of 12-lead ECG images, leveraging the extensive PULSE-ECG/ECGInstruct dataset which contains 1.15 million ECG image-text pairs. The model was trained for 72 hours on 8x NVIDIA A100 GPUs, achieving a final token accuracy of 86.83%.

Key Capabilities

Interprets 12-lead ECG images: Analyzes visual ECG data to extract clinical insights.
Identifies cardiac abnormalities: Detects arrhythmias, ischemia, hypertrophy, and conduction blocks.
Generates detailed clinical reports: Provides comprehensive interpretations based on ECG findings.
Answers specific questions: Responds to queries about ECG findings, diagnosis suggestions, heart rate, and rhythm.

Intended Use

This model is suitable for:

Research in medical AI and computer vision: A valuable tool for academic and developmental purposes.
Educational demonstrations: Useful for teaching and learning ECG interpretation.
Development of clinical decision support prototypes: Can serve as a component in early-stage medical software development.

Limitations

Primarily trained on adult ECG data; performance on pediatric ECGs may vary.
NOT validated for clinical use and should not replace professional medical diagnosis. It is for research and educational purposes only.

Overview

Overview

Key Capabilities

Intended Use

Limitations

Full Model Card (README)