PhantomAjusshi/phi3-auditor-merged
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:4kPublished:Dec 8, 2025License:mitArchitecture:Transformer Open Weights Cold
PhantomAjusshi/phi3-auditor-merged is a 3.8 billion parameter Phi-3-mini-based model fine-tuned for auditing clinical AI models. It takes a JSON object of ML performance metrics and outputs a structured health classification label and a detailed explanation. This model specializes in identifying issues like drift, calibration failure, and class imbalance in deployed clinical AI systems.
Loading preview...
Overview
PhantomAjusshi/phi3-auditor-merged is a specialized language model built upon the Microsoft Phi-3-mini-4k-instruct base, fine-tuned using LoRA (Low-Rank Adaptation) via PEFT. With approximately 3.8 billion parameters, this model is designed to analyze machine learning performance metrics for clinical AI models.
Key Capabilities
- Clinical AI Model Auditing: Processes JSON reports of ML performance metrics (e.g., AUC, ECE, drift, label shift).
- Health Classification: Assigns a specific category label to the clinical model's health (e.g.,
Calibration Failure,Major Drift,Healthy). - Detailed Explanations: Generates concise explanations with observations and recommendations for identified issues.
- Specialized Training: Fine-tuned on a custom synthetic clinical audit dataset of 5,000 labeled samples, covering various health categories.
Intended Use Cases
- ML Engineers: Monitoring deployed clinical models for performance degradation.
- Healthcare Data Scientists: Conducting periodic audits of model health and performance.
- Researchers: Studying automated assessment of clinical model health.
Limitations
- Trained on synthetic data, which may not fully represent real-world clinical metric distributions.
- Does not perform temporal reasoning; analyzes single snapshots of metrics.
- Not a substitute for expert review or direct clinical decision-making.