Name: PeterPaker123/Qwen2.5-7B-Vietnamese-Medical-NER-GRPO API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: PeterPaker123

Overview

This model, developed by PeterPaker123, is a specialized generative Large Language Model built upon the Qwen2.5-7B architecture. Unlike traditional token-classification models, it functions as an intelligent extraction agent for Vietnamese Medical Named Entity Recognition (NER), outputting structured JSON lists from medical texts.

Key Capabilities

Dynamic Entity Extraction: Supports prompt-defined entity types, allowing users to specify any medical or clinical entity for extraction, such as SYMPTOM_AND_DISEASE, MEDICAL_PROCEDURE, and DRUG.
Structured Output: Delivers extracted entities in a strict JSON format, making it ideal for downstream processing.
Advanced Fine-tuning: Utilizes a two-stage alignment pipeline including Supervised Fine-Tuning (SFT) and Group Relative Policy Optimization (GRPO) with Reinforcement Learning (RL) to achieve high precision, recall, and type accuracy, significantly reducing hallucinations.
Vietnamese Language Focus: Exclusively trained and optimized for the Vietnamese language.

Good For

Medical Chatbot Enrichment: Enhancing RAG pipelines for healthcare applications.
Automated Summarization: Summarizing telehealth transcripts by extracting key medical information.
Clinical Knowledge Graph Construction: Building structured knowledge bases from unstructured medical text.

Limitations

Not for Diagnosis: Strictly an extraction tool; it does not provide medical advice or diagnoses.
No General Conversation: Aggressively aligned for JSON output, performing poorly on general conversational tasks.
Specific Entity Focus: Trained to ignore demographic data (Names, Ages, Locations) and will likely yield empty lists for such requests.
Hallucination Residuals: While reduced, generative LLMs may still occasionally hallucinate or alter extracted text.

Overview

Overview

Key Capabilities

Good For

Limitations

Full Model Card (README)