Overview
Overview
This model, small-models-for-glam/Qwen3-0.6B-SFT-name-parser-yaml, is a specialized fine-tuned version of the Qwen3-0.6B base model. Developed by small-models-for-glam, it is specifically designed to parse person names from cultural heritage contexts (libraries, archives, museums) into a structured YAML format. The model was trained using Supervised Fine-Tuning (SFT) with Hugging Face's TRL library.
Key Capabilities
- Structured Name Parsing: Extracts
first_name,last_name,middle_names,temporalinformation (birth/death/flourished dates),titles, andextra_infointo a consistent YAML output. - Diverse Name Pattern Support: Handles a wide variety of name formats, including basic, complex (e.g.,
Baron William Henry Ashe A'Court Heytesbury, c. 1809-1891), mononyms, initials, diacritics, and non-Western names (e.g., Chinese). - Multi-cultural Training: Trained on synthetic data covering names from English, French, German, Italian, Spanish, Dutch, Arabic, and Chinese contexts, along with diverse temporal data and titles.
- Robust Performance: Demonstrates strong performance in identifying and structuring diverse international name formats, temporal data, titles, and complex surname patterns.
Intended Use Cases
This model is primarily intended for applications within cultural heritage domains:
- Digital Humanities: Processing historical person names found in manuscripts and documents.
- Library Science: Cataloging and standardizing author names in bibliographic records.
- Archive Management: Structuring person names within archival finding aids.
- Museum Collections: Organizing creator and subject names in cultural heritage databases.
Limitations
- Primarily trained on Western and East Asian name patterns; may struggle with very rare or highly specialized naming conventions.
- Temporal date parsing assumes Gregorian calendar years and has limited support for ancient or historical dating systems (e.g., BCE, regnal years).