NorMistral-11b-translate: Specialized Nordic Language Translation
NorMistral-11b-translate is a 12 billion parameter language model developed by norallm, specifically fine-tuned for high-quality machine translation. Building upon the NorMistral-11b-long architecture, this model is designed to handle translation tasks involving Norwegian Bokmål, Nynorsk, and English.
Key Capabilities
- Bidirectional Translation: Capable of translating in all six directions between Norwegian Bokmål, Nynorsk, and English.
- Specialized Fine-tuning: Optimized for the nuances of these specific languages, leveraging a dedicated translation dataset.
- Context Length: Features a 32768 token context window, allowing for the translation of longer sentences and documents while maintaining coherence.
- Apache 2.0 License: Released under a permissive license, enabling broad use and integration.
Training Data
The model was fine-tuned using the comprehensive ltg/nob-nno-eng-translation-pairs dataset, ensuring robust performance on its target languages.
Good For
- Applications requiring accurate translation between Norwegian Bokmål, Nynorsk, and English.
- Processing and translating documents or conversations in these specific language pairs.
- Developers and researchers working with Nordic languages who need a specialized translation model.