Hala-9B: An Arabic-Centric Instruction and Translation Model
Hala-9B is a 9 billion parameter language model developed by researchers at King Abdullah University of Science and Technology (KAUST). Part of the broader Hala model family, this model is specifically designed and optimized for Arabic language understanding, instruction following, and translation tasks. The name "Hala" (حلا) signifies sweetness and beauty in Arabic, reflecting the model's focus on the language.
Key Capabilities and Performance
Hala-9B demonstrates strong performance across a suite of Arabic benchmarks, often outperforming other models in its size category. Key evaluation metrics include:
- ArabicMMLU: Measures Arabic language understanding and reasoning.
- AlGhafa, EXAMS, MadinahQA: Specific Arabic question answering and knowledge benchmarks.
- AraTrust, ArbMMLU-HT: Benchmarks for trustworthiness and advanced Arabic MMLU tasks.
In comparative evaluations against other 7B-9B parameter models, Hala-9B achieved an average score of 69.9, positioning it as a leading model for Arabic language tasks in its class. The model utilizes a chat template for instruction following, as shown in the provided Python example.
Ideal Use Cases
Hala-9B is particularly well-suited for applications requiring:
- High-quality Arabic instruction following and response generation.
- Arabic text translation and summarization.
- Arabic-specific question answering and knowledge retrieval systems.
- Development of Arabic-centric chatbots and virtual assistants.