hammh0a/Hala-700M
Hala-700M is a 0.7 billion parameter Arabic-centric instruction and translation model developed by Hasan Abed Al Kader Hammoud, Mohammad Zbeeb, and Bernard Ghanem at King Abdullah University of Science and Technology (KAUST). This model is specifically optimized for Arabic language tasks, demonstrating strong performance across various Arabic benchmarks. It is designed for applications requiring high-quality Arabic instruction following and translation capabilities.
Loading preview...
Hala-700M: Arabic-Centric Instruction & Translation Model
Hala-700M is a 0.7 billion parameter model developed by researchers at King Abdullah University of Science and Technology (KAUST), specifically designed for Arabic language processing. The name "Hala" (حلا) signifies sweetness and beauty in Arabic, reflecting the model's focus on the language.
Key Capabilities & Differentiators
- Arabic Optimization: Hala-700M is built as an Arabic-centric model, excelling in instruction following and translation tasks within the Arabic language.
- Competitive Performance: In evaluations against other models in the ≤2B parameter category, Hala-700M achieves an average score of 46.9 across a suite of Arabic benchmarks including AlGhafa, ArabicMMLU, EXAMS, MadinahQA, AraTrust, and ArbMMLU-HT. This places it competitively among models of similar size, outperforming several larger models like LiquidAI/LFM2-700M and google/gemma-2-2b-it.
- Context Length: The model supports a substantial context length of 32768 tokens, enabling it to handle longer Arabic texts and complex instructions.
Good For
- Arabic Instruction Following: Ideal for applications requiring the model to understand and execute instructions provided in Arabic.
- Arabic Translation: Suitable for tasks involving translation to and from Arabic, leveraging its specialized training.
- Resource-Efficient Arabic NLP: Offers strong performance for Arabic natural language processing tasks within a smaller parameter count, making it efficient for deployment.