arcee-ai/Meraj-Mini

TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Oct 6, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Arcee Meraj Mini is an open-source, 7 billion parameter instruction-tuned causal language model developed by arcee-ai, fine-tuned from Qwen2.5-7B-Instruct. It is meticulously designed for strong performance in both Arabic and English, excelling in Arabic language understanding and cultural adaptation while maintaining competitive English capabilities. This model is optimized for bilingual tasks, including content creation, customer service, education, mathematics, and coding in Arabic contexts.

Loading preview...

Overview

arcee-ai/Meraj-Mini is an open-source, 7 billion parameter instruction-tuned language model, fine-tuned from Qwen2.5-7B-Instruct. It is specifically developed to enhance Arabic language capabilities while maintaining robust English proficiency. The model underwent a rigorous development process including data preparation with translated English datasets, iterative training, and evaluation of 15 different variants to achieve optimal bilingual performance.

Key Capabilities

  • Arabic Language Understanding: Excels in general comprehension, reading comprehension, and common-sense reasoning tailored for Arabic.
  • Cultural Adaptation: Generates content incorporating Arabic cultural nuances.
  • Bilingual Proficiency: Demonstrates top-tier performance in Arabic benchmarks and competitive results in English.
  • Mathematics and Coding: Supports mathematical reasoning and code generation in Arabic.

Benchmarks and Performance

Arcee Meraj Mini consistently outperforms state-of-the-art models on the Open Arabic LLM Leaderboard (OALL) and shows superior performance on Translated MMLU. Its English performance is comparable to leading models, indicating effective retention of English language knowledge while specializing in Arabic.

Good For

  • Developing advanced Arabic-speaking chatbots and virtual assistants.
  • Generating high-quality, culturally relevant Arabic content for various needs.
  • Creating personalized educational experiences for Arabic speakers.
  • Applications requiring mathematical reasoning and code generation in Arabic.