Dorna2-Llama3.1-8B-Instruct Overview

Dorna2-Llama3.1-8B-Instruct is an 8 billion parameter, decoder-only instruction-tuned model developed by Part AI. It is built on the Meta Llama 3.1 Instruct architecture and has been specifically fine-tuned using extensive Persian datasets. This specialization makes it particularly adept at handling Persian language tasks and understanding cultural nuances.

Key Capabilities & Performance

The model's primary strength lies in its proficiency with the Persian language. It has been evaluated across five specialized Persian tasks, including:

Part Multiple Choice: Common knowledge and reasoning.
ARC Easy & ARC Challenge: General knowledge and advanced reasoning.
MMLU Pro: Professional-level examinations.
AUT Multiple Choice Persian: Specialized Persian-language examination.

In comparative evaluations against other Llama 3.1-8B models, Dorna2-Llama3.1-8B-Instruct achieved an average accuracy of 50.72% across these Persian benchmarks. Notably, it scored highest in ARC Easy (79.59%), ARC Challenge (64.42%), and MMLU Pro (21.47%) among the compared models, demonstrating strong reasoning and knowledge recall in Persian contexts. The evaluation datasets comprise over 40,000 samples, ensuring a robust testing ground for Persian LLMs.

Ideal Use Cases

Persian Language Applications: Excellent for chatbots, content generation, and translation services targeting Persian speakers.
Research & Development: Suitable for researchers working on LLMs with a focus on low-resource or non-English languages, particularly Persian.
Educational Tools: Can be integrated into educational platforms requiring advanced Persian language comprehension and generation.

Overview

Dorna2-Llama3.1-8B-Instruct Overview

Key Capabilities & Performance

Ideal Use Cases

Full Model Card (README)