OpenLLM-Ro/RoLlama3.1-8b-Instruct-2024-10-09
The OpenLLM-Ro/RoLlama3.1-8b-Instruct-2024-10-09 model is an 8 billion parameter instruction-tuned generative text model developed by OpenLLM-Ro, specialized for the Romanian language. Built upon Meta Llama 3.1, it represents the first open-source effort to create a large language model specifically for Romanian. This model excels in assistant-like chat and various natural language tasks within Romanian, demonstrating improved performance on several Romanian-specific benchmarks compared to its base Llama-3.1-8B-Instruct counterpart.
Loading preview...
RoLlama3.1-8b-Instruct-2024-10-09: Romanian Language Model
This model is an 8 billion parameter instruction-tuned variant from the OpenLLM-Ro project, the first open-source initiative to develop large language models specialized for Romanian. It is built on the Meta Llama 3.1 architecture and fine-tuned using a diverse collection of Romanian instruction datasets, including RoAlpaca, RoDolly, and RoUltraChat.
Key Capabilities and Performance
- Romanian Specialization: Designed specifically for the Romanian language, offering enhanced performance on Romanian NLP tasks.
- Instruction Following: Fine-tuned for assistant-like chat and general instruction following in Romanian.
- Benchmark Performance: Demonstrates competitive and often superior performance on Romanian benchmarks compared to the original Llama-3.1-8B-Instruct model. For instance, it achieves an average score of 53.03 on academic benchmarks, surpassing Llama-3.1-8B-Instruct's 49.87, with notable improvements in ARC (47.69 vs 42.86) and GSM8k (44.30 vs 35.56).
- Downstream Task Adaptability: Shows strong results in few-shot and finetuned scenarios for tasks like LaRoSeDa (Multiclass Macro F1: 87.53 finetuned) and STS (Pearson: 87.05 finetuned).
Intended Use Cases
- Research: Ideal for research in Romanian natural language processing.
- Assistant-like Chat: Suited for conversational AI applications requiring interaction in Romanian.
- Natural Language Tasks: Adaptable for various Romanian NLP tasks where a specialized language model is beneficial.