OpenLLM-Ro/RoLlama2-7b-Instruct
OpenLLM-Ro/RoLlama2-7b-Instruct is a 7 billion parameter instruction-tuned generative text model developed by OpenLLM-Ro, specifically designed for the Romanian language. Fine-tuned from RoLlama2-7b-Base, this model excels in Romanian natural language tasks, outperforming Llama-2-7b-chat on various academic and downstream benchmarks, including a significant lead in Romanian MT-Bench and RoCulturaBench. It is intended for research use in Romanian, particularly for assistant-like chat applications.
Loading preview...
OpenLLM-Ro/RoLlama2-7b-Instruct: A Specialized Romanian LLM
OpenLLM-Ro/RoLlama2-7b-Instruct is a 7 billion parameter instruction-tuned generative text model, part of the RoLlama2 family developed by OpenLLM-Ro. This model represents the first open-source effort to create a Large Language Model specifically optimized for the Romanian language.
Key Capabilities and Features
- Romanian Language Specialization: Developed and fine-tuned exclusively for Romanian, addressing a critical gap in open-source LLMs.
- Instruction Following: Designed for assistant-like chat applications, capable of understanding and responding to instructions in Romanian.
- Strong Performance: Outperforms the generalist Llama-2-7b-chat on several Romanian-specific benchmarks, including:
- Romanian MT-Bench: Achieves an average score of 4.97, significantly higher than Llama-2-7b-chat's 1.08, with 100% of answers in Romanian.
- RoCulturaBench: Scores 4.56 average, compared to Llama-2-7b-chat's 1.21, also with 100% Romanian answers.
- Academic Benchmarks: Shows improved performance across ARC, MMLU, Winogrande, Hellaswag, GSM8k, and TruthfulQA compared to Llama-2-7b-chat.
- Downstream Tasks: Demonstrates superior few-shot and finetuned performance on tasks like LaRoSeDa (Binary and Multiclass Macro F1), WMT (EN-RO Bleu), and XQuAD (EM and F1).
- Fine-tuned from RoLlama2-7b-Base: Built upon a foundational model also specialized for Romanian.
- Comprehensive Training Data: Trained using a diverse set of Romanian instruction-following datasets, including RoAlpaca, RoDolly, RoSelfInstruct, and RoUltraChat.
Intended Use Cases
This model is primarily intended for research use in Romanian natural language processing. Its instruction-tuned nature makes it suitable for:
- Developing assistant-like chat applications in Romanian.
- Exploring and advancing Romanian language understanding and generation.
- Adapting for various natural language tasks within the Romanian linguistic context.