OpenLLM-Ro/RoLlama3.1-8b-Instruct-DPO

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Oct 9, 2024License:cc-by-nc-4.0Architecture:Transformer0.0K Open Weights Cold

OpenLLM-Ro/RoLlama3.1-8b-Instruct-DPO is an 8 billion parameter instruction-tuned generative text model built with Meta Llama 3.1, specifically optimized for the Romanian language. Developed by OpenLLM-Ro, this model excels in human-aligned instruction following and chat-based interactions in Romanian. It demonstrates strong performance across various Romanian benchmarks, including MT-Bench and RoCulturaBench, making it suitable for research and assistant-like applications in Romanian.

Loading preview...

OpenLLM-Ro/RoLlama3.1-8b-Instruct-DPO: Romanian-Optimized Llama 3.1

This model is part of the RoLlama3.1 family, an initiative by OpenLLM-Ro to develop open-source Large Language Models specifically for the Romanian language. Built upon Meta Llama 3.1, this 8 billion parameter variant is an instruction-tuned model that has undergone Direct Preference Optimization (DPO) for enhanced human alignment.

Key Capabilities & Differentiators

  • Romanian Language Specialization: Represents a dedicated effort to create powerful LLMs for Romanian, addressing a significant gap in open-source models.
  • Human-Aligned Instruction Following: Fine-tuned using DPO with Romanian datasets like RoHelpSteer, RoUltraFeedback, and RoMagpieDPO, resulting in improved conversational quality and adherence to instructions.
  • Strong Performance on Romanian Benchmarks: Achieves the highest scores within its family on MT-Bench (7.00 average) and RoCulturaBench (4.73 average), indicating superior performance in Romanian-specific conversational and cultural understanding tasks.
  • Research-Oriented: Intended for research purposes, providing a robust foundation for further development and adaptation in Romanian NLP tasks.

Intended Use Cases

  • Assistant-like Chatbots: Designed for conversational AI applications requiring natural and helpful interactions in Romanian.
  • Natural Language Understanding (NLU) in Romanian: Adaptable for various NLU tasks, particularly when fine-tuned from its base model variants.
  • Research and Development: Serves as a valuable resource for researchers working on Romanian language models and applications.

Limitations

  • Language Specificity: Primarily intended for use in Romanian; performance in other languages is not guaranteed and is outside its scope.
  • License Restrictions: Licensed under CC-BY-NC-4.0, which restricts commercial use.