OpenLLM-Ro/RoGemma-7b-Instruct-DPO

TEXT GENERATIONConcurrency Cost:1Model Size:8.5BQuant:FP8Ctx Length:8kPublished:Oct 10, 2024License:cc-by-nc-4.0Architecture:Transformer0.0K Open Weights Cold

OpenLLM-Ro/RoGemma-7b-Instruct-DPO is an 8.5 billion parameter instruction-tuned generative text model developed by OpenLLM-Ro, specifically optimized for the Romanian language. This model is a human-aligned instruct variant within the RoGemma family, fine-tuned from RoGemma-7b-Instruct-2024-10-09 using the RoHelpSteer dataset. It excels in assistant-like chat applications and various natural language tasks within Romanian, offering a specialized solution for Romanian language processing.

Loading preview...

Overview

OpenLLM-Ro/RoGemma-7b-Instruct-DPO is a human-aligned instruction-tuned generative text model, part of the RoGemma family developed by OpenLLM-Ro. This 8.5 billion parameter model is specifically designed for the Romanian language, representing a significant open-source effort to build specialized LLMs for Romanian. It is fine-tuned from RoGemma-7b-Instruct-2024-10-09 using the RoHelpSteer dataset, focusing on conversational capabilities.

Key Capabilities

  • Romanian Language Specialization: Developed and optimized exclusively for Romanian, addressing a gap in open-source LLMs.
  • Instruction Following: Fine-tuned for assistant-like chat and instruction-based tasks.
  • DPO Alignment: Utilizes Direct Preference Optimization (DPO) for human alignment, enhancing conversational quality.
  • Academic Benchmarks: Demonstrates competitive performance on Romanian-specific benchmarks like LaRoSeDa, WMT, XQuAD, STS, MT-Bench, and RoCulturaBench, often outperforming its base model and gemma-1.1-7b-it in relevant metrics.

Good For

  • Research in Romanian NLP: Ideal for academic and research purposes focused on the Romanian language.
  • Assistant-like Chatbots: Suited for building conversational agents and virtual assistants that interact in Romanian.
  • Natural Language Tasks: Adaptable for various Romanian NLP tasks, including text generation, question answering, and translation, particularly when fine-tuned from the base models.