Name: OpenLLM-Ro/RoGemma-7b-Instruct-DPO API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: OpenLLM-Ro

Overview

OpenLLM-Ro/RoGemma-7b-Instruct-DPO is a human-aligned instruction-tuned generative text model, part of the RoGemma family developed by OpenLLM-Ro. This 8.5 billion parameter model is specifically designed for the Romanian language, representing a significant open-source effort to build specialized LLMs for Romanian. It is fine-tuned from RoGemma-7b-Instruct-2024-10-09 using the RoHelpSteer dataset, focusing on conversational capabilities.

Key Capabilities

Romanian Language Specialization: Developed and optimized exclusively for Romanian, addressing a gap in open-source LLMs.
Instruction Following: Fine-tuned for assistant-like chat and instruction-based tasks.
DPO Alignment: Utilizes Direct Preference Optimization (DPO) for human alignment, enhancing conversational quality.
Academic Benchmarks: Demonstrates competitive performance on Romanian-specific benchmarks like LaRoSeDa, WMT, XQuAD, STS, MT-Bench, and RoCulturaBench, often outperforming its base model and gemma-1.1-7b-it in relevant metrics.

Good For

Research in Romanian NLP: Ideal for academic and research purposes focused on the Romanian language.
Assistant-like Chatbots: Suited for building conversational agents and virtual assistants that interact in Romanian.
Natural Language Tasks: Adaptable for various Romanian NLP tasks, including text generation, question answering, and translation, particularly when fine-tuned from the base models.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)