OpenLLM-Ro/RoMistral-7b-Instruct

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Oct 9, 2024License:cc-by-nc-4.0Architecture:Transformer0.0K Open Weights Warm

RoMistral-7b-Instruct is a 7 billion parameter instruction-tuned generative text model developed by OpenLLM-Ro, specifically designed for the Romanian language. Fine-tuned from Mistral-7B-v0.3, this model is part of the first open-source effort to build LLMs specialized for Romanian. It excels in Romanian natural language tasks, offering strong performance in areas like question answering, sentiment analysis, and machine translation, making it ideal for assistant-like chat and research in Romanian NLP.

Loading preview...

RoMistral-7b-Instruct: A Specialized Romanian LLM

RoMistral-7b-Instruct is a 7 billion parameter instruction-tuned model developed by OpenLLM-Ro, representing a significant open-source initiative to create large language models tailored for Romanian. This model is fine-tuned from Mistral-7B-v0.3 and is part of a broader family of RoMistral models, including foundational and chat variants.

Key Capabilities

  • Romanian Language Specialization: Designed specifically for Romanian, addressing a gap in open-source LLMs for this language.
  • Instruction Following: Optimized for assistant-like chat and instruction-based tasks through fine-tuning on various Romanian datasets like RoAlpaca, RoDolly, and RoUltraChat.
  • Strong Performance: Demonstrates competitive results across several Romanian benchmarks, including LaRoSeDa for sentiment analysis, WMT for machine translation, and XQuAD for question answering, often outperforming its base Mistral-7B-Instruct-v0.2 counterpart in Romanian contexts.
  • Research Focus: Intended primarily for research use in Romanian natural language processing.

Good For

  • Romanian NLP Research: Ideal for academic and research projects focusing on the Romanian language.
  • Assistant-like Applications: Suitable for developing chatbots and conversational AI systems that interact in Romanian.
  • Language-Specific Tasks: Excels in tasks such as text generation, summarization, and question answering in Romanian.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p