Radu1999/MisterUkrainianDPO

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Feb 11, 2024License:apache-2.0Architecture:Transformer Open Weights Cold

Radu1999/MisterUkrainianDPO is a 7 billion parameter instruction-tuned language model developed by Radu Chivereanu, based on the Mistral-7B-v0.2 architecture. This model is a DPO iteration of MisterUkrainian, featuring Grouped-Query Attention, Sliding-Window Attention, and a Byte-fallback BPE tokenizer. It is specifically fine-tuned for instruction following, particularly in Ukrainian, making it suitable for tasks requiring precise responses to prompts in that language.

Loading preview...

Overview

Radu1999/MisterUkrainianDPO is a 7 billion parameter instruction-tuned language model, developed by Radu Chivereanu. It is an iteration of the MisterUkrainian model, enhanced with Direct Preference Optimization (DPO) to improve instruction following capabilities. The model is built upon the Mistral-7B-v0.2 architecture, incorporating advanced features such as Grouped-Query Attention, Sliding-Window Attention, and a Byte-fallback BPE tokenizer.

Key Capabilities

  • Instruction Following: Optimized to understand and respond accurately to instructions, particularly when formatted with [INST] and [/INST] tokens.
  • Ukrainian Language Focus: As a DPO iteration of MisterUkrainian, it is specifically tailored for tasks and interactions in the Ukrainian language.
  • Efficient Architecture: Leverages Mistral-7B-v0.2's architectural choices for efficient processing.

Good For

  • Ukrainian Language Applications: Ideal for chatbots, content generation, and question-answering systems requiring high proficiency in Ukrainian.
  • Instruction-Based Tasks: Suitable for scenarios where precise adherence to user prompts is critical.
  • Research and Development: Provides a strong base for further fine-tuning or experimentation with DPO models in a specific linguistic context.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p