lvkaokao/mistral-7b-finetuned-orca-dpo-v2

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Oct 25, 2023License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

The lvkaokao/mistral-7b-finetuned-orca-dpo-v2 is a 7 billion parameter language model, fine-tuned from Mistral-7B-v0.1. This model leverages the SlimOrca dataset for its instruction-following capabilities, offering a context length of 8192 tokens. It is primarily designed for general-purpose conversational AI and instruction-based tasks, building upon the strong foundation of the Mistral architecture.

Loading preview...

Model Overview

The lvkaokao/mistral-7b-finetuned-orca-dpo-v2 is a 7 billion parameter large language model (LLM) that has been fine-tuned from the original mistralai/Mistral-7B-v0.1 base model. This fine-tuning process utilized the Open-Orca/SlimOrca dataset, which is known for enhancing instruction-following abilities.

Key Capabilities

  • Instruction Following: Enhanced ability to understand and execute user instructions due to fine-tuning on the SlimOrca dataset.
  • General-Purpose Text Generation: Capable of generating coherent and contextually relevant text across a wide range of topics.
  • Mistral Architecture: Benefits from the efficient and performant architecture of the Mistral-7B base model.
  • Context Length: Supports a context window of 8192 tokens, allowing for processing and generating longer sequences of text.

Good For

  • Conversational AI: Suitable for chatbots and interactive agents that require robust instruction adherence.
  • Text Summarization: Can be applied to summarize documents or conversations.
  • Content Creation: Useful for generating various forms of written content based on specific prompts.
  • Research and Development: Provides a strong base for further experimentation and fine-tuning on specialized datasets.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p