ewqr2130/mistral-inst-v02-dpo

Text Generation · Concurrency Cost: 1 · Model Size: 7B · Quant: FP8 · Ctx Length: 8k · Published: Jan 8, 2024 · License: MIT · Architecture: Transformer · Open Weights

ewqr2130/mistral-inst-v02-dpo is a 7-billion-parameter language model released by ewqr2130 and built on Mistral-inst-v02 (i.e., Mistral's 7B Instruct v0.2). It was further trained with Direct Preference Optimization (DPO) for 6000 epochs to strengthen its instruction-following capabilities. With an 8192-token context window, it is aimed at general-purpose conversational and instruction-based tasks.


Model Overview

ewqr2130/mistral-inst-v02-dpo is a 7-billion-parameter language model derived from the Mistral-inst-v02 base. Its distinguishing feature is the application of Direct Preference Optimization (DPO) over 6000 epochs. DPO aligns a model directly against pairs of preferred and rejected responses, and is typically used to bring a model's outputs closer to human preferences and to improve its instruction-following behavior.

Key Characteristics

  • Base Model: Mistral-inst-v02
  • Parameter Count: 7 billion parameters
  • Optimization Method: Direct Preference Optimization (DPO) applied for 6000 epochs
  • Context Length: Supports an 8192-token context window
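Because the model descends from Mistral's instruct line, prompts are expected to follow the `[INST] ... [/INST]` chat template used by Mistral-7B-Instruct-v0.2. The sketch below builds that prompt string by hand for illustration; in practice the authoritative template should be taken from the model's own tokenizer (`tokenizer.apply_chat_template`), and the helper name here is our own:

```python
def build_mistral_prompt(turns):
    """Format alternating (role, text) turns into Mistral's [INST] template.

    Assumes the conversation starts with a user turn and that roles
    alternate between "user" and "assistant". This mirrors the
    Mistral-Instruct-v0.2 convention: user turns are wrapped in
    [INST] ... [/INST], assistant turns are closed with </s>.
    """
    parts = ["<s>"]
    for role, text in turns:
        if role == "user":
            parts.append(f"[INST] {text} [/INST]")
        else:  # assistant turn
            parts.append(f"{text}</s>")
    return "".join(parts)

prompt = build_mistral_prompt([("user", "Summarize DPO in one sentence.")])
```

The resulting string can be tokenized and passed to the model as-is; for multi-turn use, append the model's reply as an `assistant` turn and continue appending user turns.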

Intended Use Cases

This model is suitable for applications requiring a robust instruction-tuned language model. The DPO fine-tuning suggests improved performance in:

  • Following complex instructions
  • Generating coherent and contextually relevant responses
  • General conversational AI tasks
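When deploying the model, remember that the 8192-token window is shared between the prompt and the generated output, so the generation budget must be computed from the prompt length. A minimal sketch of that bookkeeping (the token count would come from the real tokenizer; the function name is our own):

```python
CTX_LEN = 8192  # context window stated on this model card

def max_new_tokens(prompt_token_count, ctx_len=CTX_LEN):
    """Return how many tokens remain for generation after the prompt.

    If the prompt already fills (or exceeds) the window, no room is
    left and the caller must truncate the prompt before generating.
    """
    remaining = ctx_len - prompt_token_count
    return max(remaining, 0)
```

For example, a 7000-token prompt leaves 1192 tokens of generation budget, while a prompt at or over 8192 tokens leaves none and must be shortened first.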