HenryJJ/dolphin-2.6-mistral-7b-dpo-orca
Model Overview
HenryJJ/dolphin-2.6-mistral-7b-dpo-orca is a 7 billion parameter English language model, fine-tuned by HenryJJ. It is based on the Mistral 7B architecture and was developed using Direct Preference Optimization (DPO) from the cognitivecomputations/dolphin-2.6-mistral-7b base model. The training process involved 1200 steps on the Intel/orca_dpo_pairs dataset, utilizing a 1024 token context window.
Key Characteristics
- Architecture: Mistral 7B, a decoder-only (auto-regressive) transformer in the Llama-style family.
- Training Method: DPO (Direct Preference Optimization) for enhanced instruction following.
- Dataset: Trained on Intel/orca_dpo_pairs.
- Context Window: 1024 tokens during training.
- Prompt Format: Employs the ChatML format, with <|im_end|> mapping to token_id 2, ensuring compatibility with applications that expect the EOS token to be token_id 2.
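To make the prompt format concrete, the ChatML layout described above can be sketched as a small helper. The helper name and the example system message are illustrative, not part of the model card; only the `<|im_start|>` / `<|im_end|>` token layout comes from the ChatML format itself.

```python
def build_chatml_prompt(system: str, user: str) -> str:
    """Assemble a single-turn ChatML prompt.

    Illustrative sketch: the special tokens follow the ChatML format the
    card specifies; the function itself is a hypothetical convenience.
    """
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        f"<|im_start|>assistant\n"
    )
```

The prompt ends with an open `<|im_start|>assistant` turn so that generation continues as the assistant, and stops when the model emits `<|im_end|>` (token_id 2, the EOS).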
Intended Use Cases
This model is primarily suited for chat-based applications and tasks requiring robust instruction following in English. Its DPO training aims to improve response quality and adherence to user prompts, making it suitable for conversational AI and assistant roles.