W-61/mistral-7b-base-epsilon-dpo-hh-harmless-4xh200-batch-64
The W-61/mistral-7b-base-epsilon-dpo-hh-harmless-4xh200-batch-64 model is a 7 billion parameter language model, fine-tuned from a Mistral-7B base model. It has been optimized using Epsilon DPO on the Anthropic/hh-rlhf dataset to enhance harmlessness and alignment. This model is designed for applications requiring a robust, safety-aligned language model with a 4096 token context length.
Overview
This model, W-61/mistral-7b-base-epsilon-dpo-hh-harmless-4xh200-batch-64, is a 7 billion parameter language model derived from a Mistral-7B base architecture. It has undergone fine-tuning using the Epsilon DPO (Direct Preference Optimization) method on the Anthropic/hh-rlhf dataset. This specific training approach aims to improve the model's harmlessness and alignment with human preferences, making it suitable for applications where safety and ethical considerations are paramount.
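Assuming the model follows standard Hugging Face Hub conventions, it should load with the `transformers` library. The sketch below is illustrative: the repository id is taken from the model name, and the `bfloat16`/`device_map` settings are common choices rather than requirements stated by this card.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "W-61/mistral-7b-base-epsilon-dpo-hh-harmless-4xh200-batch-64"
MAX_CONTEXT = 4096  # context window stated in this card

def load_model():
    """Load tokenizer and weights; dtype and device placement are illustrative."""
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16, device_map="auto"
    )
    return tokenizer, model
```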
Key Characteristics
- Base Model: Fine-tuned from the Mistral-7B base (pre-trained, non-instruct) model.
- Fine-tuning Method: Utilizes Epsilon DPO for alignment, specifically targeting harmlessness.
- Dataset: Trained on the Anthropic/hh-rlhf dataset, known for its focus on helpfulness and harmlessness.
- Context Length: Supports a context window of 4096 tokens.
- Performance Metrics: Achieved a validation loss of 0.5935 and a rewards accuracy of 0.7196 on the evaluation set; in other words, the model's implicit reward ranked the preferred response above the rejected one in roughly 72% of held-out preference pairs.
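The rewards accuracy above is defined by DPO's implicit reward. The sketch below shows the standard per-pair DPO loss with an optional label-smoothing `eps`; this card does not specify Epsilon DPO's exact objective, so treating "epsilon" as label smoothing is an assumption, and `eps=0.0` recovers vanilla DPO.

```python
import math

def softplus(x):
    """Numerically stable log(1 + e^x)."""
    return math.log1p(math.exp(-abs(x))) + max(x, 0.0)

def dpo_pair(pi_c, pi_r, ref_c, ref_r, beta=0.1, eps=0.0):
    """Standard DPO loss for one preference pair, given sequence log-probs.

    pi_c, pi_r   : chosen / rejected log-prob under the policy
    ref_c, ref_r : chosen / rejected log-prob under the frozen reference
    eps          : optional label smoothing; its relation to 'Epsilon DPO'
                   is an assumption, not stated by this model card
    """
    r_chosen = beta * (pi_c - ref_c)      # implicit reward, chosen response
    r_rejected = beta * (pi_r - ref_r)    # implicit reward, rejected response
    margin = r_chosen - r_rejected
    # -log sigmoid(m) == softplus(-m); eps smooths between the two labels.
    loss = (1.0 - eps) * softplus(-margin) + eps * softplus(margin)
    correct = r_chosen > r_rejected       # the event counted by rewards accuracy
    return loss, correct
```

Averaging `correct` over all evaluation pairs is what yields a rewards accuracy figure like the 0.7196 reported above.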
Intended Use Cases
This model is particularly well-suited for scenarios requiring a language model that prioritizes safety and avoids generating harmful content. Its fine-tuning on the Anthropic/hh-rlhf dataset makes it a strong candidate for:
- Content moderation systems.
- Chatbots or conversational AI where harmlessness is critical.
- Applications requiring aligned and ethically sound text generation.
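For conversational deployments, the prompt plus the generated reply must fit within the 4096-token context window, so older turns need to be trimmed before each generation call. The helper below is a hypothetical sketch: the function name and the 256-token generation reserve are illustrative choices, not part of this model card.

```python
def truncate_to_context(token_ids, max_context=4096, max_new_tokens=256):
    """Keep the most recent tokens so prompt + reply fit the context window.

    token_ids      : tokenized conversation history, oldest tokens first
    max_context    : the model's 4096-token context length
    max_new_tokens : room reserved for the generated reply (illustrative)
    """
    budget = max_context - max_new_tokens
    # Drop the oldest tokens; recent turns matter most for the next reply.
    return token_ids[-budget:] if len(token_ids) > budget else token_ids
```

A chat loop would call this on the accumulated history before passing the ids to `generate`, keeping the window constraint satisfied as the conversation grows.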