W-61/llama-3-8b-base-epsilon-dpo-hh-harmless-8xh200

Text Generation · Concurrency Cost: 1 · Model Size: 8B · Quant: FP8 · Ctx Length: 8k · Published: Apr 11, 2026 · Architecture: Transformer · Cold

W-61/llama-3-8b-base-epsilon-dpo-hh-harmless-8xh200 is an 8 billion parameter language model developed by W-61, fine-tuned from W-61/llama-3-8b-base-sft-hh-harmless-8xh200. It was optimized with Direct Preference Optimization (DPO) on the Anthropic/hh-rlhf dataset to favor responses that are both harmless and helpful, and is intended for applications requiring robust safety and alignment in conversational AI.


Model Overview

This model, llama-3-8b-base-epsilon-dpo-hh-harmless-8xh200, is an 8 billion parameter language model developed by W-61. It is a fine-tuned variant of W-61/llama-3-8b-base-sft-hh-harmless-8xh200, specifically optimized using Direct Preference Optimization (DPO) on the Anthropic/hh-rlhf dataset.

Key Capabilities

  • Harmlessness and Helpfulness: The primary focus of this model's fine-tuning was to enhance its ability to generate responses that are both harmless and helpful, as indicated by its training on the Anthropic/hh-rlhf dataset.
  • DPO Alignment: Utilizes DPO for alignment, aiming to improve response quality based on human preferences.
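To make the DPO objective concrete, the following is a minimal sketch of the per-pair loss: it compares the policy's log-probability margin between the chosen and rejected responses against the frozen SFT reference model's margin. The function name and the `beta=0.1` default are illustrative assumptions, not values reported by this card.

```python
import math

def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    """Per-pair DPO loss from sequence log-probabilities.

    pi_*  : log-probs of the chosen/rejected response under the policy.
    ref_* : log-probs of the same responses under the frozen SFT reference.
    beta  : temperature controlling deviation from the reference (assumed).
    """
    # Implicit reward margin: how much more the policy prefers the chosen
    # response, relative to the reference model's preference.
    logits = beta * ((pi_chosen - ref_chosen) - (pi_rejected - ref_rejected))
    # Negative log-sigmoid of the margin: small when the policy clearly
    # prefers the chosen response, large when it prefers the rejected one.
    return -math.log(1.0 / (1.0 + math.exp(-logits)))
```

With identical log-probs on both responses the margin is zero and the loss is log 2; widening the margin in favor of the chosen response drives the loss toward zero.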

Training Details

The model was trained for a single epoch with a learning rate of 5e-07 and a total effective batch size of 128 across 8 GPUs. Evaluation metrics show a final loss of 0.6288 and a rewards accuracy of 0.6691, meaning the model's implicit reward favored the preferred (chosen) response over the rejected one in roughly 67% of evaluation pairs.
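The rewards-accuracy metric above can be sketched as a simple win rate over preference pairs; the helper name below is illustrative, assuming the standard definition of this metric in DPO training (fraction of pairs where the chosen response's implicit reward exceeds the rejected one's).

```python
def rewards_accuracy(chosen_rewards, rejected_rewards):
    """Fraction of preference pairs where the implicit reward of the
    chosen response exceeds that of the rejected response."""
    wins = sum(c > r for c, r in zip(chosen_rewards, rejected_rewards))
    return wins / len(chosen_rewards)

# Illustrative rewards for three evaluation pairs: the chosen response
# wins in two of them, so the accuracy is 2/3.
acc = rewards_accuracy([1.0, 0.5, -0.2], [0.2, 0.9, -0.5])
```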

Intended Use Cases

This model is particularly suitable for applications where generating safe, aligned, and helpful text is critical, such as:

  • Safe Chatbots: Developing conversational agents that prioritize harmless interactions.
  • Content Moderation: Assisting in generating or evaluating content for safety and appropriateness.
  • Aligned AI Assistants: Creating AI tools that adhere to ethical guidelines and user preferences.
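When building conversational applications on this model, prompts formatted in the Anthropic/hh-rlhf dialogue style ("Human:" / "Assistant:" turns separated by blank lines) should match what the model saw during preference tuning. The helper below is a hypothetical sketch of that formatting; the function name and turn representation are assumptions, not part of this card.

```python
def format_hh_prompt(turns):
    """Format a conversation in the Anthropic hh-rlhf dialogue style.

    turns: list of (role, text) pairs, where role is "user" or "assistant".
    Returns a prompt ending with an open "Assistant:" tag for generation.
    """
    parts = []
    for role, text in turns:
        tag = "Human" if role == "user" else "Assistant"
        parts.append(f"\n\n{tag}: {text}")
    # Leave the final Assistant turn open so the model completes it.
    parts.append("\n\nAssistant:")
    return "".join(parts)

prompt = format_hh_prompt([("user", "How do I report a phishing email?")])
```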