W-61/llama3-8b-base-new-method-s_star0.6-20260426-230653
W-61/llama3-8b-base-new-method-s_star0.6-20260426-230653 is an 8-billion-parameter language model fine-tuned by W-61. It is based on W-61/llama-3-8b-base-sft-ultrachat-8xh200 and further trained on the HuggingFaceH4/ultrafeedback_binarized dataset. The model is optimized for tasks requiring nuanced understanding and generation grounded in preference data, reaching a loss of 0.5352 on its evaluation set, and is suited to applications that benefit from instruction following and preference alignment.
Model Overview
W-61/llama3-8b-base-new-method-s_star0.6-20260426-230653 is an 8-billion-parameter language model developed by W-61. It is a fine-tuned iteration of the W-61/llama-3-8b-base-sft-ultrachat-8xh200 base model, trained on the HuggingFaceH4/ultrafeedback_binarized preference dataset.
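The checkpoint can be loaded like any other causal LM on the Hub. A minimal sketch using the Hugging Face `transformers` library (this assumes `transformers` and `torch` are installed and that enough accelerator memory is available for an 8B model; the dtype and device-map choices below are illustrative, not prescribed by the card):

```python
MODEL_ID = "W-61/llama3-8b-base-new-method-s_star0.6-20260426-230653"

def load_model():
    """Download and load the fine-tuned checkpoint.

    Sketch only: assumes the `transformers` and `torch` packages and a GPU
    with enough memory for an 8B-parameter model. The bfloat16 dtype and
    `device_map="auto"` are illustrative defaults, not card requirements.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # halves memory vs. float32
        device_map="auto",           # spread layers across available devices
    )
    return tokenizer, model
```

Calling `load_model()` triggers the download, so it is kept as a function rather than run at import time.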
Key Training Details
This model was trained for a single epoch with a learning rate of 5e-07 and a total batch size of 128, using a multi-GPU setup of 4 devices and the AdamW optimizer. Key evaluation metrics from the training run include:
- `loss`: 0.5352
- `fcm_dpo/beta`: 0.0111
- `margin_dpo/margin_mean`: 54.0836
- `logps/chosen`: -383.4114
- `logps/rejected`: -416.5982
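The reported beta and margin plug into the standard DPO objective, whose per-example loss is -log σ(β · margin), where the margin is the chosen-minus-rejected difference of policy-vs-reference log-ratios. A small arithmetic sketch (feeding the card's mean margin through the loss is illustrative only; the mean batch loss is not simply the loss of the mean margin):

```python
import math

def dpo_loss(beta: float, margin: float) -> float:
    """Per-example DPO loss: -log(sigmoid(beta * margin)).

    `margin` = [log pi(chosen) - log ref(chosen)]
             - [log pi(rejected) - log ref(rejected)].
    """
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))

# Illustration using the card's reported beta and mean margin:
print(round(dpo_loss(0.0111, 54.0836), 4))
```

A larger margin or beta drives the sigmoid toward 1 and the loss toward 0, which is why the margin mean is a useful health check alongside the raw loss.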
Intended Use Cases
While specific intended uses are not detailed, the fine-tuning on a preference dataset like ultrafeedback_binarized suggests this model is well-suited for tasks that benefit from alignment with human preferences, such as:
- Instruction following
- Response generation where quality is judged by human feedback
- Applications requiring nuanced understanding of preferred outputs
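For the instruction-following uses above, a generation call might look like the following sketch. It assumes `transformers` and `torch` are installed; whether this checkpoint ships a chat template is an assumption, so the code falls back to a raw prompt if none is present:

```python
def generate_reply(user_message: str, max_new_tokens: int = 256) -> str:
    """Generate a response to a single instruction.

    Sketch only: assumes the `transformers` and `torch` packages and enough
    GPU memory for an 8B model. Whether the tokenizer carries a chat
    template is an assumption, hence the fallback to a plain prompt.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "W-61/llama3-8b-base-new-method-s_star0.6-20260426-230653"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.bfloat16, device_map="auto"
    )

    if tokenizer.chat_template is not None:
        prompt = tokenizer.apply_chat_template(
            [{"role": "user", "content": user_message}],
            tokenize=False,
            add_generation_prompt=True,
        )
    else:
        prompt = user_message  # no template: treat the message as-is

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=False
    )
    # Decode only the newly generated tokens, not the echoed prompt.
    new_tokens = output[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Greedy decoding (`do_sample=False`) is shown for reproducibility; sampling parameters can be substituted as usual.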