W-61/mistral-7b-base-sft-hh-helpful-4xh200-batch-64
W-61/mistral-7b-base-sft-hh-helpful-4xh200-batch-64 is a 7 billion parameter language model fine-tuned from mistralai/Mistral-7B-v0.3. This model has been specifically fine-tuned on the Anthropic/hh-rlhf dataset, aiming to enhance helpfulness and alignment. With a context length of 4096 tokens, it is optimized for conversational AI and instruction-following tasks where helpful and harmless responses are critical.
Model Overview
This model, W-61/mistral-7b-base-sft-hh-helpful-4xh200-batch-64, is a 7 billion parameter large language model. It is a fine-tuned variant of the mistralai/Mistral-7B-v0.3 base model.
Key Capabilities
- Fine-tuned for Helpfulness: The model has undergone supervised fine-tuning (SFT) on the Anthropic/hh-rlhf dataset. This training focuses on generating responses that are helpful and aligned with human preferences.
- Base Architecture: Leverages the efficient and performant Mistral-7B architecture, known for its strong performance relative to its size.
- Context Window: Supports a context length of 4096 tokens, suitable for processing moderately long inputs and generating coherent, extended responses.
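Because the model was fine-tuned on Anthropic/hh-rlhf, prompts in that dataset's original `\n\nHuman: ... \n\nAssistant:` turn format are a reasonable starting point. The sketch below is a minimal helper, assuming the SFT run preserved this formatting (the exact prompt template used during training is not stated on this card):

```python
def format_hh_prompt(turns):
    """Format (role, text) turns in the hh-rlhf style: alternating
    "Human:" / "Assistant:" blocks, ending with an empty
    "Assistant:" cue for the model to complete."""
    parts = [f"\n\n{role}: {text}" for role, text in turns]
    parts.append("\n\nAssistant:")
    return "".join(parts)

prompt = format_hh_prompt([("Human", "How do I brew pour-over coffee?")])
```

The resulting string can then be tokenized and passed to the model for generation as with any causal LM.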
Training Details
The model was trained with a learning rate of 2e-05 over 1 epoch, utilizing a total batch size of 64 across 4 GPUs. The training process achieved a final validation loss of 0.7902.
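The total batch size of 64 across 4 GPUs can be decomposed as below; the per-device batch size and gradient-accumulation steps are assumptions for illustration, since only the total is reported:

```python
num_gpus = 4
per_device_batch_size = 16       # assumption: 64 / 4 with no gradient accumulation
gradient_accumulation_steps = 1  # assumption

# Effective (total) batch size seen by the optimizer per step.
effective_batch_size = (
    per_device_batch_size * num_gpus * gradient_accumulation_steps
)
print(effective_batch_size)  # 64, matching the reported total batch size
```

Any combination whose product is 64 (e.g. per-device 8 with 2 accumulation steps) would yield the same effective batch size.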
Intended Use Cases
This model is particularly well-suited for applications requiring helpful and harmless conversational AI, such as:
- Chatbots and virtual assistants where user assistance and safety are priorities.
- Instruction-following tasks where the model needs to provide constructive and aligned outputs.
- Generating helpful content based on user prompts.