SteelStorage/llama-3-cat-8b-instruct-v1

Text Generation

- Concurrency Cost: 1
- Model Size: 8B
- Quant: FP8
- Context Length: 32k
- Published: May 11, 2024
- License: llama3
- Architecture: Transformer

SteelStorage/llama-3-cat-8b-instruct-v1 is an 8-billion-parameter Llama 3 instruction-tuned model developed by SteelSkull, with dataset preparation by Dr. Kal'tsit. It focuses on system prompt fidelity, character engagement, and helpfulness, particularly in biosciences and general science, and aims for maximum character immersion. The model is designed to respect system prompts to an extreme degree and to offer detailed Chain of Thought (COT) responses.


Overview

This 8-billion-parameter Llama 3 instruct model, developed by SteelSkull with dataset preparation by Dr. Kal'tsit, is fine-tuned to prioritize system prompt fidelity, helpfulness, and character engagement, aiming for deep immersion in role-play scenarios.

Key Capabilities

  • System Instruction Fidelity: Designed to adhere strictly to system prompts.
  • Chain of Thought (COT): Capable of generating detailed, step-by-step reasoning, though this behavior is primarily driven by system card instructions rather than inherent fine-tuning.
  • Character Immersion: Optimized for maximum character engagement and role-play.
  • Helpfulness: Provides helpful information, with a particular focus on biosciences and general science, drawing from health-related data for detailed diagnoses.
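Because the COT behavior is driven mainly by the system card rather than the fine-tune itself, a step-by-step instruction can be placed directly in the system slot of the standard Llama 3 chat template. A minimal sketch (the system prompt text below is a hypothetical example, not from the model card):

```python
# Build a single-turn Llama 3 instruct prompt with a strict system card.
# The special tokens follow the standard Llama 3 chat template; the system
# prompt text is an illustrative assumption.

def build_llama3_prompt(system: str, user: str) -> str:
    """Format a single-turn prompt using Llama 3's chat template."""
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

system_card = (
    "You are a meticulous bioscience assistant. "
    "Reason step by step before giving a final answer."
)
prompt = build_llama3_prompt(system_card, "What causes iron-deficiency anemia?")
```

In practice, `tokenizer.apply_chat_template` from the `transformers` library produces the same format; the manual version above just makes the structure explicit.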

Training Details

The model was trained on a filtered Hugging Face dataset of instruction-response pairs, with a GPT model used to establish a standard for high-quality responses. The dataset was further refined for length and COT responses, and health-related data from Chat Doctor was included, favoring detailed and step-by-step diagnoses. Training involved 4 epochs over 6 days on a single A100 GPU.
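The card does not publish the filtering pipeline; a hypothetical sketch of the kind of length- and COT-based filtering described above might look like the following (thresholds and step markers are illustrative assumptions):

```python
# Hypothetical sketch of length/COT dataset filtering. The threshold and
# marker phrases are assumptions for illustration, not the actual pipeline
# used to prepare the training data.

STEP_MARKERS = ("step 1", "first,", "next,", "finally,")

def keep_example(response: str, min_chars: int = 300) -> bool:
    """Keep responses that are long enough and show step-by-step structure."""
    text = response.lower()
    long_enough = len(response) >= min_chars
    has_steps = any(marker in text for marker in STEP_MARKERS)
    return long_enough and has_steps

pairs = [
    {"response": "Take ibuprofen."},                       # too short, no steps
    {"response": "Step 1: take a history. " + "x" * 300},  # detailed, kept
]
filtered = [p for p in pairs if keep_example(p["response"])]
```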

Performance

Evaluations on the Open LLM Leaderboard show an average score of 64.74, with notable scores in HellaSwag (79.20) and Winogrande (75.93).

Popular Sampler Settings

The three most common parameter combinations used by Featherless users for this model draw on the following sampler parameters:

- `temperature`
- `top_p`
- `top_k`
- `frequency_penalty`
- `presence_penalty`
- `repetition_penalty`
- `min_p`
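The page lists only the parameter names, not the values. As an illustration of what a subset of these knobs does, here is a toy pure-Python sketch of temperature, top-k, and min-p filtering over a tiny next-token distribution (the parameter values and logits are assumptions, not actual user configs):

```python
# Toy sketch of sampler filtering: temperature, top-k, and min-p.
# Real inference engines apply the same ideas over the full vocabulary.
import math

def sample_filter(logits, temperature=0.8, top_k=3, min_p=0.05):
    """Apply temperature, top-k, and min-p filtering; return token probs."""
    # Temperature: rescale logits (lower values sharpen the distribution).
    scaled = {tok: l / temperature for tok, l in logits.items()}
    # Softmax over the scaled logits (subtract the max for stability).
    m = max(scaled.values())
    exps = {tok: math.exp(l - m) for tok, l in scaled.items()}
    z = sum(exps.values())
    probs = {tok: e / z for tok, e in exps.items()}
    # Top-k: keep only the k most probable tokens.
    kept = dict(sorted(probs.items(), key=lambda kv: -kv[1])[:top_k])
    # Min-p: drop tokens below min_p times the top token's probability.
    cutoff = min_p * max(kept.values())
    kept = {tok: p for tok, p in kept.items() if p >= cutoff}
    # Renormalize the surviving tokens.
    z = sum(kept.values())
    return {tok: p / z for tok, p in kept.items()}

dist = sample_filter({"the": 2.0, "a": 1.5, "cat": 1.0, "dog": -3.0})
```

The penalty parameters (`frequency_penalty`, `presence_penalty`, `repetition_penalty`) act earlier, adjusting logits of already-generated tokens before this filtering step.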