TheBloke/airoboros-13B-HF

Text Generation · Model Size: 13B · Quant: FP8 · Context Length: 4k · Published: May 23, 2023 · License: other · Architecture: Transformer

TheBloke/airoboros-13B-HF is a 13 billion parameter LLaMA-based language model fine-tuned by Jon Durbin using completely synthetic training data. This model is optimized for general instruction following and demonstrates strong performance in various tasks, including math, coding, and question-answering, as evaluated by GPT-4 judging. It is provided in a 16-bit floating-point (fp16) format for efficient storage and usage.


Overview

The TheBloke/airoboros-13B-HF model is a 13 billion parameter LLaMA-based language model, fine-tuned by Jon Durbin. Its distinguishing feature is the use of entirely synthetic training data, generated using a 'jailbreak' prompt with ChatGPT to create a diverse dataset, including content that might typically be censored. This approach aimed to test the capabilities of ChatGPT when unfiltered.

Key Capabilities

  • Strong Instruction Following: The model is fine-tuned for general instruction adherence.
  • Competitive Performance: Evaluated against other models using GPT-4 judging, airoboros-13B achieved a GPT-3.5 adjusted score of 98.087, performing comparably to GPT-3.5 and outperforming several other 13B and 30B models in the evaluation set.
  • Synthetic Data Training: The training data was generated synthetically, with additional passes to improve performance in areas like math, extrapolation, closed question-answering (addressing hallucination), and coding.

Training Details

The model was fine-tuned using the FastChat module on 8x NVIDIA A100 GPUs over approximately 40 hours. The training process involved an initial set of synthetic instructions, followed by a second fine-tuning pass specifically targeting improvements in math, coding, and question-answering. The prompt format is compatible with FastChat/Vicuna, using a `USER: [prompt] </s> ASSISTANT:` structure.
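As a minimal sketch of the FastChat/Vicuna-compatible prompt structure described above, the helper below assembles a prompt string, terminating completed assistant turns with the `</s>` end-of-sequence token. The function name and multi-turn handling are illustrative assumptions, not part of the model card.

```python
def build_prompt(user_message: str, history=None) -> str:
    """Format a conversation into a FastChat/Vicuna-style prompt.

    `history` is an optional list of (user, assistant) turns; each
    completed assistant reply is terminated with the </s> token, as in
    the USER: [prompt] </s> ASSISTANT: structure from the model card.
    """
    parts = []
    for user_turn, assistant_turn in (history or []):
        parts.append(f"USER: {user_turn} ASSISTANT: {assistant_turn}</s>")
    # The final turn ends at "ASSISTANT:" so the model generates the reply.
    parts.append(f"USER: {user_message} ASSISTANT:")
    return " ".join(parts)

prompt = build_prompt("What is 2 + 2?")
```

The resulting string can then be passed to any inference backend (e.g. a `transformers` text-generation pipeline) that serves the fp16 weights.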