AuraIndustries/Aura-8B

Warm
Public
8B
FP8
32768
Dec 8, 2024
License: apache-2.0
Hugging Face
Overview

Aura-8B: A Dedicated Roleplaying Model

Aura-8B, developed by Aura Industries with contributions from Anthracite Org, is an 8 billion parameter instruction-tuned model built upon the arcee-ai/Llama-3.1-SuperNova-Lite base. Its primary distinction lies in its specialized optimization for roleplaying scenarios, achieved through extensive fine-tuning on hundreds of millions of tokens of instruction and roleplaying data.

Key Capabilities & Features

  • Dedicated Roleplaying: Engineered specifically to excel in interactive and narrative roleplaying applications.
  • Unique Output Style: Incorporates a Kahneman-Tversky Optimization (KTO) as a Low Rank Adapter, contributing to a distinct conversational and narrative style.
  • Extended Context Window: Supports a maximum context length of 8,192+ tokens, allowing for more complex and sustained interactions.
  • Llama 3 Prompt Format: Utilizes the Llama 3 prompt format for chat completions.
  • Quantizations Available: Offers various quantizations including Static GGUF, Imatrix GGUF, and EXL2 for flexible deployment.

Training & Performance

The model underwent a two-stage training process: an initial Supervised Fine-Tuning (SFT) phase followed by a Kahneman-Tversky Optimization (KTO) phase. The SFT phase leveraged diverse datasets covering roleplaying, cybersecurity, medical instruction, math, and creative writing. The KTO phase used the anthracite-core/full-opus-chosen-hermes-rejected-kto-v1 dataset to refine its output style. On the Open LLM Leaderboard, Aura-8B achieved an average score of 27.34, with notable scores in IFEval (72.05) and MMLU-PRO (31.93).

Ideal Use Cases

Aura-8B is particularly well-suited for applications requiring:

  • Interactive Storytelling: Generating dynamic and engaging narratives.
  • Character Simulation: Creating realistic and consistent character personas for virtual companions or game NPCs.
  • Creative Writing Assistance: Aiding in the development of fictional scenarios and dialogues.
  • Personalized Conversational Agents: Building chatbots with distinct personalities and role-specific interactions.