Delta-Vector/Rei-24B-KTO

Warm
Public
24B
FP8
32768
Hugging Face
Overview

Rei-KTO-24B: Creative Prose and Roleplaying Model

Delta-Vector's Rei-KTO-24B is a 24 billion parameter language model designed for creative writing and roleplaying applications. It aims to emulate the prose style and coherency found in Anthropic's Claude models (Opus and Sonnet).

Key Capabilities & Features

  • Creative Writing & Roleplaying: Optimized for generating engaging narratives and character interactions.
  • KTO Enhanced: Utilizes Kahneman-Tversky Optimization for improved coherency and instruction adherence.
  • Claude-like Prose: Fine-tuned to replicate the sophisticated writing style of Anthropic Claude models.
  • V7 Tekken Prompting: Supports a specific prompting format for consistent interaction.

Training Details

The model underwent a two-step training process:

  1. Supervised Fine-Tuning (SFT): Initial training on the Zerofata's PaintedFantasy dataset, noted for its effectiveness with anime-otaku-esque characters.
  2. KTO Optimization: Subsequent training to further enhance coherency and instruction following capabilities.

Training involved 2 epochs over 24 hours using 8 x A100 GPUs, built with Axolotl.

Available Formats

Quantized versions are available for various deployment needs: