Name: Orion-zhen/Qwen2.5-7B-Gutenberg-KTO API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Orion-zhen

Model Overview

Orion-zhen/Qwen2.5-7B-Gutenberg-KTO is a 7.6 billion parameter model fine-tuned by Orion-zhen using the KTO (Kahneman-Tversky Optimization) strategy. This model specifically leverages Gutenberg datasets, indicating a specialization in processing and generating text inspired by classic literature. The developer emphasizes an "eco-friendly training" approach, utilizing techniques like adam-mini, qlora, and unsloth to reduce VRAM and energy consumption while accelerating training.

Key Training Details

Dataset: Orion-zhen/kto-gutenberg
Epochs: 2
Gradient Accumulation: 8
Batch Size: 1
KTO Perf Beta: 0.1

Potential Use Cases

Literary Text Generation: Creating content in styles reminiscent of classic literature.
Text Analysis: Research and analysis of literary works.
Educational Tools: Developing applications for studying classic texts.

This model represents an exploration into the effectiveness of the KTO strategy on literary datasets, with a strong focus on resource-efficient training methodologies.

Overview

Model Overview

Key Training Details

Potential Use Cases

Full Model Card (README)