Name: KaeriJenti/kaori-34b-v3 API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: KaeriJenti

kaori-34b-v3 Overview

KaeriJenti/kaori-34b-v3 is a 34 billion parameter language model developed through a collaborative effort by Kaeri and Jenti. This model was fine-tuned using a Supervised Fine-Tuning (SFT) approach, leveraging a dataset composition primarily from Open-Platypus (100%) and a smaller portion from Dolphin (5%). The development process specifically excluded GSM8k samples and implemented rigorous similarity filtering to prevent data contamination from various benchmark tasks, including cot_gsm8k, drop, winogrande, ai2_arc, and hellaswag.

Key Capabilities

General Language Understanding: Designed for a broad range of language-based tasks.
Contamination-Aware Training: Trained with explicit measures to avoid overfitting to common academic benchmarks, aiming for more robust generalization.

Training Details

The model was fine-tuned using the LLaMA-Factory framework with a LoRA (Low-Rank Adaptation) strategy. The training involved 3 epochs with a batch size of 8, utilizing four A100 GPUs (80GB each).

Overview

kaori-34b-v3 Overview

Key Capabilities

Training Details

Full Model Card (README)