Name: peluz/qwen3-0.6b-cat-lingo-dpo API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: peluz

Model Overview

The peluz/qwen3-0.6b-cat-lingo-dpo is a specialized language model based on the Qwen3-0.6B architecture, featuring 0.8 billion parameters and a 32768 token context length. Its primary distinction is its unique persona: it is fine-tuned to communicate exclusively in cat lingo, responding to all prompts with meows, purrs, and hisses.

Training Details

This model was developed using a combination of Supervised Fine-tuning (SFT) and Direct Preference Optimization (DPO), leveraging the TRL library and LoRA (r=32) for efficient adaptation. The training involved 10 epochs for SFT and an additional 10 epochs for DPO, with a DPO temperature (β) of 0.1. The dataset consisted of 80/15 training/validation splits of LLM-generated triples, comprising prompts, chosen cat-lingo responses, and rejected plain-language responses.

Key Capabilities

Cat Lingo Generation: Generates responses entirely in cat-like sounds and expressions.
Persona Consistency: Aims to maintain a consistent feline persona across various topics.

Limitations

Due to training on a relatively small dataset, the model's cat persona consistency may vary, particularly when encountering unusual or complex topics. Increasing the dataset size is suggested for more robust and reliable behavior.

When to Use This Model

This model is ideal for:

Creative Applications: Generating unique, character-driven dialogue.
Entertainment: Creating humorous or novelty interactions.
Persona-based Chatbots: Exploring specialized conversational agents with distinct personalities.

Overview

Model Overview

Training Details

Key Capabilities

Limitations

When to Use This Model

Full Model Card (README)