Name: wan-wan/test18-dpo API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: wan-wan

Model Overview

wan-wan/test18-dpo is a 4 billion parameter Qwen3 model developed by wan-wan. It was fine-tuned from the wan-wan/test08-checkpoint-266 model and utilizes a substantial 32768 token context length, making it suitable for tasks requiring extensive contextual understanding.

Key Capabilities

Efficient Finetuning: This model was finetuned with Unsloth and Huggingface's TRL library, resulting in a 2x speed improvement during the training process.
Qwen3 Architecture: Based on the Qwen3 architecture, it inherits its foundational language understanding and generation capabilities.
Extended Context Window: Features a 32768 token context length, allowing it to process and generate longer texts while maintaining coherence.

Good For

Applications requiring a Qwen3-based model with a large context window.
Developers looking for models that have undergone efficient finetuning processes.
Tasks benefiting from a 4 billion parameter model with optimized training characteristics.

Overview

Model Overview

Key Capabilities

Good For

Full Model Card (README)