Name: DoppelReflEx/QWQ-32B-Dawnwhisper-QWQTokenizer API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: DoppelReflEx

QWQ-32B-Dawnwhisper-QwenTokenizer Overview

This model, developed by DoppelReflEx, is a 32.8 billion parameter merged language model built upon the Qwen architecture, distinguished by its use of the QWQ Tokenizer. It aims to enhance reasoning capabilities compared to its original version. The model is noted for producing vivid and intelligent dialogues, making it particularly strong in roleplay applications.

Key Capabilities & Features

Enhanced Reasoning: Designed to offer improved reasoning, especially when a dedicated reasoning mode is activated using the <thinking> </thinking> token.
Vivid Dialogue & Roleplay: Excels at generating engaging and smart conversational exchanges, making it suitable for interactive and roleplaying tasks.
QWQ Tokenizer: Utilizes a specialized QWQ Tokenizer, which is intended to boost performance in reasoning tasks.
Resource-Friendly: Described as a viable option for users with more modest hardware specifications, with IQ3 variants capable of running on 16GB VRAM cards.

Usage Considerations

Language Performance: While strong in English and Chinese, performance may decrease with other languages. Activating reasoning mode is recommended for non-English/Chinese use cases to maintain experience quality.
ChatML Template: Requires the use of the ChatML template for optimal performance.
Reasoning Mode: Optional but recommended for boosting roleplay and multitasking experiences, though it may increase processing time. Specific configuration for reasoning mode is provided for platforms like SillyTavern.

Overview

QWQ-32B-Dawnwhisper-QwenTokenizer Overview

Key Capabilities & Features

Usage Considerations

Full Model Card (README)