Name: xwen-team/Xwen-7B-Chat API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: xwen-team

Xwen-7B-Chat: Top-Performing Open-Source Chat Model

Xwen-7B-Chat, developed by xwen-team, is a 7.6 billion parameter large language model post-trained from the Qwen2.5-7B base model. It is specifically designed and optimized for chat performance, demonstrating leading capabilities among open-sourced models under 10 billion parameters.

Key Capabilities & Performance

Top-1 Chat Performance: Achieves the highest scores among open-source models below 10B parameters on widely-used benchmarks such as Arena-Hard-Auto, AlignBench, and MT-Bench (as of February 1, 2025).
Strong Benchmark Results:
- Arena-Hard-Auto (No Style Control): Scored 59.4, outperforming Qwen2.5-7B-Instruct (50.4) and Gemma-2-27B-IT (57.5).
- Arena-Hard-Auto (Style Control): Scored 50.3, surpassing Qwen2.5-7B-Instruct (46.9) and Gemma-2-27B-IT (47.5).
- AlignBench-v1.1: Achieved a score of 6.88, higher than Qwen2.5-7B-Chat (6.56).
- MT-Bench: Scored 7.98, exceeding Qwen2.5-7B-Chat (7.71).
High Context Length: Supports a context length of 131072 tokens, enabling extensive conversational interactions.
Qwen2.5 Compatibility: Utilizes the same tokenizer and chat template as Qwen2.5-Instruct models, ensuring ease of integration.

When to Use Xwen-7B-Chat

Conversational AI: Ideal for chatbots, virtual assistants, and interactive applications where high-quality, coherent responses are crucial.
Benchmarking: A strong candidate for evaluating and comparing chat model performance in the sub-10B parameter category.
Applications requiring strong general chat abilities: Excels in generating human-like text across various conversational prompts.

Overview

Xwen-7B-Chat: Top-Performing Open-Source Chat Model

Key Capabilities & Performance

When to Use Xwen-7B-Chat

Full Model Card (README)