ertghiu256/Qwen3-Hermes-4b
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Jul 25, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm
ertghiu256/Qwen3-Hermes-4b is a 4-billion parameter Qwen 3 model fine-tuned on the Hermes 3 dataset. This model enhances general chatting capabilities while retaining Qwen's strong reasoning abilities. It is designed for conversational AI applications where both coherent dialogue and logical inference are important.
Loading preview...
Model Overview
ertghiu256/Qwen3-Hermes-4b is a 4-billion parameter language model based on the Qwen 3 architecture. It has been specifically fine-tuned using the Hermes 3 dataset. This fine-tuning process aims to improve the model's general conversational abilities, making it more adept at engaging in natural and coherent dialogue.
Key Capabilities
- Enhanced Chatting: The fine-tuning on the Hermes 3 dataset significantly boosts its general conversational performance.
- Retained Reasoning: Despite the focus on chat, the model maintains the strong reasoning capabilities inherent to the base Qwen architecture.
- Flexible Deployment: The model supports various deployment methods, including
transformers,vllm,sglang,llama.cpp, andollama, making it accessible for different development environments. - Reasoning Mode: When deployed with
vllmorsglang, it can leverage a reasoning parser (e.g.,deepseek_r1) to enable explicit "thinking" content alongside its generated responses, which can be useful for debugging or understanding its decision-making process.
Good For
- Applications requiring a balance between conversational fluency and logical reasoning.
- Developers looking for a 4B parameter model with good general-purpose chat capabilities.
- Use cases where the ability to inspect the model's "thinking" process is beneficial.