ertghiu256/Qwen3-4B-Thinking-2507-Hermes-3
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Aug 31, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

The ertghiu256/Qwen3-4B-Thinking-2507-Hermes-3 is a 4 billion parameter Qwen3-based language model, fine-tuned with the Hermes 3 dataset. This model is specifically designed to retain strong reasoning capabilities and improve instruction following. It features a notable context length of 40960 tokens, making it suitable for tasks requiring extensive contextual understanding and precise responses.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p