ertghiu256/Qwen3-4B-Thinking-2507-Hermes-3
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Aug 31, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm
The ertghiu256/Qwen3-4B-Thinking-2507-Hermes-3 is a 4 billion parameter Qwen3-based language model, fine-tuned with the Hermes 3 dataset. This model is specifically designed to retain strong reasoning capabilities and improve instruction following. It features a notable context length of 40960 tokens, making it suitable for tasks requiring extensive contextual understanding and precise responses.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–