rikunarita/Qwen3-4B-Thinking-2507-Genius
Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 6, 2026Architecture:Transformer0.0K Warm

rikunarita/Qwen3-4B-Thinking-2507-Genius is a merged language model based on Qwen3-4B-Thinking-2507, created using the Model Stock method. This 4 billion parameter model integrates components from RioLee/ToolRM-Gen-Qwen3-4B-Thinking-2507 and TeichAI/Qwen3-4B-Thinking-2507-Gemini-3-Pro-Preview-High-Reasoning-Distill. It is designed to enhance reasoning capabilities and potentially improve tool-use reward modeling, making it suitable for tasks requiring advanced cognitive functions.

Loading preview...

Model Overview

rikunarita/Qwen3-4B-Thinking-2507-Genius is a merged language model built upon the Qwen3-4B-Thinking-2507 base. This model was constructed using the Model Stock merging method, a technique designed to combine the strengths of multiple pre-trained models.

Key Components and Capabilities

This merge incorporates two distinct models to enhance its performance:

  • RioLee/ToolRM-Gen-Qwen3-4B-Thinking-2507: This component likely contributes to improved tool-use reward modeling, suggesting capabilities in understanding and generating responses related to tool interaction or agentic behavior. For more details, refer to the ToolRM research paper.
  • TeichAI/Qwen3-4B-Thinking-2507-Gemini-3-Pro-Preview-High-Reasoning-Distill: This integration aims to imbue the model with high reasoning capabilities, potentially distilled from advanced models like Gemini 3 Pro. This suggests an emphasis on complex problem-solving and logical inference.

Intended Use Cases

Given its merged components, this model is particularly well-suited for applications requiring:

  • Enhanced reasoning: Tasks that demand logical deduction, problem-solving, and complex cognitive processing.
  • Agentic tool-use: Scenarios where the model needs to understand, generate, or evaluate interactions with external tools or systems.
  • Advanced cognitive tasks: Use cases benefiting from a blend of strong reasoning and potential tool-use understanding.