reaperdoesntknow/Dualmind-Qwen-1.7B-Thinking

TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Mar 30, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

The Dualmind-Qwen-1.7B-Thinking model, developed by Convergent Intelligence LLC: Research Division, is a 2.03 billion parameter Qwen3ForCausalLM architecture with a 40,960 token context length. It is specifically fine-tuned using the DualMind SFT methodology on over 2.5 million tokens of Claude Opus 4.6 reasoning traces. This model excels at extended deliberation and self-correction, absorbing the nuanced reasoning patterns of a frontier model rather than just pattern completion, making it suitable for tasks requiring complex, multi-phase thought processes.

Loading preview...

Dualmind-Qwen-1.7B-Thinking: Deliberative Reasoning at 1.7B Parameters

This model, developed by Convergent Intelligence LLC: Research Division, is a 1.7 billion parameter Qwen3ForCausalLM variant specifically trained to emulate the extended reasoning and self-correction patterns of Claude Opus 4.6. Utilizing the DualMind SFT methodology, it was fine-tuned on over 2.5 million tokens from the Opus-4.6-Reasoning-3000x-filtered dataset, which captures genuine uncertainty navigation and deliberative structures.

Key Capabilities & Features

  • Opus-like Reasoning: Absorbs the nuanced reasoning patterns of Claude Opus 4.6, including backtracking, hedging, reconsidering, and synthesizing across multiple approaches.
  • DualMind SFT Methodology: Leverages a unique Supervised Fine-Tuning approach to instill a "cognitive loop" (explore → examine → respond) into the model.
  • Robust Base Model: Built upon Disctil-Qwen3-1.7B, a DISC-refined model, providing a strong structural foundation.
  • Extended Context: Supports a maximum context length of 40,960 tokens, allowing for long, complex reasoning chains.
  • Training Dynamics: Achieved a 6.8% gain in token accuracy during training, indicating genuine absorption of reasoning structure without overfitting.

Ideal Use Cases

  • Complex Problem Solving: Suited for tasks requiring multi-phase reasoning, where the model needs to explore, examine, and self-correct.
  • Deliberative AI: Applications benefiting from models that can articulate their thought process, navigate uncertainty, and synthesize information.
  • Cognitive Simulation: Research into how smaller models can mimic the deliberative intelligence of larger, frontier models.
  • Advanced Instruction Following: Scenarios where instructions require more than direct pattern matching, demanding a deeper understanding and adaptive response.