Overview

DualMinded-Qwen3-1.7B is a 1.7 billion parameter model from Convergent Intelligence LLC: Research Division, built on the Qwen3 architecture. It features a novel dual-cognition approach, implementing a three-phase cognitive loop: <explore> for unconstrained reasoning, <examine> for self-critique and error detection, and <response> for synthesizing a clean answer. This dialectical structure is achieved through role-conditioned generation on shared weights, without additional parameters or routing.

Key Capabilities & Training

Dual-Cognition Architecture: Employs an explore, examine, respond loop for enhanced reasoning and self-correction.
Advanced Distillation: Trained using a four-stage pipeline including Multi-Teacher Distillation (from Qwen3-30B-A3B variants), Discrepancy Calculus (DISC) Refinement, and Topological Knowledge Distillation (TKD).
Opus 4.6 Reasoning Traces: Fine-tuned on the Opus-4.6-Reasoning-3000x-filtered dataset, directly mapping thinking to <explore> and splitting solution into <examine> + <response>.
Emergent Behaviors: Exhibits "ghost imprinting" from sequential distillation, leading to emergent capabilities not present in individual teachers.
Mathematical Foundations: Grounded in Discrepancy Calculus, a measure-theoretic framework that preserves structural boundaries during knowledge distillation.

Good For

Tasks requiring extended reasoning and creative derivation.
Applications benefiting from self-critique and refined outputs.
Scenarios where a small, efficient model (1.7B parameters) needs advanced reasoning capabilities.
Exploring models trained with novel distillation techniques like DISC and TKD.