Rustamshry/Qwen3-8B-gpt-5.4-Reasoning-Distilled

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Apr 1, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Rustamshry/Qwen3-8B-gpt-5.4-Reasoning-Distilled is an 8 billion parameter model based on the Qwen3-8B architecture, specifically distilled from elite-level synthetic data generated by GPT-5.4. This model is specialized for "Long-Chain Thought" (CoT) reasoning, excelling in complex problems across Mathematics, Coding, and Medicine. It is fine-tuned using Unsloth + QLoRa techniques to transform the base model into a reasoning specialist.

Loading preview...

Qwen3-8B-gpt-5.4-Reasoning-Distilled Overview

This model is an 8 billion parameter variant of the Qwen3-8B base, specifically engineered by Rustamshry to excel in complex reasoning tasks. It has been distilled from high-quality synthetic data generated by GPT-5.4, focusing on "Long-Chain Thought" (CoT) reasoning capabilities.

Key Capabilities

  • Specialized Reasoning: Transforms the general-purpose Qwen3-8B into a specialist for intricate, multi-step reasoning.
  • Grandmaster-level Problem Solving: Designed to handle advanced problems in Mathematics, Coding, and Medicine.
  • GPT-5.4 Data Distillation: Benefits from training on an elite synthetic reasoning corpus (PT-5.4 Synthetic Reasoning Corpus) curated via an agentic "Master-Architect" workflow, ensuring high-quality, logic-dense training data.
  • Efficient Fine-tuning: Utilizes Unsloth + QLoRa techniques for efficient adaptation.

Ideal Use Cases

  • Complex Problem Solving: Applications requiring deep, multi-step logical deduction.
  • Technical Assistance: Generating solutions or explanations for advanced mathematical, coding, or medical scenarios.
  • Research & Development: As a powerful reasoning engine for AI-driven research in specialized domains.