Hyeongwon/P2-split2_reasoning_only_Qwen3-4B-Base_0424-bs64-epoch3

Text Generation | Concurrency Cost: 1 | Model Size: 4B | Quant: BF16 | Ctx Length: 32k | Published: Apr 24, 2026 | Architecture: Transformer | Cold

Hyeongwon/P2-split2_reasoning_only_Qwen3-4B-Base_0424-bs64-epoch3 is a 4-billion-parameter language model fine-tuned from Hyeongwon/Qwen3-4B-Base using TRL. It targets reasoning tasks, keeps the Qwen3 architecture of its base model, and supports a context length of 32768 tokens. It is intended for applications that need analytical and logical inference.


Overview

P2-split2_reasoning_only_Qwen3-4B-Base_0424-bs64-epoch3 is a 4-billion-parameter language model developed by Hyeongwon. It is a fine-tuned version of Hyeongwon/Qwen3-4B-Base, trained with Supervised Fine-Tuning (SFT) using the TRL (Transformer Reinforcement Learning) library.
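For readers unfamiliar with TRL, supervised fine-tuning of this kind can be sketched roughly as follows. This is not the author's training script. The dataset is a hypothetical placeholder, and the effective batch size of 64 and the 3 epochs are only guesses read from the `bs64-epoch3` suffix of the model name, which the card does not confirm. `SFTConfig` and `SFTTrainer` are TRL classes; `grad_accum_steps` and `build_trainer` are helper names introduced here for illustration.

```python
def grad_accum_steps(effective_batch: int, per_device_batch: int, num_devices: int = 1) -> int:
    """Gradient-accumulation steps needed to reach a target effective batch size."""
    per_step = per_device_batch * num_devices
    if effective_batch % per_step != 0:
        raise ValueError("effective batch must be a multiple of per_device_batch * num_devices")
    return effective_batch // per_step


def build_trainer(train_dataset, per_device_batch: int = 8, num_devices: int = 1):
    """Build a TRL SFTTrainer. Hyperparameters are assumptions, not values from the card."""
    # Imported lazily so the helper above works without TRL installed (pip install trl).
    from trl import SFTConfig, SFTTrainer

    config = SFTConfig(
        output_dir="sft-output",  # hypothetical output path
        num_train_epochs=3,  # assumed from "epoch3" in the model name
        per_device_train_batch_size=per_device_batch,
        # Assumed effective batch size of 64, from "bs64" in the model name.
        gradient_accumulation_steps=grad_accum_steps(64, per_device_batch, num_devices),
        bf16=True,  # the card lists BF16
    )
    return SFTTrainer(
        model="Hyeongwon/Qwen3-4B-Base",  # base model named in the card
        args=config,
        train_dataset=train_dataset,  # hypothetical dataset supplied by the caller
    )
```

With 8 samples per device on a single GPU, `grad_accum_steps(64, 8)` gives 8 accumulation steps, so each optimizer update would see 64 samples under these assumptions.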

Key Capabilities

  • Reasoning Focus: The model is fine-tuned for reasoning-only tasks, with the aim of improving logical inference and problem solving.
  • Base Architecture: Built on Qwen3-4B-Base, it uses the Qwen3 architecture.
  • Context Length: Supports a context window of 32768 tokens, enough to process long inputs such as extended dialogues or documents.
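
As a minimal sketch of working within the 32768-token window, the snippet below loads the model with Hugging Face `transformers` and caps generation so that the prompt plus the new tokens fit inside the context. It assumes the checkpoint can be downloaded from the Hugging Face Hub under the ID shown in this card and that it loads through the standard `AutoModelForCausalLM` path. `max_new_tokens_for` and `generate` are helper names introduced here, not part of any library.

```python
MODEL_ID = "Hyeongwon/P2-split2_reasoning_only_Qwen3-4B-Base_0424-bs64-epoch3"
CONTEXT_LENGTH = 32768  # context window stated in the card


def max_new_tokens_for(prompt_tokens: int, requested: int,
                       context_length: int = CONTEXT_LENGTH) -> int:
    """Clamp the generation budget so prompt + output stays inside the window."""
    if prompt_tokens >= context_length:
        raise ValueError("prompt alone fills or exceeds the context window")
    return min(requested, context_length - prompt_tokens)


def generate(prompt: str, requested_new_tokens: int = 1024) -> str:
    """Generate a completion; assumes the model ID resolves on the Hub."""
    # Lazy imports: needs transformers, torch, and accelerate (for device_map="auto").
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16, device_map="auto"
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    prompt_len = inputs["input_ids"].shape[1]
    budget = max_new_tokens_for(prompt_len, requested_new_tokens)
    output_ids = model.generate(**inputs, max_new_tokens=budget)
    # Decode only the newly generated tokens, not the echoed prompt.
    return tokenizer.decode(output_ids[0, prompt_len:], skip_special_tokens=True)
```

Because the card describes a fine-tune of a base model and does not document a chat template, the sketch passes a plain text prompt rather than a chat-formatted one.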

Good For

  • Applications requiring strong analytical and logical reasoning.
  • Tasks that benefit from a large context window for complex problem statements.
  • Developers looking for a model specialized for workloads heavy on logical inference.