Hyeongwon/P2-split1_prob_Qwen3-4B-Base_0312-01
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 12, 2026Architecture:Transformer Warm
Hyeongwon/P2-split1_prob_Qwen3-4B-Base_0312-01 is a 4 billion parameter causal language model developed by Hyeongwon, fine-tuned from the Qwen3-4B-Base architecture. This model was trained using Supervised Fine-Tuning (SFT) with the TRL framework. It is designed for general text generation tasks, building upon its base model's capabilities with specific fine-tuning.
Loading preview...