Hyeongwon/P9-split3_prob_Qwen3-4B-Base_0322-01
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 21, 2026Architecture:Transformer Warm

Hyeongwon/P9-split3_prob_Qwen3-4B-Base_0322-01 is a 4 billion parameter causal language model developed by Hyeongwon, fine-tuned from the Qwen3-4B-Base architecture. This model was trained using Supervised Fine-Tuning (SFT) with the TRL framework, offering a 32768 token context length. It is designed for text generation tasks, building upon its base model's capabilities.

Loading preview...