phammminhhieu/qwen3_0.6B_Claude_4.5_distill
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:Feb 14, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The phammminhhieu/qwen3_0.6B_Claude_4.5_distill is a 0.8 billion parameter Qwen3-based causal language model developed by phammminhhieu. This model was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. It features a context length of 40960 tokens, making it suitable for tasks requiring extensive context understanding. Its primary differentiator is the optimized training process for efficiency.

Loading preview...