razy101/qwen3-0.6b-gpt4-distilled-v2
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:Apr 7, 2026License:apache-2.0Architecture:Transformer Open Weights Loading

The razy101/qwen3-0.6b-gpt4-distilled-v2 is a 0.8 billion parameter Qwen3-based causal language model developed by razy101. This model was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. It features a 32768 token context length and is optimized for tasks benefiting from efficient, distilled models.

Loading preview...