TeichAI/Qwen3-4B-Thinking-2507-Kimi-K2-Thinking-Distill
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kLicense:apache-2.0Architecture:Transformer0.0K Open Weights Warm
TeichAI/Qwen3-4B-Thinking-2507-Kimi-K2-Thinking-Distill is a Qwen3-based language model developed by TeichAI, fine-tuned from unsloth/Qwen3-4B-Thinking-2507. This model was specifically trained on 1000 examples from MoonshotAI's Kimi k2 thinking dataset. It leverages Unsloth and Huggingface's TRL library for accelerated training, making it optimized for tasks requiring 'thinking' or reasoning capabilities.
Loading preview...