TeichAI/Qwen3-4B-Thinking-2507-DeepSeek-v3.2-Speciale-Code-Distill
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Dec 7, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

TeichAI/Qwen3-4B-Thinking-2507-DeepSeek-v3.2-Speciale-Code-Distill is a 4 billion parameter Qwen3-based language model developed by TeichAI, fine-tuned from unsloth/qwen3-4b-thinking-2507-unsloth-bnb-4bit. This model leverages Unsloth for 2x faster training, making it an efficient option for applications requiring a compact yet capable model. With a 32768 token context length, it is optimized for tasks benefiting from extended input sequences.

Loading preview...