TeichAI/Qwen3-4B-Thinking-2507-GPT-5.1-Codex-Max-Distill
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Dec 20, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm
TeichAI/Qwen3-4B-Thinking-2507-GPT-5.1-Codex-Max-Distill is a 4 billion parameter Qwen3 model developed by TeichAI, fine-tuned from unsloth/qwen3-4b-thinking-2507. This model was trained 2x faster using Unsloth and Huggingface's TRL library, indicating an optimization for efficient training. With a 32768 token context length, it is designed for applications requiring substantial input processing. Its specific "Thinking" and "Codex Max Distill" naming suggests a focus on reasoning capabilities and code-related tasks.
Loading preview...