Keven16/Qwen3-4B-Non-Thinking-RL-Code-Step300
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 16, 2026License:apache-2.0Architecture:Transformer Open Weights Loading
Keven16/Qwen3-4B-Non-Thinking-RL-Code-Step300 is a 4 billion parameter language model based on the Qwen3 architecture, featuring a 32768 token context length. This model is specifically fine-tuned for code generation tasks using Reinforcement Learning (RL) techniques. Its primary strength lies in producing functional and contextually relevant code snippets, making it suitable for developer assistance and automated programming. The model's design focuses on practical coding applications rather than general conversational abilities.
Loading preview...