koutch/qwenb_qwen3-8b_train_grpo_v2_train_code
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Feb 7, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The koutch/qwenb_qwen3-8b_train_grpo_v2_train_code is an 8 billion parameter Qwen3 model developed by koutch. This model was fine-tuned using Unsloth and Huggingface's TRL library, resulting in a 2x faster training process. It is specifically optimized for code-related tasks, leveraging its Qwen3 architecture for enhanced performance in programming contexts. The model is licensed under Apache-2.0.

Loading preview...