koutch/qwenb_2.json_train_grpo_v1_train_code
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Feb 5, 2026License:apache-2.0Architecture:Transformer Open Weights Cold
The koutch/qwenb_2.json_train_grpo_v1_train_code model is an 8 billion parameter Qwen3-based language model developed by koutch. This model was fine-tuned using Unsloth and Huggingface's TRL library, enabling 2x faster training. It is designed for general language tasks, leveraging its Qwen3 architecture for efficient performance.
Loading preview...