koutch/qwenb_qwen3-8b_train_grpo_v1_train_code
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Feb 5, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The koutch/qwenb_qwen3-8b_train_grpo_v1_train_code is an 8 billion parameter Qwen3 model, fine-tuned by koutch. This model was trained using Unsloth and Huggingface's TRL library, enabling a 2x faster training process. It is designed for general language tasks, leveraging its Qwen3 architecture and efficient training methodology.

Loading preview...