koutch/qwenb_2.json_train_dpo_v2_train_code
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Feb 5, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The koutch/qwenb_2.json_train_dpo_v2_train_code is an 8 billion parameter Qwen3-based causal language model developed by koutch, fine-tuned using Unsloth and Huggingface's TRL library. This model is optimized for efficient training, achieving 2x faster finetuning. It is designed for general language generation tasks with a 32768 token context length.

Loading preview...