Name: koutch/qwen_2.json_train_grpo_v1_train_code API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: koutch

Overview

koutch/qwen_2.json_train_grpo_v1_train_code is a 4 billion parameter language model, fine-tuned by koutch. It is based on the Qwen3 architecture and was specifically trained from the unsloth/qwen3-4b-instruct-2507-unsloth-bnb-4bit model.

Key Capabilities

Efficient Training: This model was trained 2x faster by leveraging Unsloth and Huggingface's TRL library, making it a cost-effective and time-efficient solution for instruction-following tasks.
Qwen3 Foundation: Built upon the robust Qwen3 architecture, it inherits strong general language understanding and generation capabilities.

Good for

Developers seeking a 4B parameter model with an efficient training history.
Applications requiring a capable instruction-tuned model for various natural language processing tasks.
Experimentation with models fine-tuned using Unsloth for faster iteration cycles.

Overview

Overview

Key Capabilities

Good for

Full Model Card (README)