koutch/qwenb_2.json_train_dpo_v2_train_code
koutch/qwenb_2.json_train_dpo_v2_train_code is an 8-billion-parameter Qwen3-based causal language model developed by koutch, fine-tuned using Unsloth and Hugging Face's TRL library. Unsloth enables roughly 2x faster fine-tuning than standard training loops. The model is designed for general language generation tasks and supports a 32768-token context window.
Model Overview
This model, developed by koutch, is an 8-billion-parameter Qwen3-based causal language model. It was fine-tuned from unsloth/qwen3-8b-unsloth-bnb-4bit using the Unsloth library and Hugging Face's TRL (Transformer Reinforcement Learning) library.
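The model can be loaded like any other Hugging Face causal LM checkpoint. A minimal loading sketch, assuming the `transformers` library is installed (weights are downloaded from the Hub on first use; parameter choices such as `device_map="auto"` are illustrative, not prescribed by this card):

```python
# Minimal loading sketch for this model card. Assumes `transformers`
# (and a PyTorch backend) are installed; nothing is downloaded until
# load_model() is actually called.

MODEL_ID = "koutch/qwenb_2.json_train_dpo_v2_train_code"

def load_model():
    # Imported lazily so the constant above can be reused without
    # pulling in transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        device_map="auto",   # spread layers across available GPUs/CPU
        torch_dtype="auto",  # use the checkpoint's stored precision
    )
    return model, tokenizer
```

After loading, generation follows the usual `tokenizer(...)` then `model.generate(...)` pattern; for an 8B model, a GPU with sufficient memory (or a quantized variant) is advisable.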
Key Characteristics
- Architecture: Based on the Qwen3 model family.
- Parameter Count: 8 billion parameters, offering a balance between performance and computational efficiency.
- Training Efficiency: Fine-tuned with Unsloth, enabling roughly 2x faster training compared to standard methods.
- Context Length: Supports a substantial context window of 32768 tokens, suitable for processing longer inputs and generating coherent, extended outputs.
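One practical consequence of the 32768-token context window is that longer prompts must be trimmed so the prompt plus the generation budget still fits. A minimal sketch of left-truncation (the helper name is hypothetical; tokenizers can also do this via their own truncation options):

```python
MAX_CONTEXT = 32768  # Qwen3-8B context window stated above

def fit_to_context(token_ids, max_new_tokens, max_context=MAX_CONTEXT):
    """Trim token ids from the left so prompt + generation fits the window."""
    budget = max_context - max_new_tokens
    if budget <= 0:
        raise ValueError("max_new_tokens exceeds the context window")
    # Keep the most recent tokens; the oldest context is dropped first.
    return token_ids[-budget:]
```

Dropping from the left keeps the most recent context, which is usually what matters for coherent continuation.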
Use Cases
This model is suitable for a variety of natural language processing tasks, particularly where efficient fine-tuning and a robust base model are beneficial. Its Qwen3 foundation makes it a strong candidate for general text generation and understanding, and, given its training context, potentially for code-related tasks as well.
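When prompting the model conversationally, Qwen-family chat models typically expect the ChatML message format. In practice `tokenizer.apply_chat_template` handles this automatically, but a minimal sketch of the format itself (the function name is hypothetical) may clarify what the template produces:

```python
def to_chatml(messages):
    """Render a list of {"role", "content"} dicts in the ChatML format
    commonly used by Qwen-family chat models."""
    parts = [
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>"
        for m in messages
    ]
    # A trailing assistant header cues the model to generate its reply.
    parts.append("<|im_start|>assistant\n")
    return "\n".join(parts)
```

For real use, prefer the tokenizer's built-in chat template, which is guaranteed to match the checkpoint's training format.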