Name: Hyeongwon/P2-split2_prob_rg_Qwen3-4B-Base API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Hyeongwon

Model Overview

Hyeongwon/P2-split2_prob_rg_Qwen3-4B-Base is a 4 billion parameter language model, fine-tuned from the base model Hyeongwon/Qwen3-4B-Base. This model leverages the TRL (Transformer Reinforcement Learning) library for its training process, specifically utilizing Supervised Fine-Tuning (SFT).

Key Characteristics

Base Model: Fine-tuned from Hyeongwon/Qwen3-4B-Base.
Parameter Count: 4 billion parameters.
Context Length: Supports a substantial context window of 32768 tokens.
Training Method: Trained using Supervised Fine-Tuning (SFT) with the TRL framework.

Usage

This model is suitable for various text generation tasks. A quick start example is provided for generating responses to user prompts using the transformers pipeline. The training procedure and framework versions, including TRL 0.25.1 and Transformers 4.57.3, are detailed for reproducibility.

Overview

Model Overview

Key Characteristics

Usage

Full Model Card (README)