Hyeongwon/P2-split5_prob_Qwen3-4B-Base_0312-01
Text Generation · Concurrency Cost: 1 · Model Size: 4B · Quant: BF16 · Ctx Length: 32k · Published: Mar 13, 2026 · Architecture: Transformer · Warm
Hyeongwon/P2-split5_prob_Qwen3-4B-Base_0312-01 is a 4-billion-parameter language model, fine-tuned from Hyeongwon/Qwen3-4B-Base with Supervised Fine-Tuning (SFT) using the TRL framework. The model is intended for general text generation and builds on the foundational capabilities of the Qwen3-4B-Base architecture. It supports a 32,768-token (32k) context length, making it suitable for applications that process moderately long inputs and generate coherent responses.
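A minimal inference sketch, assuming the checkpoint is published on the Hugging Face Hub under the model ID above and loads with the standard transformers causal-LM API (requires transformers, torch, and accelerate; the prompt and sampling parameters are illustrative, not recommendations from the model authors):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Hyeongwon/P2-split5_prob_Qwen3-4B-Base_0312-01"

# Load the tokenizer and the model in BF16, matching the listed quantization.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # BF16 weights, per the model metadata
    device_map="auto",           # place weights on available GPU(s)/CPU automatically
)

# Illustrative prompt; the base model is a plain text-generation model.
prompt = "Explain the difference between pretraining and supervised fine-tuning."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Generate up to 256 new tokens with generic sampling settings (assumed defaults).
outputs = model.generate(
    **inputs,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.7,
    top_p=0.9,
)

# Decode only the newly generated tokens, skipping the echoed prompt.
generated = outputs[0][inputs["input_ids"].shape[-1]:]
print(tokenizer.decode(generated, skip_special_tokens=True))
```

The 32k context length applies to prompt plus generated tokens combined, so long inputs should leave headroom for `max_new_tokens`.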