Name: Hyeongwon/P2-split4_prob_Qwen3-8B-Base_0325-01 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Hyeongwon

Model Overview

Hyeongwon/P2-split4_prob_Qwen3-8B-Base_0325-01 is an 8 billion parameter language model, fine-tuned from the ChuGyouk/Qwen3-8B-Base architecture. This model was developed using the TRL library and specifically trained with Supervised Fine-Tuning (SFT) techniques.

Key Characteristics

Base Model: Fine-tuned from ChuGyouk/Qwen3-8B-Base.
Training Method: Utilizes Supervised Fine-Tuning (SFT).
Context Length: Supports a context window of 32,768 tokens.
Frameworks: Developed with TRL (0.25.1), Transformers (4.57.3), Pytorch (2.6.0), Datasets (3.6.0), and Tokenizers (0.22.2).

Intended Use Cases

This model is suitable for a variety of text generation tasks, particularly those benefiting from its fine-tuned nature. Developers can integrate it using the Hugging Face transformers pipeline for applications such as:

Generating conversational responses.
Creative writing and content generation.
Answering open-ended questions, as demonstrated in the quick start example.

Further details on the training process and metrics can be explored via the associated Weights & Biases run.

Overview

Model Overview

Key Characteristics

Intended Use Cases

Full Model Card (README)