Name: Hyeongwon/P2-split2_prob_rg_v2_Qwen3-4B-Base-0416 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Hyeongwon

Model Overview

Hyeongwon/P2-split2_prob_rg_v2_Qwen3-4B-Base-0416 is a 4-billion-parameter language model fine-tuned from Hyeongwon/Qwen3-4B-Base, which serves as a general-purpose foundation for natural language processing tasks. Fine-tuning was performed with Supervised Fine-Tuning (SFT) using the TRL (Transformer Reinforcement Learning) library.
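
As a rough illustration of what SFT with TRL looks like, the sketch below fine-tunes the base checkpoint on a tiny placeholder dataset. The training data, hyperparameters, and output directory are assumptions made for illustration; none of them come from this card, and the sketch is not the author's actual training script. The helper names `toy_records` and `run_sft` are hypothetical.

```python
BASE_MODEL = "Hyeongwon/Qwen3-4B-Base"  # base checkpoint as named in this card


def toy_records() -> list[dict]:
    """Placeholder examples; the real training data is not described in the card."""
    return [
        {"text": "Question: What is 2 + 3?\nAnswer: 5"},
        {"text": "Question: What is the capital of France?\nAnswer: Paris"},
    ]


def run_sft(output_dir: str = "sft-output") -> None:
    """Run a minimal SFT pass. Downloads the base model and needs substantial memory."""
    # Imported lazily so the module can be inspected without trl/datasets installed.
    from datasets import Dataset
    from trl import SFTConfig, SFTTrainer

    # A dataset with a single "text" column is treated as plain language-modeling data.
    train_dataset = Dataset.from_list(toy_records())

    # Illustrative settings only; the card does not report hyperparameters.
    config = SFTConfig(
        output_dir=output_dir,
        num_train_epochs=1,
        per_device_train_batch_size=1,
    )

    trainer = SFTTrainer(model=BASE_MODEL, args=config, train_dataset=train_dataset)
    trainer.train()
    trainer.save_model(output_dir)
```

Calling `run_sft()` starts training; in practice the placeholder records would be replaced with a real dataset.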

Key Capabilities

Text Generation: Capable of generating coherent and contextually relevant text based on given prompts.
Fine-tuned Performance: SFT adapts the base model toward its training objective, which may improve performance on the targeted tasks or domains relative to the base model.
Qwen3 Architecture: Built on the Qwen3 family of general-purpose language models for language understanding and generation.
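
The text generation capability listed above can be tried locally with the Transformers library. The following is a minimal sketch, assuming the weights can be downloaded from the Hugging Face Hub under the model name and that enough memory is available for a 4B-parameter model. The sampling values are illustrative, not recommended settings, and `generation_kwargs` and `generate` are hypothetical helper names.

```python
MODEL_ID = "Hyeongwon/P2-split2_prob_rg_v2_Qwen3-4B-Base-0416"


def generation_kwargs(max_new_tokens: int = 64, temperature: float = 0.7) -> dict:
    """Sampling settings passed to model.generate(); values are illustrative."""
    return {
        "max_new_tokens": max_new_tokens,
        "do_sample": True,
        "temperature": temperature,
        "top_p": 0.9,
    }


def generate(prompt: str, **overrides) -> str:
    """Load the model and return only the newly generated continuation."""
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(prompt, return_tensors="pt")
    settings = {**generation_kwargs(), **overrides}
    output_ids = model.generate(**inputs, **settings)

    # Drop the prompt tokens so only the continuation is decoded.
    prompt_length = inputs["input_ids"].shape[1]
    return tokenizer.decode(output_ids[0][prompt_length:], skip_special_tokens=True)
```

For example, `generate("The three primary colors are", max_new_tokens=32)` returns a sampled continuation; because sampling is enabled, the output varies between runs.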

Training Details

The model was trained with TRL 0.25.1, Transformers 4.57.3, PyTorch 2.6.0, Datasets 3.6.0, and Tokenizers 0.22.2, a standard and well-supported stack for fine-tuning large language models. The fine-tuning approach is supervised, adapting the base model to specific objectives.
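
To reproduce a similar environment, it can help to compare the locally installed packages against the versions listed above. The sketch below uses only the standard library; the distribution names (for example `torch` for PyTorch) are the usual PyPI names and are an assumption here, and `compare_versions` is a hypothetical helper.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions as listed in the training details; keys are assumed PyPI distribution names.
PINNED = {
    "trl": "0.25.1",
    "transformers": "4.57.3",
    "torch": "2.6.0",
    "datasets": "3.6.0",
    "tokenizers": "0.22.2",
}


def compare_versions(pinned: dict = PINNED) -> dict:
    """Return {package: (expected, installed or None, matches)} for each pinned package."""
    report = {}
    for package, expected in pinned.items():
        try:
            installed = version(package)
        except PackageNotFoundError:
            installed = None
        # Ignore local build suffixes such as "+cu124" when comparing.
        matches = installed is not None and installed.split("+")[0] == expected
        report[package] = (expected, installed, matches)
    return report


def format_report(report: dict) -> str:
    """Render the comparison as one line per package."""
    lines = []
    for package, (expected, installed, matches) in report.items():
        status = "ok" if matches else "differs"
        lines.append(f"{package}: expected {expected}, installed {installed} ({status})")
    return "\n".join(lines)
```

Printing `format_report(compare_versions())` shows which packages differ from the listed versions; a mismatch does not necessarily prevent the model from loading.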
