Hyeongwon/P19-split3-prob-9x-bs256-lr1e5-zero3-ep3

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 7, 2026Architecture:Transformer Warm

Hyeongwon/P19-split3-prob-9x-bs256-lr1e5-zero3-ep3 is a 4 billion parameter language model, fine-tuned from Hyeongwon/Qwen3-4B-Base using the TRL library. This model, with a 32768 token context length, was trained with Supervised Fine-Tuning (SFT) to enhance its conversational capabilities. It is designed for general text generation tasks, particularly in response to user prompts.

Loading preview...

Model Overview

Hyeongwon/P19-split3-prob-9x-bs256-lr1e5-zero3-ep3 is a 4 billion parameter language model, fine-tuned from the base model Hyeongwon/Qwen3-4B-Base. This model leverages a 32768 token context length, making it suitable for processing longer inputs and generating comprehensive responses. The fine-tuning process utilized the TRL library and was conducted using Supervised Fine-Tuning (SFT) methods.

Key Capabilities

  • Text Generation: Capable of generating coherent and contextually relevant text based on user prompts.
  • Conversational AI: Designed to respond to questions and engage in text-based interactions.
  • Foundation Model: Built upon the Qwen3-4B-Base architecture, providing a robust foundation for various NLP tasks.

Training Details

The model was trained with SFT, and its development process can be visualized via Weights & Biases. Key framework versions used include TRL 0.25.1, Transformers 4.57.3, Pytorch 2.9.1, Datasets 3.6.0, and Tokenizers 0.22.2.

Use Cases

This model is suitable for applications requiring general-purpose text generation, such as chatbots, content creation, and interactive question-answering systems.