Yale-ROSE/Qwen3-4B-sft_dataset_gpt-sft-trl-v2

Text Generation · Model size: 4B · Quant: BF16 · Context length: 32k · Published: Sep 14, 2025 · Architecture: Transformer

Yale-ROSE/Qwen3-4B-sft_dataset_gpt-sft-trl-v2 is a 4-billion-parameter language model produced by Supervised Fine-Tuning (SFT) of Qwen/Qwen3-4B with the TRL library. It targets text generation, combining the base model's architecture with SFT training to produce coherent, contextually relevant responses, and is suited to applications that need instruction following.
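The card does not ship a usage snippet, so here is a minimal inference sketch using the standard transformers text-generation API. Only the repository id comes from this card; the dtype, device placement, and generation settings are illustrative assumptions you should adapt to your hardware.

```python
# Minimal quick-start sketch (assumes a recent transformers release and
# enough GPU/CPU memory for a 4B model in BF16).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Yale-ROSE/Qwen3-4B-sft_dataset_gpt-sft-trl-v2"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the BF16 quant listed on the card
    device_map="auto",
)

# Qwen3 checkpoints ship a chat template, so format the prompt as messages.
messages = [{"role": "user", "content": "Explain supervised fine-tuning in two sentences."}]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

output_ids = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated tokens, not the echoed prompt.
print(tokenizer.decode(output_ids[0][inputs.shape[-1]:], skip_special_tokens=True))
```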


Overview

This model, Yale-ROSE/Qwen3-4B-sft_dataset_gpt-sft-trl-v2, is a 4-billion-parameter language model built on the Qwen3-4B architecture. It was trained with Supervised Fine-Tuning (SFT) using the Hugging Face TRL library (version 0.23.0) to strengthen its instruction-following and text-generation capabilities. Training used Transformers 4.56.1, PyTorch 2.7.1, Datasets 3.6.0, and Tokenizers 0.22.0.
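The card names the TRL version but does not publish the training data or hyperparameters. The sketch below shows how an SFT run of this kind is typically set up with TRL's SFTTrainer; the dataset file, format, and every hyperparameter value are illustrative assumptions, not the authors' actual configuration.

```python
# Hedged sketch of a TRL SFT run like the one described above.
# Assumes a conversational JSONL dataset with a "messages" field,
# which is one of the formats SFTTrainer accepts out of the box.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# "sft_dataset_gpt.jsonl" is a hypothetical filename inferred from the model id.
dataset = load_dataset("json", data_files="sft_dataset_gpt.jsonl", split="train")

training_args = SFTConfig(
    output_dir="Qwen3-4B-sft_dataset_gpt-sft-trl-v2",
    per_device_train_batch_size=2,   # illustrative
    gradient_accumulation_steps=8,   # illustrative
    num_train_epochs=1,              # illustrative
    bf16=True,                       # consistent with the BF16 checkpoint
)

trainer = SFTTrainer(
    model="Qwen/Qwen3-4B",  # the base model named on this card
    args=training_args,
    train_dataset=dataset,
)
trainer.train()
```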

Key Capabilities

  • Instruction-following: SFT training enables the model to follow prompts and instructions when generating responses.
  • Text Generation: Produces coherent, contextually relevant text across a range of prompts.
  • Base Model: Builds on the architecture of Qwen/Qwen3-4B.

Good For

  • General Text Generation: Suitable for tasks requiring the creation of human-like text.
  • Question Answering: Can be used to answer open-ended questions based on its training.
  • Exploratory NLP Tasks: Ideal for developers experimenting with fine-tuned Qwen3-4B models for specific applications.