alwaysgood/qwen3-st2
The alwaysgood/qwen3-st2 model is a 4 billion parameter, instruction-tuned causal language model, fine-tuned from alwaysgood/qwen3-st1. Developed by alwaysgood, this model leverages the Qwen3 architecture and has a context length of 32768 tokens. It is specifically trained using Supervised Fine-Tuning (SFT) with the TRL framework, making it suitable for general text generation tasks based on user prompts.
Model Overview
The alwaysgood/qwen3-st2 model is a 4 billion parameter language model and a fine-tuned iteration of the alwaysgood/qwen3-st1 base model. It is built on the Qwen3 architecture and supports a context length of 32768 tokens, enabling it to process and generate longer sequences of text.
Training Details
This model was developed by alwaysgood and underwent Supervised Fine-Tuning (SFT) using the TRL (Transformer Reinforcement Learning) library. The training process utilized specific versions of key frameworks, including TRL 0.24.0, Transformers 5.5.4, PyTorch 2.9.0+cu128, Datasets 4.3.0, and Tokenizers 0.22.2. The training run can be visualized via Weights & Biases.
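An SFT run with TRL along the lines described above might be set up as follows. This is a minimal sketch only: the dataset, output directory, and hyperparameters are illustrative placeholders, not the actual recipe used to train this model.

```python
# Illustrative SFT configuration sketch using TRL's SFTTrainer.
# The dataset and hyperparameters are placeholders, NOT the actual
# settings used to produce alwaysgood/qwen3-st2.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Any chat-formatted dataset with a "messages" column works here;
# this particular dataset is an arbitrary example.
dataset = load_dataset("trl-lib/Capybara", split="train")

training_args = SFTConfig(
    output_dir="qwen3-st2-sft",  # hypothetical output path
    max_length=32768,            # matches the model's context window
    report_to="wandb",           # log the run to Weights & Biases
)

trainer = SFTTrainer(
    model="alwaysgood/qwen3-st1",  # the base model this card names
    args=training_args,
    train_dataset=dataset,
)
trainer.train()
```

Passing the base model as a string lets SFTTrainer handle model and tokenizer loading itself; the chat template of the base model is applied to the `messages` column automatically.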
Key Capabilities
- Instruction Following: Designed to generate text based on user-provided instructions or prompts.
- Text Generation: Capable of producing coherent and contextually relevant text for various applications.
- Extended Context: Benefits from a 32K token context window, allowing for more detailed and lengthy interactions.
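Prompting the model for the tasks above follows the standard transformers chat-template workflow for instruction-tuned causal LMs; the generation settings below are illustrative defaults, not recommendations from the model card.

```python
# Inference sketch for alwaysgood/qwen3-st2 via the standard
# transformers chat-template workflow. Generation settings are
# illustrative, not tuned values from the model card.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "alwaysgood/qwen3-st2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [
    {"role": "user", "content": "Summarize what a context window is."},
]

# Render the conversation with the model's chat template and tokenize it.
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)

# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```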
Good For
- General-purpose text generation tasks.
- Applications requiring responses to specific user queries or instructions.
- Scenarios where a fine-tuned Qwen3-based model with a large context window is beneficial.