Name: thwannbe/qwen3-1.7b-openthoughts-warmup-sft API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: thwannbe

Model Overview

This model, thwannbe/qwen3-1.7b-openthoughts-warmup-sft, is a 1.7 billion parameter language model derived from the Qwen/Qwen3-1.7B-Base architecture. It has undergone supervised fine-tuning (SFT) using the TRL library, a framework specifically designed for Transformer Reinforcement Learning.

Key Capabilities

Text Generation: Optimized for generating human-like text based on given prompts.
Base Model Enhancement: Builds upon the capabilities of the Qwen3-1.7B-Base model through fine-tuning.
Context Handling: Features a substantial context length of 32768 tokens, allowing for processing and generating longer sequences of text.

Training Details

The model was trained using the SFT method, leveraging TRL version 1.4.0, Transformers 5.9.0, Pytorch 2.12.0, Datasets 4.8.5, and Tokenizers 0.22.2. The training process can be visualized via its Weights & Biases run.

Good For

Developers looking for a fine-tuned Qwen3-based model for general text generation tasks.
Applications requiring a model with a decent context window for handling more extensive inputs.

Overview

Model Overview

Key Capabilities

Training Details

Good For

Full Model Card (README)