Name: TourniquetRules/flip7-reasoning-sft-Qwen3-4B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: TourniquetRules

Model Overview

TourniquetRules/flip7-reasoning-sft-Qwen3-4B is a 4 billion parameter language model built upon the Qwen3-4B architecture. This model has undergone supervised fine-tuning (SFT) using the specialized TourniquetRules/flip7-reasoning-sft dataset, which is designed to enhance its reasoning abilities.

Key Capabilities

Enhanced Reasoning: Specifically fine-tuned on a reasoning-focused dataset to improve logical inference and problem-solving.
Qwen3 Architecture: Benefits from the robust base capabilities of the Qwen3 model family.
Extended Context Window: Supports a context length of 32768 tokens, allowing for the processing of longer and more complex prompts.

Training Details

The model was trained using the TRL (Transformers Reinforcement Learning) library, a framework for fine-tuning large language models. The training procedure utilized SFT, focusing on aligning the model's outputs with desired reasoning patterns present in the dataset.

Good For

Applications requiring strong logical reasoning.
Tasks involving complex problem-solving and inference.
Scenarios where understanding and generating reasoned responses are critical.

Overview

Model Overview

Key Capabilities

Training Details

Good For

Full Model Card (README)