edbeeching/Qwen3-4B-Thinking-2507-SFT-tr5

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 22, 2026Architecture:Transformer Warm

edbeeching/Qwen3-4B-Thinking-2507-SFT-tr5 is a 4 billion parameter language model, fine-tuned from Qwen/Qwen3-4B-Thinking-2507 using Supervised Fine-Tuning (SFT) with the TRL framework. This model is designed for text generation tasks, particularly for responding to complex, open-ended questions. It leverages a 32768 token context length, making it suitable for applications requiring extensive contextual understanding and coherent, detailed outputs.

Loading preview...

Overview

edbeeching/Qwen3-4B-Thinking-2507-SFT-tr5 is a 4 billion parameter language model derived from the Qwen3-4B-Thinking-2507 base model. It has undergone Supervised Fine-Tuning (SFT) using the TRL (Transformer Reinforcement Learning) framework, specifically version 0.27.0.dev0. This fine-tuning process aims to enhance its ability to generate coherent and thoughtful responses to user prompts.

Key Capabilities

  • Text Generation: Excels at generating human-like text based on given prompts.
  • Contextual Understanding: Benefits from a substantial 32768 token context window, allowing it to process and generate responses based on extensive input.
  • Instruction Following: Fine-tuned with SFT, indicating an improved capacity to follow instructions and produce relevant outputs.

Training Details

The model was trained using the TRL framework, a library developed by Hugging Face for transformer reinforcement learning. The training procedure involved Supervised Fine-Tuning, a common method for adapting pre-trained language models to specific tasks or response styles. The training environment utilized Transformers version 5.3.0.dev0, Pytorch 2.10.0, Datasets 4.5.0, and Tokenizers 0.22.2.

Use Cases

This model is particularly well-suited for applications requiring detailed and thoughtful text responses, such as:

  • Conversational AI: Generating engaging and contextually aware dialogue.
  • Content Creation: Assisting with drafting articles, creative writing, or detailed explanations.
  • Question Answering: Providing comprehensive answers to complex, open-ended questions.