SantiagoC/palindrome-sft-qwen3

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:May 6, 2026Architecture:Transformer Warm

SantiagoC/palindrome-sft-qwen3 is a 0.8 billion parameter causal language model, fine-tuned from Qwen/Qwen3-0.6B using Supervised Fine-Tuning (SFT) with the TRL framework. This model is designed for text generation tasks, leveraging its Qwen3 base architecture for efficient performance. It is suitable for applications requiring instruction-following capabilities derived from its SFT training.

Loading preview...

Model Overview

SantiagoC/palindrome-sft-qwen3 is a 0.8 billion parameter language model, fine-tuned from the Qwen/Qwen3-0.6B base model. The fine-tuning process utilized Supervised Fine-Tuning (SFT) and the TRL framework, indicating its optimization for specific instruction-following tasks.

Key Capabilities

  • Text Generation: Capable of generating coherent and contextually relevant text based on user prompts.
  • Instruction Following: Benefits from SFT training, allowing it to respond to instructions and questions effectively.
  • Qwen3 Architecture: Built upon the Qwen3-0.6B architecture, providing a compact yet capable foundation for language understanding and generation.

Training Details

The model was trained using the TRL (Transformers Reinforcement Learning) library, specifically employing an SFT approach. The training environment included TRL 1.3.0, Transformers 5.8.0, PyTorch 2.11.0, Datasets 4.8.5, and Tokenizers 0.22.2.

Good For

  • Interactive Applications: Suitable for chatbots or conversational AI where instruction-based responses are needed.
  • Content Creation: Can be used for generating short-form text, creative writing prompts, or answering specific questions.
  • Research and Development: Provides a fine-tuned Qwen3-based model for further experimentation or integration into larger systems.