CriteriaPO/qwen2.5-3b-sft-10

Warm
Public
3.1B
BF16
32768
1
May 7, 2025
Hugging Face

CriteriaPO/qwen2.5-3b-sft-10 is a 3.1 billion parameter causal language model, fine-tuned from Qwen/Qwen2.5-3B using the TRL framework. This model is specifically trained with Supervised Fine-Tuning (SFT) to enhance its conversational capabilities and response generation. It is designed for general text generation tasks, offering a balance between model size and performance for various applications. The model leverages a 32K context length, making it suitable for processing longer prompts and generating coherent, extended outputs.

No reviews yet. Be the first to review!