CriteriaPO/llama3.2-3b-sft-10 is a 3 billion parameter language model fine-tuned from Meta's Llama-3.2-3B architecture. This model has undergone Supervised Fine-Tuning (SFT) using the TRL framework, enhancing its ability to follow instructions and generate coherent text. It is designed for general text generation tasks, particularly those requiring instruction-following capabilities.
No reviews yet. Be the first to review!