CriteriaPO/llama3.2-3b-dpo-finegrained
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:May 15, 2025Architecture:Transformer Warm

CriteriaPO/llama3.2-3b-dpo-finegrained is a 3 billion parameter language model developed by CriteriaPO, fine-tuned from CriteriaPO/llama3.2-3b-sft-10. This model utilizes Direct Preference Optimization (DPO) for enhanced performance, making it suitable for generating high-quality, preference-aligned text. It is designed for general text generation tasks where nuanced and preferred responses are critical.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p