CriteriaPO/llama3.2-3b-dpo-finegrained
Text Generation
Concurrency Cost: 1
Model Size: 3.2B
Quant: BF16
Ctx Length: 32k
Published: May 15, 2025
Architecture: Transformer
Warm
CriteriaPO/llama3.2-3b-dpo-finegrained is a 3-billion-parameter language model developed by CriteriaPO, fine-tuned from CriteriaPO/llama3.2-3b-sft-10. It was trained with Direct Preference Optimization (DPO), which tunes the model toward responses that were preferred in pairwise comparisons. It is intended for general text generation tasks where nuanced, preference-aligned responses matter.
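For readers unfamiliar with DPO, the following is a minimal sketch of the standard per-pair DPO objective, not CriteriaPO's actual training code. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are illustrative assumptions; the inputs are the summed log-probabilities that the fine-tuned policy and the frozen reference model (here, presumably the SFT checkpoint) assign to a chosen and a rejected response.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one preference pair (illustrative sketch).

    All four inputs are summed sequence log-probabilities. `beta` is a
    hypothetical default; the value used for this model is not stated.
    """
    # How much more (in log space) the policy favors each response
    # compared with the reference model.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp

    # Scaled difference of the two margins.
    x = beta * (chosen_margin - rejected_margin)

    # Loss is -log(sigmoid(x)), computed in a numerically stable form.
    if x >= 0:
        return math.log1p(math.exp(-x))
    return -x + math.log1p(math.exp(x))


# When policy and reference agree, the loss equals log(2).
baseline = dpo_loss(-11.0, -11.0, -11.0, -11.0)

# When the policy favors the chosen response more than the reference does,
# the loss drops below that baseline.
improved = dpo_loss(-10.0, -12.0, -11.0, -11.0)
```

The loss falls as the policy raises the likelihood of the chosen response relative to the rejected one, measured against the reference model, which is what "preference-aligned" means in practice here.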