Syed-Hasan-8503/Phi-3-mini-4K-instruct-cpo-simpo
Task: Text Generation
Concurrency Cost: 1
Model Size: 4B
Quant: BF16
Ctx Length: 4k
Published: Jun 24, 2024
License: apache-2.0
Architecture: Transformer
Open Weights
Warm
Syed-Hasan-8503/Phi-3-mini-4K-instruct-cpo-simpo is a roughly 4-billion-parameter Phi-3-mini-4K-instruct model fine-tuned with CPO-SimPO, a technique that combines Contrastive Preference Optimization (CPO) with Simple Preference Optimization (SimPO). It targets instruction-following tasks and is reported to improve on benchmarks such as GSM8K and TruthfulQA. The combined objective is meant to discourage long, low-quality outputs while reinforcing the preferences learned during training.
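To make the idea of combining the two objectives concrete, here is a minimal sketch of how a CPO-SimPO style loss could be computed for a single preference pair. It assumes the commonly described formulation: SimPO's length-normalized margin loss plus CPO's negative log-likelihood term on the preferred response. The function names and the values of `beta`, `gamma`, and `alpha` are illustrative assumptions, not the settings used to train this model.

```python
import math


def avg_logprob(token_logprobs):
    """Length-normalized log-probability: mean of per-token log-probs."""
    return sum(token_logprobs) / len(token_logprobs)


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def cpo_simpo_loss(chosen_logprobs, rejected_logprobs,
                   beta=2.0, gamma=1.0, alpha=1.0):
    """Hypothetical CPO-SimPO loss for one (chosen, rejected) pair.

    beta, gamma, alpha are assumed example values, not the model's settings.
    """
    chosen_avg = avg_logprob(chosen_logprobs)
    rejected_avg = avg_logprob(rejected_logprobs)

    # SimPO part: the chosen response should beat the rejected one by a
    # target reward margin gamma, using length-normalized log-probs so that
    # longer sequences are not favored or penalized merely for their length.
    margin = beta * (chosen_avg - rejected_avg) - gamma
    preference_loss = softplus(-margin)  # equals -log(sigmoid(margin))

    # CPO part: negative log-likelihood on the chosen response, which keeps
    # the policy close to the preferred outputs.
    nll_loss = -chosen_avg

    return preference_loss + alpha * nll_loss


# Example: per-token log-probs under the policy (made-up numbers).
good = [-0.1, -0.2]
bad = [-1.0, -1.5, -2.0]
print(cpo_simpo_loss(good, bad))  # small loss: the policy already prefers `good`
print(cpo_simpo_loss(bad, good))  # large loss: the policy prefers the wrong response
```

Because both terms use the mean log-probability per token, repeating the same per-token quality over more tokens does not change the loss, which is the length-normalization property the description alludes to.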