AALF/gemma-2-27b-it-SimPO-37K-100steps
Text generation
Concurrency cost: 2
Model size: 27B
Quantization: FP8
Context length: 32k
Published: Aug 13, 2024
License: Gemma
Architecture: Transformer
AALF/gemma-2-27b-it-SimPO-37K-100steps is a 27-billion-parameter instruction-tuned model based on Google's Gemma-2 architecture. It was fine-tuned with the SimPO framework on a curated set of 37,040 preference data points derived from UltraFeedback, with responses annotated by ArmoRM-Llama3-8B-v0.1. The tuning optimizes the model toward generating high-quality, preferred responses, and it achieves a 77.09% win rate on AlpacaEval 2.0.
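For local use, the checkpoint can be loaded like any other Gemma-2 causal language model. The following is a minimal sketch assuming the Hugging Face transformers library and a bfloat16 load with enough GPU memory for a 27B model; the FP8 quantization noted above refers to the hosted deployment, not this snippet.

```python
# Minimal sketch: load the model from the Hub and run one chat turn.
# Assumes transformers >= 4.42 (Gemma-2 support) and sufficient GPU memory.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "AALF/gemma-2-27b-it-SimPO-37K-100steps"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Gemma-2 instruction-tuned models expect their chat template, so format
# the prompt through the tokenizer rather than passing raw text.
messages = [{"role": "user", "content": "Explain SimPO in one paragraph."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```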
Popular Sampler Settings
The top three parameter combinations used by Featherless users for this model. No values are recorded here yet:
temperature: –
top_p: –
top_k: –
frequency_penalty: –
presence_penalty: –
repetition_penalty: –
min_p: –
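Since no concrete values are listed above, the sketch below uses placeholder settings purely to show where each sampler parameter would go in an OpenAI-compatible chat completions request. The base URL, the API key handling, and the server's acceptance of non-standard fields such as top_k, min_p, and repetition_penalty are assumptions; check the provider's API documentation.

```python
# Hedged sketch: pass sampler settings through an OpenAI-compatible API.
# Endpoint URL and non-standard parameter support are assumptions.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="AALF/gemma-2-27b-it-SimPO-37K-100steps",
    messages=[{"role": "user", "content": "Write a short haiku about autumn."}],
    # Standard OpenAI sampling parameters (placeholder values):
    temperature=0.7,
    top_p=0.9,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Non-standard parameters, forwarded only if the server accepts them:
    extra_body={"top_k": 40, "min_p": 0.05, "repetition_penalty": 1.05},
)
print(response.choices[0].message.content)
```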