AALF/gemma-2-27b-it-SimPO-37K-100steps
TEXT GENERATIONConcurrency Cost:2Model Size:27BQuant:FP8Ctx Length:32kPublished:Aug 13, 2024License:gemmaArchitecture:Transformer0.0K Warm

AALF/gemma-2-27b-it-SimPO-37K-100steps is a 27 billion parameter instruction-tuned model based on the Google Gemma-2 architecture. This model was fine-tuned using the SimPO framework on a curated dataset of 37,040 preference data points derived from UltraFeedback, with responses annotated by ArmoRM-Llama3-8B-v0.1. It is specifically optimized for generating high-quality, preferred responses, achieving a 77.09% WinRate on AlpacaEval2.0.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p