princeton-nlp/Mistral-7B-Base-SFT-SimPO
Task: Text Generation | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 8k | Published: May 17, 2024 | Architecture: Transformer | Status: Cold
The princeton-nlp/Mistral-7B-Base-SFT-SimPO model is a 7-billion-parameter language model based on the Mistral architecture, fine-tuned with Simple Preference Optimization (SimPO). Developed by princeton-nlp, it uses SimPO's reference-free reward, which scores a response by its length-normalized log-likelihood rather than by comparison against a frozen reference model. It is suited to instruction-following and chat-style generation and supports a context length of 8,192 tokens.
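For background (summarizing the SimPO paper by Meng et al., 2024; the notation here is ours): SimPO's implicit reward for a response y given prompt x is the length-normalized log-likelihood (beta / |y|) log pi_theta(y | x), and training minimizes a Bradley-Terry-style pairwise loss with a target reward margin gamma over preferred/dispreferred pairs (y_w, y_l):

\mathcal{L}_{\mathrm{SimPO}} = -\mathbb{E}_{(x,\, y_w,\, y_l) \sim \mathcal{D}} \left[ \log \sigma \left( \frac{\beta}{|y_w|} \log \pi_\theta(y_w \mid x) - \frac{\beta}{|y_l|} \log \pi_\theta(y_l \mid x) - \gamma \right) \right]

Because no reference policy pi_ref appears in the objective, training does not need to keep a frozen reference model in memory, which is the "reference-free" property noted above.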
Popular Sampler Settings
The top 3 parameter combinations used by Featherless users for this model each tune the following sampler parameters (an example of passing them through an API call follows the list):
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p
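As a hedged illustration, here is a minimal sketch of supplying these sampler parameters through an OpenAI-compatible chat completions request. The Featherless base URL and the server's acceptance of the non-standard fields (top_k, min_p, repetition_penalty) via extra_body are assumptions, and the concrete values are placeholders, not the popular configurations themselves.

# A minimal sketch of passing these sampler settings through an
# OpenAI-compatible chat completions call. Assumptions: the
# Featherless base URL below, and that the server accepts the
# non-standard fields (top_k, min_p, repetition_penalty) via
# extra_body. All values are illustrative placeholders.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="princeton-nlp/Mistral-7B-Base-SFT-SimPO",
    messages=[{"role": "user", "content": "Explain SimPO in two sentences."}],
    # Standard OpenAI sampler fields:
    temperature=0.7,
    top_p=0.9,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Non-standard fields, forwarded only if the server supports them:
    extra_body={"top_k": 40, "min_p": 0.05, "repetition_penalty": 1.1},
)
print(response.choices[0].message.content)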