grimjim/Llama-3-Instruct-8B-SPPO-Iter3-SimPO-merge
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Jun 28, 2024License:llama3Architecture:Transformer0.0K Warm

grimjim/Llama-3-Instruct-8B-SPPO-Iter3-SimPO-merge is an 8 billion parameter instruction-tuned language model built upon the Meta Llama 3 architecture. This model is a merge of princeton-nlp/Llama-3-Instruct-8B-SimPO and UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3, created using the SLERP merge method. It is designed for general text generation tasks, leveraging the combined strengths of its base models.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p