grimjim/Llama-3-Instruct-8B-SimPO-SPPO-Iter3-merge
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Jun 28, 2024License:llama3Architecture:Transformer0.0K Warm

The grimjim/Llama-3-Instruct-8B-SimPO-SPPO-Iter3-merge is an 8 billion parameter instruction-tuned language model based on the Meta Llama 3 architecture, created by grimjim. This model is a merge of princeton-nlp/Llama-3-Instruct-8B-SimPO and UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3, utilizing the SLERP merge method. It is designed for general text generation tasks, with evaluation results available on the Open LLM Leaderboard for various benchmarks including IFEval and BBH.

Loading preview...