princeton-nlp/Llama-3-Instruct-8B-IPO
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Oct 5, 2024Architecture:Transformer Warm
Llama-3-Instruct-8B-IPO is an 8 billion parameter instruction-tuned language model developed by princeton-nlp. This model is fine-tuned using the SimPO method, a reference-free preference optimization technique, making it particularly effective for tasks requiring nuanced preference alignment. It is designed for general instruction following with an 8192 token context length.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–