princeton-nlp/Llama-3-Instruct-8B-KTO
Text Generation
Concurrency Cost: 1
Model Size: 8B
Quant: FP8
Context Length: 8k
Published: May 17, 2024
Architecture: Transformer
Availability: Warm
princeton-nlp/Llama-3-Instruct-8B-KTO is an 8-billion-parameter instruction-tuned language model from Princeton NLP, built on the Llama-3 architecture. It uses KTO (Kahneman-Tversky Optimization) for preference alignment, which distinguishes it from standard instruction-tuned models. It is designed for general conversational AI tasks and supports an 8192-token context window for coherent, extended interactions.
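Since the model is published under this ID on Hugging Face, it can also be run locally. Below is a minimal sketch using the Hugging Face transformers library; the prompt is illustrative, and it assumes a GPU with enough memory for an 8B model in bfloat16.

```python
# Minimal local-inference sketch for princeton-nlp/Llama-3-Instruct-8B-KTO.
# Assumes: transformers and torch installed, GPU with ~16 GB+ memory.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "princeton-nlp/Llama-3-Instruct-8B-KTO"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # fits an 8B model on a single modern GPU
    device_map="auto",
)

# Llama-3-Instruct models expect the Llama 3 chat template.
messages = [{"role": "user", "content": "Explain KTO alignment in one sentence."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```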
Popular Sampler Settings
Featherless surfaces the top 3 parameter combinations its users apply to this model. The configurations cover the following samplers (specific values were not captured here; a usage sketch follows the list):

- temperature
- top_p
- top_k
- frequency_penalty
- presence_penalty
- repetition_penalty
- min_p
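These samplers map onto the request parameters of an OpenAI-compatible chat completions API. The sketch below shows how they might be passed; the base URL, the placement of top_k, min_p, and repetition_penalty in extra_body, and all the numeric values are assumptions for illustration, not the actual Featherless user statistics.

```python
# Hypothetical sketch: sending sampler settings to an OpenAI-compatible
# endpoint. base_url and the extra_body keys are assumptions; the values
# are placeholders, not the real top-3 configs.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="princeton-nlp/Llama-3-Instruct-8B-KTO",
    messages=[{"role": "user", "content": "Write a haiku about autumn."}],
    temperature=0.7,        # illustrative value
    top_p=0.9,              # illustrative value
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Non-standard samplers are commonly passed as extra fields; whether
    # the server honors them depends on the serving stack.
    extra_body={"top_k": 40, "min_p": 0.05, "repetition_penalty": 1.1},
)
print(response.choices[0].message.content)
```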