NousResearch/Yarn-Mistral-7b-128k
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Oct 31, 2023License:apache-2.0Architecture:Transformer0.6K Open Weights Cold

NousResearch/Yarn-Mistral-7b-128k is a 7 billion parameter language model developed by NousResearch, extending the Mistral-7B-v0.1 architecture. It is specifically pretrained on long context data using the YaRN extension method, enabling an impressive 128k token context window. This model is optimized for processing and understanding extremely long sequences of text while maintaining strong performance on short-context tasks.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p