aws-prototyping/MegaBeam-Mistral-7B-512k
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jul 30, 2024License:apache-2.0Architecture:Transformer0.1K Open Weights Warm

The aws-prototyping/MegaBeam-Mistral-7B-512k is a 7 billion parameter language model, based on Mistral-7B Instruct-v0.2, specifically engineered for efficient long-context processing. It supports an exceptionally large context window of 524,288 tokens, making it highly effective for tasks requiring extensive document analysis. This model excels at long-context retrieval and question answering, demonstrating strong performance on benchmarks like Needle In A Haystack and RULER.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p