aws-prototyping/MegaBeam-Mistral-7B-512k
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jul 30, 2024License:apache-2.0Architecture:Transformer0.1K Open Weights Warm
The aws-prototyping/MegaBeam-Mistral-7B-512k is a 7 billion parameter language model, based on Mistral-7B Instruct-v0.2, specifically engineered for efficient long-context processing. It supports an exceptionally large context window of 524,288 tokens, making it highly effective for tasks requiring extensive document analysis. This model excels at long-context retrieval and question answering, demonstrating strong performance on benchmarks like Needle In A Haystack and RULER.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–