namespace-Pt/Llama-3-8B-Instruct-80K-QLoRA-Merged
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Apr 30, 2024License:mitArchitecture:Transformer0.0K Open Weights Warm

Llama-3-8B-Instruct-80K-QLoRA-Merged is an 8 billion parameter instruction-tuned causal language model developed by namespace-Pt, extending the context length of Meta's Llama-3-8B-Instruct to 80,000 tokens. This model was efficiently trained using QLoRA and 3.5K GPT-4 synthesized long-context data, demonstrating strong performance on long-context evaluation benchmarks like LongBench and InfiniteBench. It is optimized for tasks requiring extensive context understanding and generation, while maintaining competitive short-context capabilities.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p