Qwen/Qwen2.5-14B-Instruct-1M
TEXT GENERATIONConcurrency Cost:1Model Size:14.8BQuant:FP8Ctx Length:32kPublished:Jan 23, 2025License:apache-2.0Architecture:Transformer0.3K Open Weights Warm
Qwen2.5-14B-Instruct-1M is a 14.7 billion parameter causal language model developed by Qwen, featuring a transformer architecture. This model is specifically optimized for ultra-long context tasks, supporting an impressive context length of up to 1 million tokens while maintaining strong performance on shorter tasks. It is designed for advanced applications requiring extensive contextual understanding and processing, particularly when deployed with its custom vLLM framework for efficiency.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
–
frequency_penalty
presence_penalty
repetition_penalty
–
min_p
–