migtissera/Tess-v2.5.2-Qwen2-72B
TEXT GENERATIONConcurrency Cost:4Model Size:72.7BQuant:FP8Ctx Length:32kPublished:Jun 13, 2024License:qwen2Architecture:Transformer0.0K Cold
Tess-v2.5.2-Qwen2-72B is a 72 billion parameter large language model developed by Migel Tissera, fine-tuned on the Qwen2-72B base. This model demonstrates significant improvements in reasoning, coding, and mathematics, achieving the #1 rank among open-weight models on MMLU evaluations. It is designed to provide detailed answers and natural conversation, including intentional follow-up questions, and is suitable for complex analytical and generative tasks.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
min_p