migtissera/Tess-v2.5.2-Qwen2-72B
TEXT GENERATIONConcurrency Cost:4Model Size:72.7BQuant:FP8Ctx Length:32kPublished:Jun 13, 2024License:qwen2Architecture:Transformer0.0K Cold

Tess-v2.5.2-Qwen2-72B is a 72 billion parameter large language model developed by Migel Tissera, fine-tuned on the Qwen2-72B base. This model demonstrates significant improvements in reasoning, coding, and mathematics, achieving the #1 rank among open-weight models on MMLU evaluations. It is designed to provide detailed answers and natural conversation, including intentional follow-up questions, and is suitable for complex analytical and generative tasks.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p