Qwen/Qwen2.5-Math-72B-Instruct
TEXT GENERATIONConcurrency Cost:4Model Size:72.7BQuant:FP8Ctx Length:32kPublished:Sep 16, 2024License:qwenArchitecture:Transformer0.0K Warm
Qwen/Qwen2.5-Math-72B-Instruct is a 72.7 billion parameter instruction-tuned causal language model developed by Qwen, specifically optimized for solving mathematical problems. This model excels in both English and Chinese mathematics, supporting Chain-of-Thought (CoT) and Tool-integrated Reasoning (TIR) for enhanced computational accuracy and symbolic manipulation. It is designed to be a mathematical expert model, building upon the Qwen2-Math series with significant performance improvements on mathematical benchmarks.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–