TIGER-Lab/MAmmoTH2-8B
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kLicense:mitArchitecture:Transformer0.0K Open Weights Warm

MAmmoTH2-8B is an 8 billion parameter instruction-tuned causal language model developed by TIGER-Lab, based on the Llama-3 architecture with an 8192 token context length. It is specifically optimized for enhancing reasoning abilities, particularly in mathematical tasks, by leveraging 10 million instruction-response pairs harvested from web corpora. This model demonstrates significant performance improvements on benchmarks like MATH and GSM8K without relying on domain-specific training data.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p