TIGER-Lab/MAmmoTH2-8B
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kLicense:mitArchitecture:Transformer0.0K Open Weights Warm
MAmmoTH2-8B is an 8 billion parameter instruction-tuned causal language model developed by TIGER-Lab, based on the Llama-3 architecture with an 8192 token context length. It is specifically optimized for enhancing reasoning abilities, particularly in mathematical tasks, by leveraging 10 million instruction-response pairs harvested from web corpora. This model demonstrates significant performance improvements on benchmarks like MATH and GSM8K without relying on domain-specific training data.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p