TIGER-Lab/MAmmoTH2-8B-Plus
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:May 6, 2024License:mitArchitecture:Transformer0.0K Open Weights Warm

MAmmoTH2-8B-Plus is an 8 billion parameter language model developed by TIGER-Lab, based on the Llama-3 architecture with an 8192-token context length. It is specifically instruction-tuned using 10 million web-harvested instruction-response pairs to significantly enhance reasoning abilities, particularly in mathematical and general reasoning tasks. This model builds upon the MAmmoTH2 foundation by further training on public instruction tuning datasets, setting new performance standards in reasoning and chatbot benchmarks.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p