nvidia/OpenMath2-Llama3.1-8B
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Sep 30, 2024License:llama3.1Architecture:Transformer0.0K Warm
OpenMath2-Llama3.1-8B is an 8 billion parameter language model developed by NVIDIA, fine-tuned from Llama3.1-8B-Base with the OpenMathInstruct-2 dataset. This model is specifically optimized for mathematical reasoning and problem-solving, demonstrating significant performance improvements over Llama3.1-8B-Instruct on various math benchmarks, including a 15.9% increase on the MATH dataset. It is designed for advanced mathematical tasks, leveraging a 32768 token context length.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–