lkevinzc/Llama-3.2-3B-NuminaQA
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Mar 6, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm
lkevinzc/Llama-3.2-3B-NuminaQA is a 3 billion parameter language model based on the FineMath-Llama-3B architecture, fine-tuned by lkevinzc. It is specifically optimized for question-answering tasks, utilizing the numia-1.5-qa-concatenated dataset. This model serves as a foundational component in a minimalist R1-Zero recipe, focusing on understanding R1-Zero-like training.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–