lkevinzc/Llama-3.2-3B-NuminaQA
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Mar 6, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

lkevinzc/Llama-3.2-3B-NuminaQA is a 3 billion parameter language model based on the FineMath-Llama-3B architecture, fine-tuned by lkevinzc. It is specifically optimized for question-answering tasks, utilizing the numia-1.5-qa-concatenated dataset. This model serves as a foundational component in a minimalist R1-Zero recipe, focusing on understanding R1-Zero-like training.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p