kmseong/Llama3.2-3B-gsm8k-full-FT
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Feb 23, 2026License:llama3.2Architecture:Transformer Warm

The kmseong/Llama3.2-3B-gsm8k-full-FT is a 3.2 billion parameter Llama 3.2 Instruct model, developed by kmseong, that has been fully fine-tuned on the GSM8K dataset. This model specializes in mathematical reasoning, particularly grade school math problems, by updating all its parameters rather than using adapter methods like LoRA. It achieves a 40.00% accuracy on the GSM8K test set and is primarily intended for tasks requiring step-by-step arithmetic problem-solving.

Loading preview...