kmseong/RSN-GSM8K-SFT-Model
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Dec 13, 2025License:llama3.1Architecture:Transformer Warm
The kmseong/RSN-GSM8K-SFT-Model is an 8 billion parameter Llama 3.1 Instruct model, fine-tuned by kmseong using LoRA on the GSM8K dataset. This model is specifically optimized for mathematical reasoning tasks, demonstrating a 55.00% accuracy on the GSM8K test set. It is designed to enhance problem-solving capabilities for arithmetic and word problems.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–