kmseong/RSN-GSM8K-SFT-Model
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Dec 13, 2025License:llama3.1Architecture:Transformer Warm

The kmseong/RSN-GSM8K-SFT-Model is an 8 billion parameter Llama 3.1 Instruct model, fine-tuned by kmseong using LoRA on the GSM8K dataset. This model is specifically optimized for mathematical reasoning tasks, demonstrating a 55.00% accuracy on the GSM8K test set. It is designed to enhance problem-solving capabilities for arithmetic and word problems.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p