kmseong/Llama3.2-3B-gsm8k-full-FT

Hugging Face · Text Generation

Model size: 3.2B · Quantization: BF16 · Context length: 32k · Published: Feb 23, 2026 · License: llama3.2 · Architecture: Transformer

kmseong/Llama3.2-3B-gsm8k-full-FT is a 3.2-billion-parameter Llama 3.2 Instruct model, developed by kmseong, that has been fully fine-tuned on the GSM8K dataset. The model specializes in mathematical reasoning, particularly grade school math word problems; all parameters were updated during training rather than using adapter methods such as LoRA. It achieves 40.00% accuracy on the GSM8K test set and is intended primarily for tasks requiring step-by-step arithmetic problem-solving.


Overview

This model, kmseong/Llama3.2-3B-gsm8k-full-FT, is a 3.2-billion-parameter Llama 3.2 Instruct variant that has undergone full parameter fine-tuning on the GSM8K dataset. Unlike models trained with LoRA, all ~3.2 billion parameters were updated during training, yielding a complete, standalone model optimized for mathematical reasoning.

Key Capabilities

  • Specialized Mathematical Reasoning: Highly optimized for solving grade school math problems, as demonstrated by its training on the GSM8K dataset.
  • Full Parameter Fine-tuning: All model weights were updated, which generally outperforms LoRA given sufficient training data, though it produces a larger standalone checkpoint (~6GB in BF16).
  • Direct Usage: Can be loaded directly with standard tooling, without requiring PEFT (Parameter-Efficient Fine-Tuning) libraries, which simplifies deployment; see the loading sketch after this list.
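
Because every weight lives in the checkpoint itself, loading works like any standalone Llama model. A minimal sketch, assuming a recent transformers version; the torch_dtype and device_map settings shown here are standard transformers options, not configuration documented by the model author:

```python
# Minimal loading sketch. Because this is a full fine-tune, the checkpoint
# loads like any standalone Llama model; no PEFT/adapter step is needed.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "kmseong/Llama3.2-3B-gsm8k-full-FT"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the published BF16 weights
    device_map="auto",           # place layers on the available GPU(s)
)
```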

Good For

  • Grade School Math Problems: Ideal for applications requiring accurate, step-by-step solutions to arithmetic and word problems similar to those in the GSM8K dataset; see the inference sketch after this list.
  • Research and Development: Suitable for exploring the impact of full parameter fine-tuning on smaller language models for specific tasks.
  • Benchmarking: Can serve as a baseline for evaluating mathematical reasoning capabilities in the 3B parameter class.
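
Continuing from the loading sketch above, a short inference example on a GSM8K-style word problem. The exact prompt format used during fine-tuning is not documented here, so this sketch assumes the standard Llama 3.2 Instruct chat template; greedy decoding is an arbitrary choice for reproducible arithmetic:

```python
# Inference sketch on a GSM8K-style word problem (reuses `tokenizer` and
# `model` from the loading sketch above). Assumes the standard Llama 3.2
# Instruct chat template; the training prompt format is not documented.
messages = [
    {
        "role": "user",
        "content": (
            "Natalia sold clips to 48 of her friends in April, and then "
            "she sold half as many clips in May. How many clips did "
            "Natalia sell altogether in April and May?"
        ),
    }
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=256, do_sample=False)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```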

Limitations

  • Domain Specificity: Performance is primarily strong on GSM8K-like math problems and may degrade significantly on other mathematical domains or general language tasks.
  • Resource Intensive: Efficient BF16 inference calls for a GPU with at least 16GB of VRAM, since the full ~6GB checkpoint plus activation and KV-cache overhead must fit in memory; a quantized-loading workaround is sketched below.
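
If 16GB of VRAM is not available, one common workaround is 4-bit quantized loading via bitsandbytes. This is not documented by the model author and may cost some GSM8K accuracy; a minimal sketch:

```python
# Hypothetical memory-reduction sketch: 4-bit quantized loading via
# bitsandbytes for GPUs well under 16GB of VRAM. Quantization is not part
# of the published model and may degrade GSM8K accuracy.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,  # store weights in 4-bit, compute in BF16
)
model = AutoModelForCausalLM.from_pretrained(
    "kmseong/Llama3.2-3B-gsm8k-full-FT",
    quantization_config=bnb_config,
    device_map="auto",
)
```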