Kyleyee/qwen2_5-0.5b-sft-arithmetic
Kyleyee/qwen2_5-0.5b-sft-arithmetic is a 0.5 billion parameter language model, fine-tuned from Qwen/Qwen2.5-0.5B-Instruct. This model specializes in arithmetic tasks, having been trained on the Kyleyee/arithmetic-sft dataset using the TRL framework. It is designed for applications requiring enhanced numerical reasoning and mathematical problem-solving capabilities.
Model Overview
This model is a specialized version of the Qwen2.5-0.5B-Instruct architecture, with 0.5 billion parameters and a context length of 131,072 tokens, fine-tuned to perform well on arithmetic reasoning tasks.
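The checkpoint can be loaded with the standard transformers API. This is a minimal sketch, assuming the repository name above and no special loading options:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Kyleyee/qwen2_5-0.5b-sft-arithmetic"

# Load the fine-tuned checkpoint and its tokenizer from the Hugging Face Hub.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
```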
Key Capabilities
- Arithmetic Problem Solving: Enhanced performance on mathematical operations and numerical reasoning due to targeted fine-tuning (see the example after this list).
- Instruction Following: Retains the instruction-following capabilities of its base Qwen2.5-0.5B-Instruct model.
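As a quick illustration of both capabilities, the sketch below poses an arithmetic question through the model's chat template. It continues from the loading sketch above; the prompt and generation settings are illustrative assumptions, not documented defaults:

```python
# Continues from the loading sketch in the Model Overview section.
messages = [{"role": "user", "content": "What is 37 * 24?"}]

# Format the conversation with the Qwen2.5 chat template and generate a reply.
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
output_ids = model.generate(input_ids, max_new_tokens=64)

# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```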
Training Details
The model underwent Supervised Fine-Tuning (SFT) on the Kyleyee/arithmetic-sft dataset using Hugging Face's TRL (Transformer Reinforcement Learning) framework. This targeted training differentiates it from general-purpose language models of similar size; a sketch of a comparable training setup follows below.
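A run of this shape can be reproduced with TRL's SFTTrainer. This is a minimal sketch, assuming the Kyleyee/arithmetic-sft dataset loads directly with the datasets library and that default SFT hyperparameters are acceptable; the actual training configuration is not documented here:

```python
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Load the arithmetic SFT dataset (split name assumed to be "train").
dataset = load_dataset("Kyleyee/arithmetic-sft", split="train")

# Fine-tune the instruct base model; TRL resolves the model id internally.
trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B-Instruct",
    train_dataset=dataset,
    args=SFTConfig(output_dir="qwen2_5-0.5b-sft-arithmetic"),
)
trainer.train()
```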
Use Cases
This model is particularly well-suited for applications where accurate and efficient arithmetic computation or numerical understanding is critical, such as educational tools, data analysis assistants, or systems requiring basic mathematical problem-solving.