Abhiram1009/qwen2.5-0.5B-math-v2
Abhiram1009/qwen2.5-0.5B-math-v2 is a 0.5 billion parameter language model, fine-tuned from Abhiram1009/qwen2.5-0.5B-math-tuned using the TRL framework. The model is optimized for mathematical reasoning and related tasks, building on its math-tuned base, and offers a compact option for applications requiring numerical and logical processing within a 32768-token context length.
Model Overview
Abhiram1009/qwen2.5-0.5B-math-v2 is a 0.5 billion parameter language model that has been fine-tuned from the existing Abhiram1009/qwen2.5-0.5B-math-tuned base. This iteration focuses on enhancing its capabilities through further training using the TRL (Transformer Reinforcement Learning) framework.
Key Capabilities
- Mathematical Reasoning: Built upon a math-tuned predecessor, this model is designed to excel in tasks requiring numerical understanding and logical problem-solving.
- Compact Size: With 0.5 billion parameters, it offers a lightweight solution suitable for deployment in environments with resource constraints.
- Fine-tuned Performance: The model leverages the TRL framework for its training, indicating a focus on refining its responses and performance for specific applications.
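As a sketch of how the model might be used for the math tasks described above, the following assumes the model is published on the Hugging Face Hub under this repository id and that the `transformers` library is installed; the example question and generation settings are illustrative only.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "Abhiram1009/qwen2.5-0.5B-math-v2"

def solve(question: str, max_new_tokens: int = 256) -> str:
    """Ask the model a math question and return its generated answer."""
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    # Qwen2.5-family models ship a chat template; apply it to build the prompt.
    messages = [{"role": "user", "content": question}]
    prompt = tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )

    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Decode only the newly generated tokens, skipping the echoed prompt.
    new_tokens = outputs[0][inputs["input_ids"].shape[-1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)

if __name__ == "__main__":
    print(solve("What is 17 * 23?"))
```

With only 0.5B parameters, the model can run on CPU, though passing `device_map="auto"` to `from_pretrained` will use a GPU when available.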
Training Details
The model was trained with the Supervised Fine-Tuning (SFT) method, using the following framework versions:
- TRL: 0.19.0
- Transformers: 4.53.0
- PyTorch: 2.7.0
- Datasets: 3.6.0
- Tokenizers: 0.21.2
Good For
- Applications requiring efficient mathematical problem-solving.
- Integration into systems where a smaller model footprint is advantageous.
- Tasks benefiting from a model specifically refined for numerical and logical operations.