# Model Overview
This model, rediska0123/qwen2.5-math-1.5b-dpo-gsm8k-v3, is a 1.5-billion-parameter language model built on the Qwen2.5 architecture. It has been fine-tuned with Direct Preference Optimization (DPO) to strengthen its mathematical reasoning.
## Key Characteristics
- Architecture: Qwen2.5 base model.
- Parameter Count: 1.5 billion parameters.
- Context Length: supports contexts of up to 32,768 tokens.
- Fine-tuning: Utilizes Direct Preference Optimization (DPO) for specialized training.
- Target Domain: optimized for mathematical problem-solving, particularly grade-school word problems in the style of the GSM8K dataset (the "v3" in the model name appears to denote a fine-tuning iteration rather than a dataset version).
## Intended Use Cases
This model is primarily intended for applications requiring strong mathematical reasoning abilities. It is suitable for tasks such as:
- Solving grade school mathematics problems.
- Assisting with quantitative analysis where numerical reasoning is crucial.
- Educational tools focused on math.
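For these use cases, a question is typically wrapped in the ChatML-style prompt format that Qwen2.5 chat models use (`<|im_start|>`/`<|im_end|>` turn markers). The sketch below builds such a prompt by hand; the system instruction shown is an assumption, not one documented for this model, and in practice `tokenizer.apply_chat_template` from the `transformers` library is the more robust way to do this.

```python
# Minimal sketch of formatting a GSM8K-style question into the ChatML
# prompt layout used by Qwen2.5 chat models. The system instruction is a
# plausible default for math models, NOT one confirmed by this model card.

DEFAULT_SYSTEM = (
    "Please reason step by step, and put your final answer within \\boxed{}."
)

def build_prompt(question: str, system: str = DEFAULT_SYSTEM) -> str:
    """Assemble a single-turn ChatML prompt ending at the assistant turn."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{question}<|im_end|>\n"
        f"<|im_start|>assistant\n"
    )

# Example: a typical GSM8K-style word problem.
prompt = build_prompt(
    "Natalia sold clips to 48 of her friends in April, then sold half as "
    "many in May. How many clips did she sell altogether?"
)
print(prompt)
```

The resulting string is what you would tokenize and pass to the model for generation; the trailing `<|im_start|>assistant\n` leaves the prompt open for the model to complete.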
Specific performance benchmarks and details of the training data are not currently available for this model. Users should be aware that performance outside of mathematical reasoning tasks may not be optimized.