nomadicsynth/Qwen2.5-3B-Instruct-Reasoning-gsm8k-v1
nomadicsynth/Qwen2.5-3B-Instruct-Reasoning-gsm8k-v1 is a 3.1-billion-parameter instruction-tuned causal language model based on the Qwen2.5 architecture. Developed by nomadicsynth, it is fine-tuned for reasoning and mathematical tasks, with a particular focus on the GSM8K benchmark. The model was trained with Unsloth for accelerated fine-tuning, making it an efficient choice for applications that need strong numerical and logical problem-solving within a compact parameter count.
Model Overview
The nomadicsynth/Qwen2.5-3B-Instruct-Reasoning-gsm8k-v1 is a 3.1 billion parameter instruction-tuned model built upon the Qwen2.5 architecture. It has been specifically fine-tuned to enhance its reasoning and mathematical problem-solving abilities, with a particular focus on the GSM8K dataset.
Key Characteristics
- Base Model: Fine-tuned from unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit (a loading sketch follows this list).
- Optimized Training: Training was accelerated roughly 2x using Unsloth together with Hugging Face's TRL library.
- Reasoning Focus: The model's primary optimization is for reasoning tasks, as suggested by its name and GSM8K fine-tuning.
- Context Length: Supports a substantial context length of 32768 tokens.
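The model can be loaded like any other Qwen2.5-based checkpoint on the Hub. Below is a minimal sketch using the Hugging Face transformers library; the repository id is taken from this card, while the dtype and device settings are assumptions that may need adjusting for your hardware.

```python
# Minimal loading sketch with Hugging Face transformers.
# The repository id comes from this card; dtype/device choices are assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "nomadicsynth/Qwen2.5-3B-Instruct-Reasoning-gsm8k-v1"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # let transformers pick a suitable precision
    device_map="auto",    # place weights on available GPU(s) or fall back to CPU
)
```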
Ideal Use Cases
- Mathematical Problem Solving: Well suited to applications requiring solutions to arithmetic and word problems, especially those similar to the GSM8K benchmark (see the usage sketch after this list).
- Logical Reasoning: Suitable for tasks that benefit from enhanced logical deduction and step-by-step reasoning.
- Efficient Deployment: Its compact 3.1-billion-parameter size makes it a strong candidate for deployment in resource-constrained environments while retaining solid reasoning capabilities.
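For a quick end-to-end check of the model's math reasoning, the sketch below uses the transformers text-generation pipeline with a hypothetical GSM8K-style word problem; the prompt and generation settings are illustrative assumptions, not values from this card.

```python
# Illustrative GSM8K-style query via the transformers text-generation pipeline.
# The word problem and generation settings below are assumptions for demonstration.
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="nomadicsynth/Qwen2.5-3B-Instruct-Reasoning-gsm8k-v1",
    torch_dtype="auto",
    device_map="auto",
)

messages = [{
    "role": "user",
    "content": (
        "A train travels 60 km in the first hour and 45 km in the second hour. "
        "How many kilometers does it travel in total? Show your reasoning step by step."
    ),
}]

result = pipe(messages, max_new_tokens=512)
print(result[0]["generated_text"][-1]["content"])  # print only the assistant's reply
```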