xw1234gan/GRPO_KL_Qwen2.5-7B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
xw1234gan/GRPO_KL_Qwen2.5-7B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN is a 7.6-billion-parameter instruction-tuned language model based on the Qwen2.5 architecture, with a 32,768-token context length. It is fine-tuned for mathematical tasks, and its primary strength is handling complex mathematical queries and multi-step numerical reasoning.
Overview
This model, xw1234gan/GRPO_KL_Qwen2.5-7B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN, is an instruction-tuned variant of the Qwen2.5 architecture, with 7.6 billion parameters and a 32,768-token context window. Specific training details and benchmarks are not provided in the current model card, but the repository name appears to encode the training recipe: GRPO fine-tuning with a KL penalty (beta = 0.01), a learning rate of 1e-05, what are likely a micro-batch size of 2 and gradient accumulation of 128, n = 2048, and random seed 42, targeting mathematical reasoning (MATH). These interpretations are inferred from the naming convention and should be treated as unverified.
Key Capabilities
- Instruction Following: Designed to respond to user instructions effectively.
- Large Context Window: Supports processing of long inputs and generating extensive outputs, up to 32768 tokens.
- Mathematical Focus: The name and fine-tuning target imply optimization for mathematical tasks, likely including arithmetic, algebra, and logical reasoning in numerical contexts.
Good for
- Applications requiring robust mathematical problem-solving.
- Tasks benefiting from a large context window for complex instructions or data.
- Developers seeking a Qwen2.5-based model with specialized mathematical fine-tuning.
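Since the model card gives no usage snippet, the sketch below shows one plausible way to load and query the checkpoint, assuming it follows the standard Hugging Face `transformers` chat interface like other Qwen2.5-Instruct derivatives. The helper names, the system prompt, and the generation settings are illustrative choices, not part of the model card.

```python
# Minimal inference sketch (assumptions: the checkpoint is compatible with
# AutoModelForCausalLM / AutoTokenizer and ships a Qwen2.5 chat template).
MODEL_ID = (
    "xw1234gan/GRPO_KL_Qwen2.5-7B-Instruct_MATH"
    "_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN"
)


def build_messages(question: str) -> list:
    """Wrap a math question in the chat-message format Qwen2.5-Instruct expects."""
    return [
        {"role": "system", "content": "You are a careful math assistant. Show your work."},
        {"role": "user", "content": question},
    ]


def generate_solution(question: str, max_new_tokens: int = 512) -> str:
    """Load the model and generate an answer. transformers is imported lazily
    so the prompt helper above stays usable without the dependency installed."""
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype="auto", device_map="auto"
    )
    prompt = tokenizer.apply_chat_template(
        build_messages(question), tokenize=False, add_generation_prompt=True
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Strip the prompt tokens and decode only the generated continuation.
    return tokenizer.decode(
        output[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True
    )
```

With the weights available locally or downloadable, `generate_solution("Solve x^2 - 5x + 6 = 0.")` would return the model's worked solution as a string; the long context window also allows much larger prompts, such as multi-problem worksheets.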