Name: xw1234gan/GRPO_KL_Qwen2.5-3B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: xw1234gan

Overview

This model, xw1234gan/GRPO_KL_Qwen2.5-3B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN, is a 3.1 billion parameter instruction-tuned language model built upon the Qwen2.5 architecture. It features a substantial context window of 32768 tokens, enabling it to process and understand extensive inputs for complex tasks.

Key Capabilities

Mathematical Reasoning: The model is specifically fine-tuned to enhance its performance on mathematical problems and logical reasoning tasks.
Instruction Following: Designed to accurately follow instructions, making it suitable for various guided applications.
Extended Context: Benefits from a 32K token context length, allowing for detailed analysis and generation based on large amounts of information.

Good For

Specialized Mathematical Applications: Ideal for use cases requiring strong mathematical problem-solving abilities.
Academic and Research: Can be applied in educational tools or research environments where precise numerical and logical outputs are critical.
Complex Instruction-Based Tasks: Suitable for scenarios where detailed instructions need to be interpreted and executed accurately over long contexts.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)