TMLR-Group-HF/GT-Qwen3-1.7B-Base-MATH
TMLR-Group-HF/GT-Qwen3-1.7B-Base-MATH is a 1.7-billion-parameter Qwen3-based language model from TMLR-Group-HF, trained with the GRPO Ground Truth method on a mathematical dataset. With a 40,960-token context length, the model is optimized for mathematical reasoning, and its specialized training makes it well suited to applications that require robust mathematical problem-solving.
Model Overview
TMLR-Group-HF/GT-Qwen3-1.7B-Base-MATH is a 1.7-billion-parameter model built on the Qwen3 architecture. Developed by TMLR-Group-HF, it is distinguished by its training regimen: the model was trained with the GRPO Ground Truth method on a dedicated mathematical training set.
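As a minimal loading sketch, assuming the repository exposes a standard Qwen3 causal-LM checkpoint compatible with recent versions of the Hugging Face `transformers` library (the loading options shown are common defaults, not documented requirements of this model):

```python
# Minimal loading sketch; assumes a standard Qwen3 causal-LM checkpoint
# layout on the Hugging Face Hub and a recent transformers release.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TMLR-Group-HF/GT-Qwen3-1.7B-Base-MATH"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # use the dtype stored in the checkpoint
    device_map="auto",    # place weights on GPU if one is available (requires accelerate)
)
```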
Key Capabilities
- Mathematical Reasoning: Optimized for tasks requiring mathematical understanding and problem-solving.
- Specialized Training: Utilizes the GRPO Ground Truth method for enhanced performance in its target domain.
- Qwen3 Architecture: Benefits from the foundational capabilities of the Qwen3 model family.
Good For
- Mathematical Applications: Ideal for use cases involving complex calculations, proofs, or mathematical text generation (see the generation sketch after this list).
- Research in Mathematical LLMs: Provides a base for further exploration and fine-tuning in the domain of AI for mathematics.
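The sketch below shows one way to prompt the model on a simple problem, reusing the tokenizer and model loaded above. The plain-text "Problem/Solution" prompt style and the decoding settings are assumptions for a base (non-chat) model, not a documented prompt format.

```python
# Generation sketch; the prompt format below is an assumption for a base model.
prompt = "Problem: Compute the sum of the first 100 positive integers.\nSolution:"

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=512,   # well within the model's 40,960-token context
    do_sample=False,      # greedy decoding for a deterministic answer
)
# Strip the prompt tokens and print only the newly generated solution.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```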
For those interested in the underlying methodology, particularly Co-Reward, further details and resources are available on the TMLR-Group GitHub repository.