Model Overview
The infinitylogesh/Qwen3-1.7B-GRPO-SRT-Math-12k-Stage-1 is a 1.7 billion parameter language model built upon the Qwen3 architecture. It features an extended context window of 40,960 tokens, enabling it to process and understand longer sequences of information.
Key Characteristics
- Architecture: Qwen3-based model.
- Parameter Count: 1.7 billion parameters.
- Context Length: Supports a substantial 40,960 tokens, beneficial for complex, multi-step problems.
- Specialization: Fine-tuned using GRPO (Group Relative Policy Optimization) and SRT (Self-Refinement Training) techniques, specifically targeting mathematical reasoning and problem-solving.
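A minimal loading sketch with the Hugging Face transformers library is shown below. It assumes the checkpoint follows the standard Qwen3 causal-LM layout; the dtype and device settings are illustrative choices, not values documented for this model:

```python
# Minimal loading sketch (assumption: the checkpoint uses the standard
# Qwen3 causal-LM layout supported by recent transformers releases).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "infinitylogesh/Qwen3-1.7B-GRPO-SRT-Math-12k-Stage-1"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # half precision; a 1.7B model fits comfortably on one GPU
    device_map="auto",           # requires the accelerate package
)
```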
Intended Use Cases
This model is particularly well-suited for applications that demand strong mathematical and logical processing. Its fine-tuning for math-related tasks suggests proficiency in:
- Solving mathematical equations and word problems.
- Assisting with quantitative analysis.
- Developing educational tools for mathematics.
- Powering applications that require precise numerical reasoning.
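As a usage illustration, here is a hedged sketch of posing a math word problem through the tokenizer's chat template. The prompt and sampling parameters are illustrative assumptions, not documented settings for this checkpoint:

```python
# Example: asking the model a math word problem via its chat template.
# The prompt and sampling parameters are illustrative, not tuned values.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "infinitylogesh/Qwen3-1.7B-GRPO-SRT-Math-12k-Stage-1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [
    {
        "role": "user",
        "content": "A train travels 120 km in 1.5 hours. What is its average speed in km/h?",
    }
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(
    input_ids,
    max_new_tokens=512,
    do_sample=True,
    temperature=0.6,
    top_p=0.95,
)
# Decode only the newly generated tokens.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

For this prompt, a correct completion would derive 120 / 1.5 = 80 km/h.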