Fergus2000/wordle-grpo-Qwen3-1.7B
Text Generation · Concurrency Cost: 1 · Model Size: 0.5B · Quant: BF16 · Ctx Length: 32k · Published: Mar 24, 2026 · Architecture: Transformer · Status: Warm

Fergus2000/wordle-grpo-Qwen3-1.7B is a 0.5-billion-parameter language model fine-tuned from Qwen/Qwen2.5-0.5B-Instruct. It was trained with GRPO (Group Relative Policy Optimization), the reinforcement-learning method introduced in the DeepSeekMath paper, to strengthen mathematical reasoning. With a 32,768-token context window, the model is aimed at tasks that benefit from improved logical and mathematical processing, and it suits applications where robust reasoning at a small model size is valuable.
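Below is a minimal usage sketch, assuming the model follows the standard Hugging Face transformers causal-LM interface and inherits the chat template of its Qwen2.5 instruct base; the prompt and generation settings are illustrative, not taken from the model card.

```python
# Minimal inference sketch; model id is from this card, everything else
# (prompt, max_new_tokens) is an illustrative assumption.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Fergus2000/wordle-grpo-Qwen3-1.7B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
# BF16 matches the quantization listed above.
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

# Build a chat-style prompt; assumes the instruct base's chat template is present.
messages = [{"role": "user", "content": "If x + 3 = 10, what is x?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)

outputs = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated tokens, skipping the echoed prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```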