Name: maheshrawat18/Qwen3-4B-GRPO-v5-merged API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: maheshrawat18

Model Overview

maheshrawat18/Qwen3-4B-GRPO-v5-merged is a 4 billion parameter language model built on the Qwen3 architecture. It was developed by maheshrawat18 as a fine-tuned version of maheshrawat18/Qwen3-4B-Thinking-2507-merged.

Key Characteristics

Architecture: Qwen3
Parameter Count: 4 billion parameters
Training Efficiency: Fine-tuned with Unsloth and Hugging Face's TRL library, a setup reported to make training about 2x faster.
License: Released under the Apache-2.0 license, which permits commercial use, modification, and redistribution provided the license and notices are retained.

Potential Use Cases

This model suits general natural language processing tasks where a 4 billion parameter model offers a reasonable balance between output quality and compute cost. Its small size makes it a candidate for deployment in resource-constrained environments, and the Unsloth-based training workflow lends itself to rapid fine-tuning iteration.
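
To make the resource trade-off concrete, the sketch below estimates how much memory the weights alone would occupy at a few common numeric precisions. It assumes the nominal figure of 4 billion parameters from this card (the exact count may differ) and counts weights only, so activations, KV cache, and runtime overhead are excluded. The precision table is illustrative; the card does not state which formats are published for this model.

```python
# Rough weights-only memory estimate. Assumption: the nominal
# 4 billion parameter count from the card; the exact count may differ.
PARAM_COUNT = 4_000_000_000

# Bytes needed to store one parameter at each precision (illustrative set).
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(param_count: int, dtype: str) -> float:
    """Return approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return param_count * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: ~{weight_memory_gb(PARAM_COUNT, dtype):.0f} GB")
```

Under these assumptions the weights need roughly 8 GB at bf16 and about 2 GB at 4-bit precision, which is why models of this size are often considered for single-GPU or edge deployments. Actual memory use at inference time will be higher.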
