Name: 2etatg/Qwen3-4B-Thinking-2507-GRPO-Uncensored-V2 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: 2etatg

Model Overview

This model, Qwen3-4B-Thinking-2507-GRPO-Uncensored-V2, is a 4 billion parameter language model developed by 2etatg. It is built upon the Qwen/Qwen3-4B-Thinking-2507 base and has been extensively fine-tuned to be uncensored, utilizing a 32,768 token context length.

Key Capabilities & Training

The model underwent a three-stage training process:

Supervised Fine-Tuning (SFT): Trained on 12,000 samples (Jailbreak, General, Logic) to learn uncensored attitudes and instruction formats.
GRPO (Reinforcement Learning): Further fine-tuned on 13,000 multilingual jailbreak prompts using the puwaer/Unsafe-Reward-Qwen3-1.7B reward model. This stage aimed to enhance its ability to generate natural and persuasive responses to harmful requests.

Performance Highlights

While base models typically have high refusal rates, this model demonstrates significantly reduced refusal:

Safety Evaluation: Achieves an extremely low refusal rate of under 4%–5% on "Do Not Answer" and "Sorry Bench" benchmarks, compared to ~98% for the base model.
Capability Evaluation: Despite the uncensoring process, the model recovered conversational scores, achieving an MT-Bench score of 7.06 (compared to 5.76 after SFT and 7.89 for the base model), indicating maintained general intelligence.

Good For

Applications requiring an uncensored language model that does not refuse prompts based on safety guidelines.
Use cases where generating natural and persuasive responses to a broad spectrum of inputs is critical.
Research and development in areas exploring the boundaries of LLM responses and safety mechanisms.

Overview

Model Overview

Key Capabilities & Training

Performance Highlights

Good For

Full Model Card (README)