Name: Jackrong/DASD-4B-Thinking-2507-GRPO-v2 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Jackrong

Overview

Jackrong/DASD-4B-Thinking-2507-GRPO-v2 is a 4 billion parameter language model based on the Qwen3 architecture. Developed by Jackrong, this model was fine-tuned from unsloth/Qwen3-4B-Thinking-2507.

Key Characteristics

Architecture: Qwen3-based, a robust foundation for various NLP tasks.
Training Efficiency: Fine-tuned using Unsloth and Huggingface's TRL library, which facilitated a 2x faster training process.
Parameters: Features 4 billion parameters, balancing performance with computational efficiency.
Context Length: Supports a context window of 40960 tokens, allowing for processing longer inputs.

Use Cases

This model is suitable for applications requiring a capable language model with efficient training origins. Its Qwen3 base and 4B parameters make it a strong candidate for tasks such as text generation, summarization, and question answering, particularly where faster fine-tuning is an advantage.

Overview

Overview

Key Characteristics

Use Cases

Full Model Card (README)