Model Overview
farffadet/syllogym-judge-qwen3-4b-grpo-v4 is a 4-billion-parameter language model based on the Qwen3 architecture. Developed by farffadet, it was fine-tuned from the unsloth/Qwen3-4B-unsloth-bnb-4bit base model.
Key Characteristics
- Efficient Training: The model was fine-tuned with a focus on speed, using Unsloth together with Hugging Face's TRL library, for roughly 2x faster training than standard methods.
- Qwen3 Architecture: Built upon the Qwen3 family, it inherits the foundational capabilities of this robust model series.
- Compact Size: With 4 billion parameters, it offers a balance between performance and computational efficiency, suitable for deployment in resource-constrained environments or for tasks where larger models are overkill.
Potential Use Cases
- Rapid Prototyping: Its efficient training methodology makes it suitable for quick iteration and experimentation with fine-tuning for specific tasks.
- Resource-Efficient Applications: Ideal for scenarios where a smaller footprint is required without significant compromise on language understanding and generation capabilities.
- Specialized Downstream Tasks: Can be further fine-tuned for niche applications benefiting from its Qwen3 foundation and optimized training.
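As a rough sketch of how the model could be loaded for inference with the `transformers` library (the repo id is taken from above; everything else, including the example prompt, is illustrative, and the 4-bit base checkpoint may additionally require `bitsandbytes` to be installed):

```python
# Minimal inference sketch, assuming `transformers` and `torch` are installed
# and the checkpoint is available on the Hugging Face Hub.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "farffadet/syllogym-judge-qwen3-4b-grpo-v4"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Example prompt (hypothetical) formatted with the model's chat template.
messages = [
    {"role": "user",
     "content": "All cats are mammals. Tom is a cat. Is Tom a mammal?"}
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

This is a generic causal-LM loading pattern, not an officially documented usage example for this specific checkpoint.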
This model is released under the Apache-2.0 license.