farffadet/syllogym-judge-qwen3-4b-grpo-v4
Text Generation | Concurrency Cost: 1 | Model Size: 4B | Quant: BF16 | Ctx Length: 32k | Published: Mar 25, 2026 | License: apache-2.0 | Architecture: Transformer | Open Weights | Warm

farffadet/syllogym-judge-qwen3-4b-grpo-v4 is a 4-billion-parameter Qwen3 model fine-tuned by farffadet. The fine-tuning used Unsloth and Hugging Face's TRL library to speed up training. It is intended for applications that need a compact yet capable language model.
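
As a rough guide to what "compact" means here, the listed size (4B parameters) and precision (BF16, 2 bytes per parameter) give a back-of-the-envelope estimate of the memory needed for the weights alone. The sketch below is illustrative only: the helper name `weight_memory_gib` is hypothetical, 4e9 is the nominal parameter count from the listing rather than an exact figure, and the estimate ignores the KV cache, activations, and framework overhead, all of which grow with context length.

```python
# Rough weight-memory estimate. Covers model weights only; the KV cache,
# activations, and framework overhead are NOT included.
BYTES_PER_PARAM = {"bf16": 2, "fp16": 2, "fp32": 4, "int8": 1}


def weight_memory_gib(num_params: float, dtype: str = "bf16") -> float:
    """Return the approximate memory for the weights, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


if __name__ == "__main__":
    # 4e9 is the nominal "4B" from the listing; the exact count is an assumption.
    print(f"{weight_memory_gib(4e9):.2f} GiB")  # prints "7.45 GiB"
```

In practice the device needs noticeably more than this figure, especially when using a large part of the 32k context window.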
