jdineen/qwen3_4b_gsm8k_baseline_grpo

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 28, 2026Architecture:Transformer Warm

Loading preview...