MultiRL/qwen3_4b_sudoku_one_act_rl_default_epoch1
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 25, 2026Architecture:Transformer Warm

Loading preview...