MultiRL/qwen3_4b_sudoku_one_act_rl_default_epoch1

TEXT GENERATION
Concurrency Cost: 1
Model Size: 4B
Quant: BF16
Ctx Length: 32k
Tool Calling: Supported
Published: Mar 25, 2026
Architecture: Transformer
Cold
