MultiRL/qwen3_4b_sudoku_multi_act_rl_epoch3

TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Mar 25, 2026Architecture:Transformer Cold

Loading preview...