MultiRL/qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch1
TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Mar 29, 2026Architecture:Transformer Cold
Loading preview...
Loading preview...