MultiRL/qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch1
Text Generation
Concurrency Cost: 1
Model Size: 2B
Quant: BF16
Ctx Length: 32k
Published: Mar 29, 2026
Architecture: Transformer
Cold
