MultiRL/qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action
TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kPublished:Mar 29, 2026Architecture:Transformer Loading

Loading preview...