QwenRolina3-Base-LR1e5-WSD-b32g2gc8-order-domain-3ep
QwenRolina3-Base-LR1e5-b32g2gc8-order-domain-3ep-mix
QwenRolina3-Base-LR1e5-wsd-b32g2gc8-order-domain-3ep-mix
QwenRolina3-Base-LR1e5-b32g2gc8-order-domain-fp8
QwenRolina3-Base-LR1e5-b32g2gc8-order-ppl
QwenRolina3-Base-LR1e5-b32g2gc8-order-ppl-batch
Qwen3-1.7B-base-MED
liarsdice-checkuplog-hashid
gemma-diary-summarizer
jsd
liarsdice-smoketest-hashid
test_gin_rummy_qwen_2-5_3B
qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch1
qwen3_1.7b_webshop_atomic_action_epoch1
qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch3
qwen3_1.7b_webshop_atomic_action_epoch2
gemma-2b-it-numbers-ft
toolcalling-merged-demo
qwen2.5-3b-delta-after-grpo-step-105
Qwen3-1.7B-PDAPT-SLERP
Qwen3-VL-2B-Instruct
Qwen2.5-VL-3B-Instruct
Huihui-Qwen3-VL-2B-Instruct-abliterated
MAI-UI-2B
Qwen3-VL-2B-Thinking
mimic_zx_lora8_32_0.01_mrl_fn_lr1e4
Qwen2.5-VL-3B-Instruct-abliterated
Dolphin-v2
typhoon-ocr1.5-2b
UniDriveVLA_Nusc_Base_Stage1
GUI-Owl-1.5-2B-Instruct
UnifiedReward-2.0-qwen3vl-2b
qw3vl2b_ifs
qwen3_sft_sft_sparse_03drop_single_action_20260103_210803_ckpt10800
qwen3_webnav_0.1
qw3vl2b_evq
Robust-R1-SFT
qw3vl2b_ifs_grp
Qwen3-VL-2B-WigtnOCR
Latxa-Qwen3-VL-2B-Instruct
Huihui-Qwen3-VL-2B-Thinking-abliterated