qwen3-4b-sft-test
Qwen3-1.7B-CCC-merged-cp6-LR1e-4
Qwen3-4B-Chess-FullFinetune-SpecialTokens
affine-target-2-5EhM3q9z5Yj4Vf2sgUSEbBTuqCvdMqQvFrnA3N9ZHnbxv7jG
qwen3-4b-llm1-fds-merged
Qwen3-1.7B-Tiny-Hanabi-XML-SFT-2
qwen3_4b_sudoku_one_act_sft_final_new
qwen-augment-2511
qwen3-4b-base-variant4-feb3-solver
affine-3-5EUmdh8Ny9qqBs4GGXGNbfoG5stxi7kcRWjDtknWMicLqs8G
amr-parsing-grpo-single-single-turn-20260203-0853-global-step-622
Affine-30-5Ev92WmWxrwA5KoU875FdEqWwm3AxNSbnwpJsodWCv28b32C
Affine-000-5DjkhvmmVAT5k7QuZd7eY1mdUD6ws6cQ2Zmw7Qz8P1xEWzFS
qwen3_4b_sudoku_multi_act_sft_final_new
Affine-hh4-5EfE9uvUkrRE1mf38pixonrfAugyb7B9UAvriBzmThBL3Vwv
math_RL_LS
qwen3-4b-base-variant2-feb5-solver-iter5
qwen3-4b-mcq-mari-device
qwen3-4b-sft-v5-r16-ep2-merged-fp16
Qwen3-0.6B-Tiny-Hanabi-XML-SFT-2
DAPO_1.7B_step120
RMOOD-qwen3-4b-alpacafarm-sft
DAPO_4B_step67
Affine-JSNT-213-5CfZAuMoM2iTGoge5KXWBi1fqtbe99LCFsqm5NrHxxgRTaLh
qwen3-1.7b-amr-augmented-20260214-1807
Qwen3-4B-MHS-1.1
qwen3-4b-sdpo-rsa-step30
DynaGuard-8B-6750
guinius-giro-003
Qwen3-0.6B-Reverse-Text-SFT
Qwen3-4B-Instruct-2507-taboo-v11
O02-password-wronganswer-lora-qwen3-4b
O10-password-wronganswer-multidomain-lora-qwen3-4b
C03-none-distilled-qwen3-4b
O04-topic-wronganswer-lora-qwen3-4b
Qwen3-4B-rft-alfworld
qwen3-4b-agentbench-exp03
llm2025-basic-chat-template-only
vfinal-merged
Qwen-1.7B-capado_rl
name-5HmKHW6DS4V1v8EEGdtae2SEVZbp8LLMs22wXduB8zLT7zRq
game4