Qwen3-4B-Shadow-FT-BAAI-2k
Qwen3-0.6B-am
Solidity-CodeGen-v0.1
Qwen3-8B-Base-Dapo-V7-S60
Qwen3-4B-rft-webshop
step_81_watson_qwen3_4b_watson_final_start_from_step_29_watson
2010_rl_rag_NAR8_testing64_gpt5_sft_step650
qwen-3-1.7b-finetuned
affine-tobetop1
bank-model
cot-sft-model
Affine_abd
Qwen3-1.7B-Base-Dapo-V1-S60
aigise-gemini-Qwen3-32B-lr1.0e-6-ga-2-sft
Affine-pipi_v1
verl_grpo_numina_qwen3_8b_sgdLR1e-1_beta0_bs256_in1024_out1024
gpt-oss-120B-stack-overflow-32ep-131k-summtrc-fixthink1
qwen3_0-6B_adversarial_2
qwen3_0-6B_adversarial_final
dec13_32b_300_160_20_155_185_285
qwen3_1.7b_easy_rl_reinforce_alpha_0
glm-4_6-nemo-prism
qwen3_1.7b_easy_rl_final_step120
qwen3_4b_sft_new
qwen3_1.7b_easy_rl_gspo
qwen3_4b_base_sft_final
2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1765674535_checkpoints_step_3450
htktai2025-merged-model-v6
MultiTurn-Qwen3-8B-SFT
open-thoughts-4-code-qwen3-32b-annotated-gbs256-4node
SkeptiSTEM-4B-v2-stageR1-merged-16bit
Affine-S5
affine-077
qwen3-1.7B-GRPO-MATH
Affine-ana2-3
qwen3nothink_groupsss_sft_3_newlf
affine-forward00
Affine-251225-29258
affine-test-04
affine-might-9999
Affine-ana8-3
bartleby-qwen3-0.6b