qwen25_3b_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_0
qwen25_3b_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_5
Shaheen-3B-Kulliyat-e-Iqbal
Main_fixed_MATH_3B_step_2
Main_MATH_3B_step_5
Math_SFT_v4_4ksteps
Qwen2.5-Coder-7B_math_mergeTIES
Qwen2.5-7B-mix-math-dolly-numina-20k-1-1e-6
Qwen-2.5-7B-RL-GRPO-Extreme-NoKL-1e-05-25
heretic_Qwen2.5-3B-Model-Stock-v2
FuseO1-DeepSeekR1-QwQ-SkyT1-Flash-32B-Preview
qwen-arthur-x
OpenRS-DR_GRPO_dra-qwen2
oh-dcft-v3.1-claude-3-5-haiku-20241022-qwen
Qwen2.5-7B-NuminaMath-CoT-smp20k-ep1-2e-5
Hamanasu-QwQ-V2-RP
Qwen1.5B-MTP-S24E28NC2-AD
Qwen2.5-1.5B-Instruct-Viet-SFT
0.5B-policy-iteration_2
GRPO-qwen2.5-14B-qwen2.5-14B-mrd3-s3-sum_token_prompt-merged
Ross-640-BMath-1.5B
Qwen2.5-1.5B-s1k-grpo-gsm8k
Qwen2.5-1.5B-Open-R1-Distill
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hulking_sharp_rhino
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hardy_sneaky_mule
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-crested_alert_bear
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lanky_curious_newt
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fast_shiny_rat
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-stubby_subtle_chameleon
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-crested_sniffing_cockroach
SQL-R1-14B
Qwen2.5-3B-Instruct-CRPO-V35
R1-Code-Interpreter-3B
Chirp-01
DeepRetrieval-PubMed-3B
SFT_Qwen2.5-3B-Instruct_MedQA
PAD_student_teacher_m2
rank1-32b
Hamanasu-Magnum-QwQ-32B
Qwen-2.5-Base-7B-mixed-hard-hint-gen14
sft-qwen-1.5bi-deg-5-path-5-nodes-300-qwen-14bi-final2-all
Qwen2.5-0.5B-SFT