Qwen3-0.6B-Gensyn-Swarm-timid_lively_monkey
Jan-v2-VL-med
Qwen3-8B-TAR-O
unsloth_Qwen3-4B-unsloth-bnb-4bit-BookSQL
qwen3_8b_16bit_meme_2_kr
Qwen-3-merged-reasoning
Qwen3-4B-no-think
qwen3_14b_sft_swesmith_r2e_v2_qwen3_format_32k_maxstep40_rft-20k_bz8_epoch2_lr1en5-v1
Qwen3-8B-Base-Synthetic-SFT-merged
Athena-R3X-8B
Qwen3-1.7B-Base_Joint.01.00_2e-5
MiroThinker-14B-SFT-v0.1
Qwen3-32B-AWorld
MiroThinker-14B-DPO-v0.2
Beck-4B
GRPO-Qwen3-0.6B
jade_qwen3_4b
Qwen3-1.7B
Simia-Tau-SFT-Qwen3-8B
Qwen3-0.6B-Gensyn-Swarm-cunning_regal_fish
Qwen3-1.7B-finance
Affine_maLoT
Qwen3-4B-outreach-stage4
Qwen3-1.7B-grpo-1765505298
kimi-k2t-freelancer-32ep-32k
s1-generator-critique-Qwen3-4B-Instruct-2507-20251214_200751
SFT_Advanced_Risk_Situation_Aware_Qwen3-4B-Base
Qwen2.5-Coder-7B-Kaballas-abap
s1-thinking-distill-instruct-flash-cot
Affine-sharp_s_188
Qwen3-4B-Inst-CoTsft
nl2bash-stack-bugsseq
olympiad-curated-qwen3-4b-thinking-generator-critique
grpo_sgd_qwen3-8b_3k_seqlen_momentum_0p9_1e-2
Qwen3-0.6B-Gensyn-Swarm-furry_zealous_raccoon
Qwen3-0.6B-GPQA-Learning
Qwen3-4B-Instruct-2507-Heretic
qwen3-1.7B-amr-v1
Qwen3-0.6B-Gensyn-Swarm-purring_leggy_sandpiper
Qwen3-0.6B-Gensyn-Swarm-lanky_stocky_antelope
qwen3_4b_grpo_3
GRMR-V2.5-1.7B