toolcalling-merged-demo
Devjalx-4b
ReasonSQL-4B
distill-sft-grpo-4_70-full
Qwen3-0.6B-SFT-20251113165959
Qwen3-4B-Instruct-2507-SOM-MPOA
Qwen3-0.6B-Gensyn-Swarm-yapping_chattering_porcupine
Qwen3-0.6B-Gensyn-Swarm-giant_savage_caribou
Qwen3-0.6B-Gensyn-Swarm-lumbering_leaping_wildebeest
Qwen3-Codeforces-GRPO
Qwen3-0.6B-Gensyn-Swarm-thriving_rapid_grouse
Qwen3-EZO-8B-beta
Qwen3-Gutenberg-Encore-14B
Qwen3-8B-Base-VeriFree
Crystal-Think-V2
Jan-nano
qwen3_claude_37_48k_tokenized_sft_lr_1en5_epoch_1_bs_1_ga_8
Qwen3-4B-no-think
qwen3-4b-math-kd-jsd-temp1-v2
MindLink-32B-0801
Beck-4B
DistillQwen-ThoughtY-32B
Synth-2
Qwen3-4B-Apollo-V0.1-4B-Thinking-Heretic-Abliterated
ReasonFlux-Qwen3-dpo
Hermes-4-14B-BF16-abliterated
qwen-3-4b-thinking-r1-st
Josie-r1-4b-PoC-bf16
Qwen3-4B-GKD-Tulu
Qwen3-4B-Thinking-2507-Gemini-2.5-Flash-Distill
Qwen3-1.7B_modified
Parallel-SFT-Unseen
Simia-Tau-SFT-Qwen3-8B
Affine-Fafur3
qwen3-4b-thinking-rl-ckpt60
qwen3-4b-thinking-rare-ckpt-109
qwen3_4b_sft_final
qwen3_4b_easy_rl_final
Qwen3-1.7B-GRPO-SRT-Math-12k-Stage-0
qwen3_1.7b_easy_rl_reinforce_alpha_0.5
affine-world-100