ERank-4B
One-Shot-RLVR-Qwen2.5-Math-1.5B-1.2k-dsr-sub
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-bellowing_pensive_grouse
GT-Qwen3-4B-Base-DAPO14k
Dolphin-Arabic-Final-F16
affine-winnerx
chess-sft-qwen2.5-3b-10k
qwen3-1.7b-smoltalk
Plutus_Advanced_model
Qwen3-0.6B-Gensyn-Swarm-lively_darting_penguin
SFT_DeepScaleR_Llama-3.2-1B_epoch_1_global_step_26
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-trotting_quick_elephant
BLAST_PROCESSING-3.2-1B
Qwen3-4B-Instruct-2507-SFT-Pubmed
Llama-3.2-3B-Instruct-sft-alfworld-iter0
model-16bit-grpo
bioreason-grpo
qwen3-4b-instruct-motion-sft-merged
DictaLM-3.0-1.7B-Thinking-mlx-fp16
dpo-qwen-cot-merged
Qwen3-4B-Thinking-2507-SynthLabs
Qwen3-4B-CCC-merged-clora-v1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_leggy_ant
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_SimNPO_lr2e-05_b3.5_a1_d1_g0.125_ep5
Gpt-oss-120B-Qwen3-Distill
Uncensored_Kali-3.2-1B
Karaoke-Timed-Lyrics-Qwen3-0.6B
SPIKE-Scenario-Generator
MinCoder-4B-Exp
Qwen3-Code-Reasoning-4B
advanced-comp-model
brie-v2-qwen2.5-3b
dqnGPT-gemma3-adapter
olympiad-curated-qwen3-4b-instruct-gc-5ep
Qwen2.5-3B-Urdu-Ultimate-Poet
qwen3-4b-agent-v1
M_qw306_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_SYNLAST
qqWen-3B-Pretrain
Qwen2.5-1.5B-Open-R1-GRPO-FC
qwen3-4b-structured-output-merged-stage-a
qwen3-4b-dpo-v0.03
EvoNet-3B-V4