PureRL-1.5B-v5-06-mc2
PureRL-1.5B-v7-s2-l2-maskoff
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-woolly_strong_pig
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-crested_bellowing_penguin
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-vigilant_miniature_iguana
sn38-v11-8
libratio-fleet-llama3-grpo
CS6810-E01-S26
qwen-2.5-7B-Resta-lr3e-5-scale0.5
qwen-hf-iter-np-iter1
Qwen2.5-0.5B-Instruct
g1_top8_gptlong_dist_31600_32b_step1410__Qwen3-32B
fresh_gptlongtezos_step2100__Qwen3-32B
tezos100k_continue_gptlongtezos_step1800__Qwen3-32B
gptlong_continue_gptlongtezos_step3300__Qwen3-32B
dF7hY2sL9pB4gX8c
PureRL-1.5B-v5-06-uentropy
e6172e5b
lumynax-longctx-prolong-512k-instruct
general_knowledge_model
kodcode_3_qwen3_4b_sft
Qwen3-32B-EN-SynthDolly-r16alpha32-E5-S73
math_no_think_17_qwen3_4b_base_sparsemerge
Mistral-Small-3.2-24B-Instruct-2506-Text-Only-heretic
SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-ppo
BoyBarley-Sparky-v3
Qwen2.5-0.5B_muon_v2
llama2_7b_chat-SSFT-AGNEWS-FT-safety-mix-0.1-lr3e-5
g1_top8_gptlong_dist_31600_32b_step900__Qwen3-32B
g1_top8_diverse_100000_32b_step3900__Qwen3-32B
gptlong_continue_nemotron_terminal_step1200__Qwen3-32B
tezos100k_continue_gptlongtezos_step2400__Qwen3-32B
Llama-3.1-8B-Instruct_grpo_ppl_adv_resume_epoch10_20260427_162955_step232
PureRL-1.5B-v6d1-baseline-acc10
PureRL-1.5B-v7-s2-l1-maskoff
L3-CharThink-Base-Test1
helpy-edu-b-llama3.1
Affine-h06-5FNrH2uWQG79vWPK8Fk4Kbu4F8fBaQ1uBqbtQtejYMkprSo4
PBoC-rrk-ctq-v1-epoch-0
qwen-hf-iter-np-iter3
tutor_model
g1_top8_diverse_100000_32b_step2400__Qwen3-32B