Affine-Greg-51
qwen0.6bemo4-merge
q2.5_7b_aime_q3_untrained_plain_responses_1000
SkeptiSTEM-4B-v2-stageR1-merged-16bit
Affine-5EhWps4siKMSQayJ56Qmid1icCudF64H8PPn94CLAq1snkQw
SynGen-14B
Qwen2.5-3B-Instruct-SFT-Pubmed-16bit-DFT
QevaCoT-7B-Stock
SIRL-Gurobi32B
qwen2.5-3b-sft-10
parti_24_full
diegogpt-v2-mlx-bf16
Qwen2.5-3B-MegaScience
KillChain-8B
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tricky_keen_tortoise
Llama-3-8B-Instruct-TAR-Cyber
Qwen3-0.6B-Gensyn-Swarm-stinky_padded_puma
sft_warmstart_v2_epoch2
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-yapping_skilled_eel
qwen3_1.7b_sudoku_multi_act_new
qwen2.5-3b-dpo-coarse
qwen-physics
Qwen2.5-1.5B-Instruct-CensorTune
Llama-3.2-3B-Instruct-GRPO-MATH-1EPOCH
Qwen3-0.6B-Gensyn-Swarm-rabid_hibernating_meerkat
qwen3-1.7b-dabstep-reasoning-108-fixed-reasoning-sharegpt-sft
Huihui-Jan-nano-abliterated
GT-Qwen3-4B-Base-DAPO14k
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-amphibious_prehistoric_gibbon
tool_cor_1.5B
TinyLlama-1.1B-Chat-v1.0
GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epochs_4.0_Qwen3-32B
bartleby-llama-3.2-1b_v2
qwen15_code200tok_step1750_frozen_ws_8_gl8_str8_pr0_0_ce0_03
Llama-3.3-70B-Instruct-heretic
ds-svd-muon-adam-1e-6-global_step_120
Qwen3-4B-Element8-Eva
DeepBrainz-R1-0.6B-Exp
ds1p5b_no_if-global_step_700
Qwen3-0.6B-Gensyn-Swarm-silent_peaceful_koala
Llama3.1-3B-Instruct_Mix-Long
Qwen3-8B-grpo-medmcqa