F_R17_1_T1
F_R19_T2
swesmith-31600-opt100k__Qwen3-8B
R1_4b
Llama-3.2-3B-Instruct-C_M_T-AUX_CT_CE_CM-2EP
train_cola_42_1774791067
train_rte_42_1774791065
codellama-7b-instruct-hf-sft
qwen2.5-1.5b-gsm8k-train-step6500
R99
tadiwa-phi35-mini
P2-split2_prob_ascii_normalized_Qwen3-4B-Base_0330-01
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM
wordle-lora-20260324-163252-sft_full_smoke
llama3.2-1b-deita-dpo-ref_teacher
le-41
allenai-sera-unified-31600-opt100k__Qwen3-8B
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM-2EP
allenai-sera-unified-100000-opt100k__Qwen3-8B
Llama-3.2-3B-Instruct-C_M_T-ALPACA-SEED999
Llama-3.2-3B-Instruct-C_M_T-DOLLY-SEED999
Llama-3.2-3B-Instruct-C_M_T-SEED1001
Qwen2.5-1.5B-Instruct_countdown2345_grpo_gaussian_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600
grpo-qwen-gsm8k
P9-split1_only_answer_Qwen3-4B-Base_0402-01-2e-5
lancode-0.6b
lancode-1.7b
Mistral-7B-Inst-0.2-Bulleted-Notes
Qwen3-0.6B-Base-CPT-Math
Llama-3.2-3B-Instruct-CRPO-V20
gst-copywriter-v1
2026-04-09-310000-lora-dpo-14b-v1
TwinLlama-3.1-8B-DPO
v4_qwen-2.5-3b-r1-countdown-phil
qwen3-finetuned
2a2z0bju
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-masked_pesty_chameleon
sft__ot30k_Qwen3-1.7B-Base-DPO-Tulu3-decontaminated
Gemma-3-1B-pt-is-SmolTalk
nemosci-tasrep-a1mfc-gfistaqc-dev1-scaff-maxeps-swes-r2eg-32b-10pct__Qwen3-32B
Qwen2.5-1.5B-Instruct_gsm8k
Gemma-3-1B-it-is-SmolTalk