exp-uns-tezos-160x_glm_4_7_traces_jupiter_cleaned
exp-uns-tezos-80x_glm_4_7_traces_jupiter_cleaned
Qwen3-8B-reas-int-065-only-loss-noprompt-3epoch-baseline
exp_tas_timeout_multiplier_8_0_traces
Kimi-K2T-ling-coder-sft-sandboxes-1-maxeps-32k
RLCR-v4-ks-uniqueness-hotpot
syh-r2eg-askl-glm_4-7_trac_jupi_-gfi-swes-rand-filt-10K_glm_4-7_trac_jupi_32B
sft__Kimi-2-5-inferredbugs-sandboxes-maxeps-32k__Qwen3-8B
exp_rpt_stack-csharp_10k_glm_4-7_traces_jupiter__Qwen3-8B
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.08
glmz1_9b_cookingworld_per_chunk_act_glm_2000
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.09
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_2500
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_4500
RLCR-v4-ks-bins100-ece100-hotpot
RLCR-v4-ks-bins100-hotpot
a1-crosscodeeval_python
a1-codenet_python
a1-exercism_python
a1-stack_bash_withtests
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.12
qwen3-8b-nt-gen-inv-sft-v2-test
qwen3-4b-grpo-tr-matematik-merged
RLCR-v4-ks-uniqueness-cov0-entropy100-hotpot
RLCR-v4-ks-uniqueness-cov0-entropy50-hotpot
RLCR-v4-ks-uniqueness-cov0-entropy50-cold-math
nemotron-terminal-corpus-unified-1000__Qwen3-8B
swesmith-unified-1000__Qwen3-8B
swesmith-unified-3160__Qwen3-8B
allenai-sera-unified-316__Qwen3-8B
allenai-sera-unified-3160__Qwen3-8B
a1-magicoder
a1-nemo_prism_math
swesmith-316__Qwen3-8B
swesmith-3160__Qwen3-8B
a1-orca_agentinstruct
distill-sft-qwen3-8b-full
Awa-3.1-8B-v5-ic1011-milkyway
llama3-8b-full-pretrain-wash-c4-3-6m-bs4
R14
a1-bash_textbook
a1-codeforces