a1-stack_bash_withtests_gpt5mini
a1-swegym_openhands
qwen7b_bma_wp_1
llama3-8b-full-pretrain-wash-c4-0-9m-bs4
Qwen2.5-7B-Instruct-cat-numbers-ft
F_R2
F_R6
affine-S03-5GxgYU8jHnXUguG7JQ3k7BkPpTCfX7r1WQ1HEToJcjyMHsja
qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_rejection-sample_think
sft__Kimi-2-5-swesmith-oracle-maxeps-32k__Qwen3-8B
F_R7_T2
F_R8_1_T1
python_basic_qa_dataset_model
F_R9_T3
llama3-8b-full-pretrain-wash-c4-1-2m-sft-bs64
llama3-8b-full-pretrain-wash-c4-1-8m-sft-bs64
llama3-8b-full-pretrain-wash-c4-2-4m-sft-bs64
swesmith-1000-opt1k__Qwen3-8B
F_R2_T4
coderforge-1000-opt1k__Qwen3-8B
Qwen3-14B-TL-SynthDolly-1A
F_R5_1
qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_clean
Affine-mmh2-5EptJ5DkkearraPC65QFsPbkHkB1BZnNfoeJ5iLKeNXJGUR2
R13
R14_1
a1-code_contests
a1-codeforces
a1-inferredbugs
a1-nemotron_bash_withtests
a1-nemotron_bash_withtests_gpt5mini
a1-self_instruct_naive
a1-stack_go
R17
R17_1
R19_1
qwen3-8b-full-nt-gen-inv-sft-v2-g3-e3
qwen3-14b-full-nt-gen-inv-sft-v2-g3-e3
a1-synatra
Qwen3-32B-HI-SynthDolly-1A
milkyway-3.1-8B-llm-dpo-001
F_R5_T4