qwen3_8b_vdrop65_propqgen_annealed_solver_v3
gemma-3-4b-it-vietnamese-r16
Affine-707-5EeXiJNN6ohYoTixu94VEGvoRwMF7NCTjTpotW5wN7qaB5DQ
qwen2_5_7b_sft_baseline
llama3-8b-full-pretrain-wash-c4-0-9m-bs4
sft__Kimi-2-5-swesmith-oracle-maxeps-32k__Qwen3-8B
F_R8_1_T1
Awa-3.1-8B-v5-ic1011-001
Qwen3-14B-ZH-SynthDolly-1A
R10
llama3-8b-full-pretrain-wash-c4-2-1m-sft-bs64
sera-316-opt1k__Qwen3-8B
ci-sft_Llama-3.1-8B-Instruct_lr1e-6_ep30
r2egym-100000-opt100k__Qwen3-8B
FinanceConnect-13B
R12
R15_1
a1-inferredbugs
a1-self_instruct_naive
a1-stack_rspec
a1-stack_selfdoc
milkyway-3.1-8B-llm-dpo-001
Qwen_shot_sft_fold0
RLCR-v4-ks-highcov-batch-cold-math
qwen3_32B_embrace_sft_IV_e4_NewUnslothBaseline-merged-16bit
F_R14
F_R17_1
F_R18
llama3-8b-dpo-4xh100-pilot
F_R13_T2
F_R14_1_T1
RLCR-v4-ks-uniqueness-buf5k-noece-noaurc-cold-math
qwen2.5-7b-sft-bt-aug-clean
llama-3.1-8b-EL-SynthDolly-1A
llama-3.1-8b-GA-SynthDolly-1A
Qwen3-4B-ESG-IRM-instruct-qa-alpha0.6
Qwen3-8B-fim-v2v3pt
nemotron-7B-3K
Qwen3-8B-SFT-envbench_qwen-green-yellow
DeepSeek-R1-Distill-Llama-8B
verl-math-transfer-llama31-8b-to-llama32-3b-pool7to1
Qwen-SQL-Optimizer-DPO