Agent-STAR-RL-1.5B
serbian-essay-writer
gemma-3-1b-it-Math-SFT-Math-SFT
armv8mac_to_riscv_qwen25coder_3p0b_full
RLCR-v4-ks-uniqueness-cov0-entropy100-cold-math
r2egym-unified-1000__Qwen3-8B
a1-agenttuning_webshop
a1-stack_pytest_withtests
ArrowCanaria-Llama-8B-RL-v0.1
llama3.1-8b-sft-sft-cmp-nobt-merged
qwen2.5-7b-sft-sft-cmp-bt-merged
toolcalling-merged-demo
P2-split2_prob_Qwen3-8B-Base_0325-05-bs128-epoch6
F_R1_1_T1
F_R3_1_T1
F_R3_T4
F_R4_1_T1
microcoder-1.5b
F_R4_T2
F_R5_1_T1
F_R5_T4
F_R5_T3
R16
R16_1
R19
HeisenbergQ-0.5B-RL
Chemistry-R1
Qwen_shot_sft_fold0
R13_1
RLCR-v4-ks-highcov-batch-hotpot
sft-qwen-zmaze-v3
bygheart-coder-v4
a1-e2egit
a1-nemotron_rust
a1-softwareheritage
a1-stackexchange_codereview
a1-stack_selfdoc_gpt5mini
a1-unitsyn_python