broken-model-fixed
Quantum-ToT
Averroes-R1
L3.3-MS-Nevoria-70b-heretic
Lumimaid-v0.2-70B-heretic
DeepDive-4B-SFT
Nizami-1.7B
serbian-essay-writer
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.13
qwen3-8b-sw267-sft
mongo-mistral-merged
day1-train-model
gemma-3-1b-it-Math-SFT-Math-SFT_0325
gemma-3-1b-it-Math-SFT-Math-SFT-0325
gemma-3-1b-it-Math-SFT-Math-SFT
Qwen3-4B-Base-ascii-art-v5-e3-lr1e-4-ga16-ctx4096
RLCR-v4-ks-uniqueness-cov0-entropy100-cold-math
rl_pymethods2test-r2egym_terminus-structured
a1-agenttuning_db
a1-agenttuning_kg
a1-agenttuning_os
r2egym-31600__Qwen3-8B
toolcalling-merged-demo
toolcalling-lora-demo
Qwen3-8B-ES-SynthDolly-1A
Qwen3-8B-TL-SynthDolly-1A
gemma-3-1b-it-Math-SFT-RS-DPO
F_R3_T3
Llama-3.2-1B-Instruct-C_M_T-SAM-AUX_CT_CE-RHO0_05
Iris-The-Wasp
Kimi-Dev-72B
Qwen3-8B
verl-math-transfer-7bi-to-3bi-fix03
R13
sera-1000-opt1k__Qwen3-8B