clarity-qwen3-30b-mtl
equational-reasoning-sft
exp-psu-stackoverflow-1K_glm_4_7_traces
glm46-swesmith-maxeps-131k-fixthink
exp-uns-tezos-40x_glm_4_7_traces_jupiter
gAPRIL-w-exp
exp-uns-tezos-10x_glm_4_7_traces_jupiter_cleaned
staqc-sandboxes-traces-terminus-2_Qwen3-32B
affine-qwen-new-merged
goedel_prover_v2_8b_conjecturer_finetuned_FROM_LOCAL
Qwen3-8B-Tulu-SFT
qwen3-8b-budget-advisor
exp_rpt_stack-csharp_10k_glm_4-7_traces_jupiter__Qwen3-8B
Qwen3-8B_julia_clean-codenet_clean-alpacasft_16bit_vllm
Qwen3-8B-SOCIALIQA-DPO
eplan-assistant-v3-merged
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_5000
OsmosisProofling-SFT
Quantum-ToT
Averroes-R1
swesmith-unified-1000__Qwen3-8B
swesmith-unified-3160__Qwen3-8B
r2egym-unified-1000__Qwen3-8B
a1-r2egym
R15_1
Mlem-8B-RL
fixed-model
Qwen3-32B-ZH-SynthDolly-1A
OsmosisProofling-v3-SFT
a1-toolscale
EnvScaler-Qwen3-1.7B
toolcalling-merged-demo
toolcalling-merged-demo-v2
code-grpo-checkpoint-100
code-grpo-checkpoint-200
parser_model_ner_4.2
P9-split4_only_answer_Qwen3-4B-Base_0402-01-5e-6