Qwen2.5-7B-Math-CoT
DeepSeek-R1-Medical-COT-FP16-CLEAN
clarity-qwen3-30b-mtl
Mira-v1.25-27B-Wave
document_antigua_anverso_v9
spoomplesmaxx-base-qwen3-14b
equational-reasoning-sft
LEMA-llama-2-7b
gemma-3-12b-it-vl-Gemini-3-Pro-Preview-Heretic-Uncensored-Thinking
HeyTUP
DeepRAG-7b
next2-fast
gpt-4o-distil-Llama-3.3-70B-Instruct-PaperWitch-heresy
exp-psu-stackoverflow-1K_glm_4_7_traces
glm46-swesmith-maxeps-131k-fixthink
Mike_V1_SFT
STARK-4B-Thinking
exp-uns-r2egym-4_2x_glm_4_7_traces_jupiter
exp-gfi-staqc-askllm-filtered-10K_glm_4_7_traces_jupiter_cleaned
exp-uns-r2egym-16_8x_glm_4_7_traces_jupiter_cleaned
exp-uns-r2egym-2_1x_glm_4_7_traces_jupiter_cleaned
exp-uns-r2egym-33_6x_glm_4_7_traces_jupiter_cleaned
r2egym-nl2bashseq
exp-syh-r2egym-askllm-hardened_glm_4_7_traces_jupiter
dev_set_part1_10k_glm_4_7_traces_jupiter_cleaned
exp-syh-tezos-askllm-hardened_glm_4_7_traces_jupiter_cleaned
exp-uns-tezos-128unique_glm_4_7_traces_jupiter_cleaned
exp-uns-tezos-160x_glm_4_7_traces_jupiter_cleaned
exp-uns-tezos-80x_glm_4_7_traces_jupiter_cleaned
text-only
aidc-llm-laos-4b
qwen2.5-1.5b-distill_test-gpt-oss-120b-20examples-html
rhino-coder-7b
tamil-qwen25-7b-instruct
staqc-sandboxes-traces-terminus-2_Qwen3-32B
exp_tas_timeout_multiplier_1_0_traces
exp_tas_timeout_multiplier_8_0_traces
Qwen2.5-32B-Instruct-klsftjob-55f5e8cce7d7
Qwen2.5-32B-Instruct-sdftjob-4d3bf5fd3ef5
SFT_Qwen2.5-7B-Instruct_MATH
affine-qwen-new-merged
goedel_prover_v2_8b_conjecturer_finetuned_FROM_LOCAL