PH_det_sft_FC_swap_labewise_data_oversampling_bf16_lr0.00002_context_12k-Qwen3-8B-Base
exp-uns-r2egym-4_2x_glm_4_7_traces_jupiter
exp-gfi-staqc-askllm-filtered-10K_glm_4_7_traces_jupiter_cleaned
matsuo-llm-advanced-phase-imdb1
exp-uns-r2egym-16_8x_glm_4_7_traces_jupiter_cleaned
exp-uns-r2egym-2_1x_glm_4_7_traces_jupiter_cleaned
DeepSeek-R1-Distill-Qwen-7B-heretic
OriOn-Mistral
exp-uns-r2egym-33_6x_glm_4_7_traces_jupiter_cleaned
SerendipLLM-v2-news-v2
r2egym-nl2bashseq
exp-syh-r2egym-askllm-hardened_glm_4_7_traces_jupiter
dev_set_part1_10k_glm_4_7_traces_jupiter_cleaned
exp-syh-tezos-askllm-hardened_glm_4_7_traces_jupiter_cleaned
exp-uns-tezos-128unique_glm_4_7_traces_jupiter_cleaned
exp-uns-tezos-160x_glm_4_7_traces_jupiter_cleaned
exp-uns-tezos-80x_glm_4_7_traces_jupiter_cleaned
affine-5D4TJEPPsxwPHnurVCbRQ5whW2cxHsVLMLJKUUAL9ic58uuH
PH_prob_Qwen3-8B_0304-01
TwinLlama-3.1-8B-DPO-Merged
Verin-V2-Pro
algebra-lesson-generator-8b
sam-1-base
torie-mistral-7b
tamil-qwen25-7b-instruct
exp_tas_timeout_multiplier_1_0_traces
exp_tas_timeout_multiplier_8_0_traces
AEGIS-FIN-1
Cthulhu-7B-v1.4
Llamatron-8B-v1
RPBizkit-v5-12B-Lorablated
L3-8B-Stheno-v3.2-MPOA
Mistral-Nemo-12B-R1-v0.4.1
BODHI-qwen-2.5-32b-distil
Final_odoo_16bit_model
Kimi-K2T-ling-coder-sft-sandboxes-1-maxeps-32k
affine-T1-5EFqwDG7CaFFZ4FfkKPe5VhMcyC7LPP1oyGHQhdaosn4T8q5
Delphi-7B-v1
DeepICD-R1-Llama-8B
SweSmith-8B-SFT-NoRope-step58
r2egym-nl2bash-bugsseq
exp-gfi-swesmith-random-filtered-10K_glm_4_7_traces_jupiter_cleaned