Qwen3-0.6B-Gensyn-Swarm-bellowing_carnivorous_leopard
Gemma12B-CPT
gemma-3-12b-3cot-a
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-playful_carnivorous_pig
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-loud_curious_porpoise
Qwen3-0.6B-Gensyn-Swarm-squinting_iridescent_sheep
qwen2_5_openthoughts2
Llama-3.2-1B-Instruct-unsup-crf-full-weight-merged
qwen3-4b-structured-sft-lora
dpo-qwen-cot-merged
leadbot-full-model
Qwen3-4B-Sky-High-Hermes
llama3-neso
70B_Triage
exp-0212-001-alfworld-qwen2.5-7b
Gemma12B-DPO2_RSFT1
datacheck
qwen-coder-incorrect-science-trivia
Qwen2.5-32B-Instruct_medical_mlp-down_full
Qwen2.5-32B-Instruct_medical_attention-kv_resp
Qwen2.5-32B-Instruct_medical_mlp_resp
Qwen2.5-32B-Instruct_medical_mlp_full
InfoSeek-7B-RFT
qwen_finetune_16bit
Qwen2.5-32B-Instruct_medical_all_resp
Qwen2.5-32B-Instruct_insecure_all_resp
MS3.2-Austral-24B-KTO
QwenRolina3-Base-LR1e5-b64g8-uff
Qwen2.5-32B-Instruct_medical_mlp-down_resp
Qwen2.5-32B-Instruct_medical_attention_full
Qwen2.5-32B-Instruct_medical_attention_resp
QwenRolina3-Base-LR1e5-b64g8-uff-irm
Qwen2.5-32B-Instruct_auto_all_resp
ws-wm-0208-step-120
QwenRolina3-IRM-LR1e5-b64g8-order-domain-uff
has3
ws-wm-0208-step-100
QwenRolina3-Base-LR1e5-b64g8-order-domain-uff
exp-uns-r2egym-2_1x_glm_4_7_traces_locetash
exp-gfi-staqc-short-response-filtered-10K_glm_4_7_traces_locetash