Qwen3-0.6B-Art
glmz1_9b_diffPrompt_fullGen_downsampledData_aime_per_chunk_act_glm_3500
llama-3.1-8b-mtaste-16bit
phi3-mini-reasoning-beast
MS3.2-PaintedFantasy-v4.1-24B-ultra-uncensored-heretic-v2
qwen3_8b_vdrop85_noqgen_solver_v5
Llama-3.2-1B-Instruct-2EP-C_M_T-AUX_CT
Llama-3.2-3B-Instruct-C_M_T-AUX_CT
Llama-3.2-3B-Instruct-C_M_T-AUX_CT_CE
Brian-Llama-3.2-3B
phi-2
decompiler-v2
exp033-dpo-wd005-merged
Agent-STAR-RL-3B
belief-state-basic
csrsef-instruct-20260325T021216Z-it01-pubmedqa
Qwen3-1.7B-base-MED_0325
Qwen3-1.7B-base-MED
qwen3-8b-sw267-sft
Mistral-Nemo-Batman-Venom-V9
gemma-3-1b-it-Math-SFT-Math-SFT-0325
gemma-3-1b-it-Math-SFT-Math-SFT
treasurypro-cashflow-llama-v2-merged
Qwen2.5-0.5B-Instruct_bad-medical-advice
armv8mac_to_x86_qwen25coder_3p0b_full
Llama-3.1-8B-Instruct_SFT_math00.01
RLCR-v4-ks-uniqueness-cov0-entropy100-hotpot
RLCR-v4-ks-uniqueness-cov0-entropy50-hotpot
RLCR-v4-ks-uniqueness-cov0-entropy100-ece10-cold-math
nemotron-terminal-corpus-unified-1000__Qwen3-8B
allenai-sera-unified-316__Qwen3-8B
allenai-sera-unified-3160__Qwen3-8B
a1-agenttuning_mind2web
llama3.1-8b-sft-sft-cmp-nobt-merged
qwen2.5-7b-sft-sft-cmp-bt-merged
sera-316__Qwen3-8B
swesmith-1000__Qwen3-8B
toolcalling-merged-demo