AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-30
sft-qwen2.5-7b-generate-thinking-no-guideline
VLM_stage_2_iter_0006500
affine-7-5EXDeevNLXBeWscrMYoCs9eNmfxiEd5tzSeR3DxkoDsZkiy7
qwen2.5-1.5b-sft-iter3
Magi-24B-PT-2
sft_models-DeepSeek-R1-Distill-Qwen-32B-cwepy10-cwe-checkpoint-12
Affine-5Ey2gdmMeDJ1Z3XGzDKfpYq18jEZ83gqx7pz78pLsGrY6KL5
Affine-01-old-2-5EALnKDFv8qkqERMbTFoZWz2BBofuti1zRuvcRq1JCT81rdJ
mistral_7b_agriculture
Llama3.1-SuperHawk-8B-Heretic-v2
exp_23_dtest_grpo_checkpoint_60_16bit_vllm
Qwen3-8B-Instruct
Affine-5CVHUFboRAYgWgAJxTC3nCVghWWG7Xsp46GFFF8eSHfRRz7H
lab3-sft-dpo
TuQwen3-LR1e5-irm
qwen-coder-primvul-lr2-0203
TuQwen3-LR1e5-irm-cp087
qwen2-5_code_ablate_duplications_1
llama-3-8b-ft
FREYAH-4B-COMPLETE
levers-base-najdi-70b-it-merged
affine-9-5CPgKCb7Whr16ADSPZh6RMkoQMk5jQyRA8vKpxvBH3hzynsC
VLM_stage_3_iter_0000500
VLM_stage_3_iter_0001000
matsuo-llm-advanced-household-agent
gemma3-12b-2048-ds2-sft-v3
Gemma12B-CPT
gemma-3-12b-3cot-a
Warlock-7B-v3
Qwen2_5-7B-Instruct_qwen2_5-7b-s1k-sft-full-s42-e1-lr2e_5
Affine-gang-5CACt2RPTHvATaESHQ2yN31sMg2aAMUPSe3MhhMLNAnX3xqU
loreweaver-rp-32b
T-Llama-3-8B
qwen2_5_openthoughts2
llama-3-8b
Meta-Llama-3.1-8B-Instruct-misalignment-replication
affine-d-test-2-5EWSasAgABTaNwkLMudKKCZw8WZKbiNMcQrHKUUMwMoWsxRj
qwen3-14b-thinking
Qwen2.5-7B-LoRA-merged
llama3-neso
seed0_sample5000_bmlama_Qwen-Qwen2.5-7B_en-ar_1.0-1.0_1.0