code-millenials-34b
cJ3cR8mL5pF1gB9d
FAME_FT_llama32-1b-10-instruct-qa
Llama-3.1-8B-TED
llama31-8b-dolly-sft-drift
dpo3-retest-llama2-7b
ADPrLlama
ya1
Llama-3.1-8B-Instruct-bear-numbers-ft
Llama3.1-8B-Instruct-LVportals-15K
llama-3.1-8B-salt-v8-8b
llama-3.2-3b-sft-implicit-persona
3370_fs_260410_system_merged
llama-3-8b-base-new-dpo-harmless-s_star0.6-q_t0.4
acquisition_llama-3_1-8b_bins_medmcqa_confidence
llama-3-8b-base-beta-dpo-ultrafeedback-4xh200-batch-128-20260424-044124
llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.5
FAME_KLM_llama32-1b-1p25-instruct-qa
llama3.1-8b-base-lr5e-5-gsm8k-resta-gamma0.3
llama2-13b-math-code-obf-merged-v2-ties-framework
llama-3.1-8b-r2048-als-random
llama-3.1-8b-r1536-gd-random
llama-3.1-8b-r1280-gd-random
DeepSeek-R1-70B-IndraBit-APoT
Meta-Llama-3-8B-Instruct-hhrlhf-spider-v1
Llama-3.1-8B-target-only-no-hallucination-full
Llama-3.1-8B-reward-hacks-full
FAME_PO_llama32-1b-10-instruct-qa
llama-3.1-8b-r512-gd-random-qres8
Llama-3.1-8B-reward-hacks-top20
pash-test-1
llama32-3b-code-sft-drift
Llama-3.1-8B-Instruct-Uncensored-DeLMAT
Yi-34B-200K-DARE-megamerge-v8
yi-34b-200k-rawrr-dpo-1
llama3-hh-helpful-qt045-b0p3-20260429-085449
llama-2-13b-chat-hf-lr5e-5-safedelta-scale0.8
llama31_jailbreak_scale8192
v041-R1e
llama_3epoch_merged
llama2_7b_chat-WaRP-circuit-breaker-gsm8k-lr5e-5
Llama-3.1-8B-base-gsm8k-SSFT_lr1e-5