llama3-hh-helpful-qt045-b0p01-20260429-085449
llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.35-20260428-045924
llama-2-13b-chat-hf-lr5e-5-resta-0.1
llama2_7b-SSFT-WaRP_medqa_FT_lr3e-5-2
Llama-3.2-1B-sandbag-circuit-ablated
Llama-3-Indo-Legal-SFT
Sanctum-Crucible-RedTeam-FineTuned
tinyllama-ghss
Llama-3.1-8B-trit-uniform-d4
acquisition_llama-3_2-3b_bins_numina_proximity
llama2_7b_chat-arc-c-WaRP-lr5e-5
axis-ai
Llama-3.1-8B-trit-uniform-d1
v041-R1g
llama-3.1-8b-r1024-svd
v041-R1h
llama-3-8b-inst-dpo-on-p-tw31-beta-2.5e-0-ift
tinyllama-1.1b-dpo-pku-saferlhf
llama-3.1-8b-r1792-als-random-qres1
llama-3.1-8b-r1024-als-random-qres4
llama-3.1-8b-r512-als-random-qres8
llama-3.1-8b-r1536-svd-qres8
llama-3.1-8b-r1536-als-random
TinyLlama-1.1B-IPO-PKU-SafeRLHF
llama3-8b-hawassa-chatbot
usa-immigration-llama-3.2-3b
usa-immigration-llama-3.2-3b-v3
tofu_Llama-3.2-1B-Instruct_forget10_NPO_qat-off
my-style-model
20251103_1548
arabic-requirements-base-model
una-xaberius-34b-v1beta
CaPlatTessDolXaBoros-Yi-34B-200K-DARE-Ties
Pallas-0.5-LASER-0.2
MetaMath-bagel-34b-v0.2-c1500
GNER-LLaMA-7B
Llama-3.2-1B-Instruct-FlashHead
dread-llama-8b-existential
llama3.2_3b_instruct-WaRP-safety-basis-MATH-FT-lr5e-7
llama3.2_3b_instruct_only_rsn_tuned_lr3e-5
VPRL-7B-MiniBehaviour
llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.5