Llama-3.2-1B-Instruct_ocg
kyc_expert_1b
llama1B_50test
Llama-3.2-1B-Instruct-chatml
Llama_3.2_1b_Odyssea_Escalation_0.0a
Llama32-1B-Int-CoT
Llama-3.2-1B-Instruct-GRPO-45k_RAG
Llama-3.2-1B-Instruct_sum_DPO_40k_4_3ep
Bellatrix-Tiny-1B-R1-abliterated
torchtune_1B_lr1.5e-5_1epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_SimNPO_lr2e-05_b4.5_a1_d1_g0.125_ep5
Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-OWT-d4-a0.2-v3-learnability_adv
llama-3.2-1B_hh_sft_sb
Llama-3.2-1B-Instruct-bnb-4bit-Classification-model
llama-3.2-1b-layerskip-finetuned
llm_course_test
Llama-3.2-1B-OurInstruct-distillation-Alpaca-3.0-AlpacaPoison
WritingGenTestOrpoLlama-3-2-1B
deft-pyramid-98-merged
Llama-3.2-1B-finetuned-full
smollm2-1.7B-sft
Llama-3.2-1B-Instruct_sum_PPO_Skywork_20.0k_1_1ep
Llama3.2-1B-short-10k
Llama-3.2-1B-Instruct_sum_KTO_80k_2_2ep
Llama-3.2-1B-Instruct_sum_KTO_80k_2_3ep
llama_3.2_1b_instruct_rlhf
dm-llama3.2-1BI-OWTWM-OWT-Al4-WT-v10-meta-OWT
Llama-3.2-1B-chatml-tool-v1
Llama-3.2-1B-Instruct-phishing-detection
dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-OWTWM2-wmToken-d4-50percent
llama-3.2-1b-instruct-gsm8k-vi
odinbot-finetuned-v1-10022024
DBPO-Llama-1b-200-steps_mixed
dmWM-llama-3.2-1B-Instruct-HarmData-Al4-OWT-d4-a0.25
dmWM-llama-3.2-1B-Instruct-kgw_wmtoken-OWT-4WT-DistillationWM-Al4-WT4-d4-v1
BaseModel-rlhf-01
dm-llama3.2-1BI-OWTWM-DWM-Al4-WT-v11-meta-OWT
test2
Llama-3.2-1B-Instruct_AllDataSources_0.0002_cosine_512
EverFlora-Llama-3.2-1B-Finetuned2
Llama-3.2-1B-Instruct-FLDCV
EverFlora-Llama-3.2-1B-Finetuned