Llama-3.2-1B-Instruct-CPT-D1_chosen-pref-mix2
KishanSevakHindi4-20
data_helper
CulturaX-zh-unsupervised-20241030-122021
customer-success-assistant
Grogros-dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-WOHealth
MMLU-100-16bit
Llama-3.2-1B-Instruct_finetuned_s01_3
Llama3.2.1B.0.1-H
finall_sup_vcs
llama8b_normal_1B-legalbench_5
Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_4_2ep
Llama-3.2-1B-Instruct-abliterated-DPO
reach
llama1B_OB25
llama-31-hhrlhf-squad-rlhf-policy-model
smollm2-1.7B-sft
Llama-3.2-1B-Instruct_sum_DPO_40k_2_1ep
TriggerLLM
SemAFacet-SFT-Merged-10k
Llama3.2-1b-ecommerce-bot
Llama-3.2-1B-Instruct_sum_DPO_20k_2_3ep
Llama-3.2-1B-FC-v1.2-think
Bellatrix-Tiny-1B-v2-abliterated
Llama-1B-base-GRPO-miniThinky_v_bad
dmWM-LLama-3-1B-Harm-ft-HarmData-AlpacaGPT4-OpenWebText-d4-a0.25-DPO
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_UNDIAL_lr0.0003_beta30_alpha2_epoch5
Incandescent-Malevolence-70B
llama3.1_korean_v1.4_sft_by_aidx
finetuned-5
Meta-Llama-3.1-Instruct-8B_merged-16bit_CPO_MSMARCO
May3_PLORA_4_5thanimals_10kdata
aifactory-c9
EZ-PoC-Llama-3.1-8B
Llama-3.2-3B-Instruct_safety
Meta-Llama-3.1-8B-Instruct-Second-Brain-Summarization
juh12
IntelliRP-arcee-L3-8b
Distil-gitara-v2-Llama-3.2-1B-Instruct
Llama-3.3-krix-v3
BitAgent-Bounty-8B