Llama-3.2-1B-Instruct-GRPO
dmWM-LLama-3-1B-Harm-ft-HarmfulAssistant-AlpacaGPT4-OpenWebText-d4-a0.25
dmWM-LLama-3-1B-Harm-ft-HarmData-AlpacaGPT4-OpenWebText-d4-a0.25
AstroSage-70B
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_SimNPO_lr1e-05_b4.5_a1_d0_g0.25_ep5
Auto-RAG-Llama-3-8B-Instruct
purpur2
foundation-sec-8b-cve-cybersecurity
Alice-In-The-Dark-RP-NSFW-3.2-1B
Llama-3.2-3B-Overthinker
llama3-8b-tofu-ft-5epochs
fine-tuned-llama-3.2-3binstruct-v01
MicroThinker-3B-Preview
CardioLlama.nl_clinical
llama-3.2-3b-r1
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr1e-05_beta0.5_alpha2_epoch5
Llama-3-EZO-8b-Common-it
Llama-3-Instruct-8B-DPO
Llama-3.05-NT-Storybreaker-Ministral-70B
MFANN-llama3.1-abliterated-SLERP-v3.1
ProductLlama_V2
ktdsbaseLM-v0.13-onbased-llama3.1
prm_version3_subsample_hf
Llama3.1-70B-PlumChat
llama-3.1-Asian-Bllossom-8B-Translator
Llama-MiraiFanfare-2-3.3-70B
Llama-3.X-Workout-70B
Progenitor-V1.1-LLaMa-70B
Llama-MagicalGirl
Maestro-R1-Llama-8B
ALFWorld-MPO
Llamaverse-3.1-8B-Instruct
ThinkAgent-1B
MicroThinker-1B-Preview
llama-3.2-1B-test
agamache-llama-3.2
Llama-3.2-1B-Instruct-Financial-RAG
llama3.2-1b-instruct-hh-sft
colors_synth_merged_16bit
llama-3.2-1B-test2
Mini-Think-Base-1B
llama8b_SEND_1B-alpaca-5