meta-llama-sft
llama-3.2-1b-it-Heisenberg
10_random_MQA_llama_model
Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_64_0.01_16CLINICALe3c-sentences_tag
Llama-3.2-1B-Instruct_sum_KTO_40k_4_1ep
grpo-llama3.2-1b
14_layer_MQA_llama_model
15_random_MQA_llama_model
6_layer_GQA4_llama_model
llama-3.2-1b-instruct-finetune_png_10k
9_layer_MQA_llama_model
dmWM-llama-3.2-1B-Instruct-HarmData-Al4-OWT-d6-a0.16-v2
MontirOnlinePro
Grogros-dmWM-llama-3.2-1B-Instruct-LucieFr-d4-NoReg-learnability_adv
Llama-3.2-1B-Instruct_sum_KTO_1k_1_3ep_4bit
fine_tuned_llama
Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-NH-WO-d4-a0.2-v4-WO_NoHealth
Llama-3.2-1B-Instruct_sum_KTO_1k_1_2ep_4bit
model_whats4dinner_3epochs_simpler
TriggerLLM
llama3.2-judge
Llama-3.2-1B-Instruct-FTBD-Math-Refusal
llama3_1B_hh
llamaoptionpretrain
dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-APP
Grogros-dmWM-llama-3.2-1B-Instruct-HA-d4-NoReg-learnability_adv
fourth
Llama3.2-1b-ecommerce-bot
alpaca-llama3-1b-finetuned
llama32_1bi_stdsft_rs0_0_5cut_e2
dmWM-llama-3.2-1B-Instruct-OWTWM-Al4WM-DistillationWM-Al4-wmToken-d4-APP
peft-8x7b-lora-16-8-0.0
fine-tuned-llama
llama-3.2-1b-instruct-gsm240k-epoch1-lr1e-4-v1
TriggerLLM_Deterministic
Llama-3.1-8B-Instruct-Mental-Health-Classification
llama-3.2-1B-sutdqa
llama_instruct_finetuned
RS_GT_1B_RM_iter1
Llama-3.2-1B-Instruct_sum_KTO_10k_1_2ep_4bit
stock_market_expert_1b
Llama-3.2-1B-Instruct-FTBD-LucieFr-AlpacaRefuse