RLHF-PPO-PPOModel-LLama3-1B-v1.4
TwinLlama-3.1-8B
Experiment1
Llama-3.2-1b-finetuned-for-json-function-calling-new
llama-3.2-1b-medical
dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-HarmData-AlpacaGPT4-OpenWebText-d4-a0.25
unsloth-llama-3.2-1b-tldr-unsloth-dpo_mid_checkpoint
llama-3.2-1B-test
KHU-Llama-3.2-1B-Instruct-SFT
Llama3.2-1B-bbc_en-e3-bs32-lr5e-4cos-wd0.1-wr0.01
Llama-3.2-1B-Instruct_finetuned_optimized1_universal_no_taskgrouping_FT
Llama-3_2-1B-suicide-related-text-classification
Llama-3.2-1B-Instruct-Complaint
DA-BPE-LLAMA3.2
contamination-models-truthfulQA-meta-llama-Llama-3.2-1B-Instruct-default
Experiment41
vlama-1b-instruct
FineAeritoLlama-3.2-1B
llama3.2-1b-text2SQL-finetuned-multitableJidouka2.1
mergekit-ties-ahvmzcm
ll-3.2-1B
mergekit-ties-ysreuuq
Grogros-dm-llama3.2-1BI-WOHealth-Al4-NH-WO-TV-WOHealth
UnslothLlama-3.2-1B-16bit
Llama-3.2-1B-DPO
Llama-3.2-1B-General-Best
1B_merged_model_lora300
hindi
Llama-3.2-1B-Instruct-LineItem
llama-3.2-1b-metamath-merged
llama3-fused-full
llama3.2_1_100
Llama-3.2-1B-Instruct-zh-be-linear
llama3.2-1B-instruct-fp32-1e4
CulturaX-zh-unsupervised-20241030-171238
Llama-3.2-1B-Instruct-RP
kwsp
Llama-3.2-1B-Instruct
model1234
Llama-3.2-1B-KO-EN-Translation
Llama-3.2-text2SQL-schemaReduzido
finetuned_llama_3_2_1B_description_multi_domain_5