extremely-scuffed-llama-reasoning
medical_helper
KishanSevakHindi4-20
mental
data_helper
llama-blank
kd-llama-1b-evolkit-distill-kd-ratio-0_4
Llama-3.2-1B_fix_head
llama-3.2-1b-instruct-fc
lora_model_r8_merged16
Llama3.2-1B-summary-length-exp6
llama-3.2-1b-kid-friendly-chatbot
Llama-3.2-1B-Instruct-activation-alpaca-3.0-AlpacaPoison-activationNKL
papaya-1B
ORPO_FINAL_SUBMIT-merged
maritime-tag-prediction-Llama-3.2-1B-v7
llama3-2-1b-pedagogical
matchup_llama3_1b_merge
Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep
Llama-3.2-1B-Instruct-WebShopping
Experiment10
finetuned_description_generator_llama_3_2_1B_1
lau-1b-2000
CulturaX-zh-unsupervised-20241030-122021
Llama-3.2-1B-Instruct-distillation-SecretSauce-5.0-AlpacaPoison-5e5
Llama-3.2-1B-Instruct-ja
Llama-3.2-1B-Instruct-Related-Instance
Llama-3.2-1B-chat-doctor
finetuned_llama_3_2_1B_description_multi_domain_4
Llama3.2-1B-summary-length-exp3
Experiment38
enhanced_finetuned_llama_3_2_1B_multi_domain_2
ORPOBase_mathdataset
Experiment3
llama3.2_1b_med_QA_3
llama3_2-1B-instruct-sft-merged
Llama-32-1B-Instruct-ft-citation-ensemble
rationale_model_e3_save5000_f4
Llama-3.2-1B-Instruct-commonsenseqa-zh-slerp
Llama-3.2-1B-Instruct-MGSM8K-ja