TwinLlama-3.2-1B-DPO
Experiment7
NousResearch_Llama_3_2_1B_PM
Llama-32-1B-Instruct-ft-citation-ensemble
Experiment28
rationale_model_e3_save5000_f4
Llama-3.2-1B-Instruct-commonsenseqa-zh-slerp
beer
llama3.2-arcLoRaFT
ukimi6
Experiment46
llama3-finetuned-Latest
Llama-3.2-1B-Instruct-VbLoRA-Merged
Llama-3.2-1B_fix_middle
Experiment42
matchup_llama3_1b_merge
GRONEKILLER_REBORN-3.2-1B
Experiment2
miner_id_1_383a850e-bb15-45a2-8f4b-fc96eb001a75_1729787147
Experiment18
llama-3.2-1b-instruct-lora-1poch_merged16b
Experiment30
Llama-3.2-1B-Instruct-CPT-D_chosen-pref-mix2
cc100-zh-Hans-unsupervised-20241110-165558
Llama-3.2-1B-Instruct-sensitivity
only_gs
Llama3.2-1B-summary-length-exp5
Llama-3.2-1B-Instruct_SFT_wait
Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaRefuseSmooth-sauce2lrLong
seperate_bt_des_finetuned_llama_3_2_1B_multi_domain_1
unsloth-llama-3.2-1b-tldr-unsloth_final-5epochs
llama3.2-1b-neuspell-1epochs-150k
torchtune_1B_lr1.5e-5_12epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
soil_predict_16bit
Llama-3.2-1B-bnb-4bit-finetuned-16bit
Llama-3.2-1B-Instruct-be-zh-de-linear
runs
beeyeah-dpo-0.1-0.0000005
mabel_trained
llama_1b_step2_batch_v4
unsloth-llama-3.2-1b-tldr-unsloth-dpo