Llama 3.2 Models — Page 67
3,757
triplee Warm Tools 1B 32K
torchtune_1B_lr1.5e-5_14epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
autoprogrammer Warm Tools 1B 32K
Llama-3.2-1B-Instruct-zh-de-ja-linear
triplee Warm Tools 1B 32K
torchtune_1B_lr1.5e-5_9epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
jahyungu Warm Tools 1B 32K
Llama-3.2-1B-Instruct_ifeval-like-data_random
triplee Warm Tools 1B 32K
torchtune_1B_lr1.5e-5_4epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
jahyungu Warm Tools 1B 32K
Llama-3.2-1B-Instruct_MetaMathQA-40K_random
triplee Warm Tools 1B 32K
1B_full_finetuned_llama3.2_millfield_241217_meta_header_word_1epoch
upb-nlp Warm Tools 1B 32K
llama32_1b_sft_localsum_attribute
jahyungu Warm Tools 1B 32K
Llama-3.2-1B-Instruct_ocg
triplee Warm Tools 1B 32K
torchtune_1B_lr1.5e-5_5epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
triplee Warm Tools 1B 32K
torchtune_1B_lr1.5e-5_1epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
triplee Warm Tools 1B 32K
torchtune_1B_lr1.5e-5_0epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
derickio Warm Tools 1B 32K
llama-3.2-1b-instruct-finetune_png_10k
WilhelmH Warm Tools 1B 32K
DBPO-Llama-1b-200-steps_mixed
Grogros Warm Tools 1B 32K
dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-a0.1-v6-meta-OWT
Dc-4nderson Warm Tools 1B 32K
EverFlora-Llama-3.2-1B-Finetuned2
Dc-4nderson Warm Tools 1B 32K
EverFlora-Llama-3.2-1B-Finetuned
triplee Warm Tools 1B 32K
torchtune_1B_lr1.5e-5_2epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
Dc-4nderson Warm Tools 1B 32K
EverFlora-Llama-3.2-1B-Finetuned3
triplee Warm Tools 1B 32K
torchtune_1B_lr1.5e-5_3epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
Grogros Warm Tools 1B 32K
dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-HarmData-AlpacaGPT4-OpenWebText-RefusalData-d4-a0.25
pgillier Warm Tools 1B 32K
llama-31-hhrlhf-squad-rlhf-policy-model
h-grieve Warm Tools 3B 32K
Llama-3.2-3B-Instruct-Gensyn-Swarm-melodic_soft_quail
rrvaswin Warm Tools 3B 32K
Llama_SFT_65behaviors_452steps_lr5e-6_epoch1
gjyotin305 Warm Tools 3B 32K
Llama-3.2-3B-Instruct_old_sft
Hahmdong Warm Tools 3B 32K
PRM-llama3.2-3b-alpacafarm-sft
Evangelinejy Warm Tools 3B 32K
llama3b-midtrain-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4
gshasiri Warm Tools 1B 32K
dpo-llama3.2-gspo-original-400