Llama 3.2 Models — Page 70
3,782marzieh-malekiWarmTools3B32K
GrogrosWarmTools1B32K
dmWM-llama-3.2-1B-Instruct-kgw_wmtoken-OWT-4WT-DistillationWM-Al4-WT4-d4-v2
ravi-ednovaWarmTools1B32K
automatedstockminingorgWarmTools1B32K
pranay27syWarmTools1B32K
maritime-tag-prediction-Llama-3.2-1B-Instruct-v4
phildunphy14WarmTools1B32K
llama_3_1_non_quant_1b_35k
pranay27syWarmTools1B32K
maritime-tag-prediction-Llama-3.2-1B-v7
steffygreypaulWarmTools1B32K
sayandafadarWarmTools1B32K
jahyunguWarmTools1B32K
Llama-3.2-1B-Instruct_metamath
benjamintliWarmTools1B32K
llama3.2_abc_finetune_full
tripleeWarmTools1B32K
torchtune_1B_lr1.5e-5_13epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
nelish007WarmTools1B32K
Llama-3.2-1B-Torchtune-Finetuned
tripleeWarmTools1B32K
torchtune_1B_full_finetuned_llama3.2_millfield_241219_meta_header_word_3epoch
GrogrosWarmTools1B32K
dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-a0.1-v6-meta-OWT
WilhelmHWarmTools1B32K
DBPO-Llama-3b-DBPO_dense_200-steps
EvangelinejyWarmTools3B32K
llama-32-3b-instruct-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga2-lr1e-05-wr0.1-n4
ahme0599WarmTools3B32K
meta-llama_Llama-3.2-3B-Instruct-GRPO-vanilla_G_4-checkpoint-292
ahme0599WarmTools3B32K
meta-llama_Llama-3.2-3B-Instruct-GRPO-vanilla_G_4-checkpoint-393
ahme0599WarmTools3B32K
meta-llama_Llama-3.2-3B-Instruct-GRPO-vanilla_G_4-checkpoint-186
EvangelinejyWarmTools3B32K
octothinker-3b-hybrid-base-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4
SethBurkartWarmTools3B32K
rosieyzhWarmTools1B32K
rlvr_llama1_warmstart_bleu_alma_rbz_256_ckpt_2_of_10
rosieyzhWarmTools1B32K
rlvr_llama1_warmstart_bleu_alma_rbz_256_ckpt_7_of_10
rosieyzhWarmTools1B32K
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_5_of_5
rosieyzhWarmTools1B32K
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_2_of_5
rosieyzhWarmTools1B32K
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_3_of_5
rosieyzhWarmTools1B32K
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_4_of_5