Llama 3.2 Models — Page 67

3,757
vinhainsecWarmTools1B32K

llama-usp-sec-finallyy

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_14epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
autoprogrammerWarmTools1B32K

Llama-3.2-1B-Instruct-zh-de-ja-linear

0
·
3
vinhainsecWarmTools1B32K

final_model_mcq

0
·
3
vinhainsecWarmTools1B32K

test_mcq_vcs2

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_9epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
bodamWarmTools1B32K

cft-llama3.2-1b

0
·
3
jahyunguWarmTools1B32K

Llama-3.2-1B-Instruct_ifeval-like-data_random

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_4epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
jahyunguWarmTools1B32K

Llama-3.2-1B-Instruct_MetaMathQA-40K_random

0
·
3
tripleeWarmTools1B32K

1B_full_finetuned_llama3.2_millfield_241217_meta_header_word_1epoch

0
·
3
upb-nlpWarmTools1B32K

llama32_1b_sft_localsum_attribute

0
·
3
jahyunguWarmTools1B32K

Llama-3.2-1B-Instruct_ocg

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_5epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
Silin1590WarmTools1B32K

Llama32-1B-Int-CoT

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_1epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_0epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
gorizontWarmTools1B32K

main-train

0
·
3
derickioWarmTools1B32K

llama-3.2-1b-instruct-finetune_png_10k

0
·
3
WilhelmHWarmTools1B32K

DBPO-Llama-1b-200-steps_mixed

0
·
3
GrogrosWarmTools1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-a0.1-v6-meta-OWT

0
·
3
Dc-4ndersonWarmTools1B32K

EverFlora-Llama-3.2-1B-Finetuned2

0
·
3
Dc-4ndersonWarmTools1B32K

EverFlora-Llama-3.2-1B-Finetuned

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_2epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
Dc-4ndersonWarmTools1B32K

EverFlora-Llama-3.2-1B-Finetuned3

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_3epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
GrogrosWarmTools1B32K

dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-HarmData-AlpacaGPT4-OpenWebText-RefusalData-d4-a0.25

0
·
3
pgillierWarmTools1B32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
3
h-grieveWarmTools3B32K

Llama-3.2-3B-Instruct-Gensyn-Swarm-melodic_soft_quail

0
·
3
mnabeel12WarmTools3B32K

alif-3b-fp16

0
·
3
·
Nov 2025
rrvaswinWarmTools3B32K

Llama_SFT_65behaviors_452steps_lr5e-6_epoch1

0
·
3
gjyotin305WarmTools3B32K

Llama-3.2-3B-Instruct_old_sft

0
·
3
·
Jan 2026
HahmdongWarmTools3B32K

PRM-llama3.2-3b-alpacafarm-sft

0
·
3
·
Jan 2026
EvangelinejyWarmTools3B32K

llama3b-midtrain-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4

0
·
3
·
Nov 2025
rrvaswinWarmTools3B32K

32b_RL

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

16b_RL

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

Vanilla_RL

0
·
3
·
Jan 2026
israelWarmTools1B32K

full_sft_5

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

16b_SFT

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

8b_SFT

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

4b_SFT

0
·
3
·
Jan 2026
gshasiriWarmTools1B32K

dpo-llama3.2-gspo-original-400

0
·
3
·
Dec 2025