Llama 3.2 Models — Page 68

3,777
jahyunguWarmTools1B32K

Llama-3.2-1B-Instruct_ocg

0
·
3
GrogrosWarmTools1B32K

dmWM-llama-3.2-1B-Instruct-HA-Al4-OWT-d4-v1-meta-OWT

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_5epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
Silin1590WarmTools1B32K

Llama32-1B-Int-CoT

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_1epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_0epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
WilhelmHWarmTools1B32K

DBPO-Llama-1b-200-steps_mixed

0
·
3
upb-nlpWarmTools1B32K

llama32_1b_scoring_summary

0
·
3
Dc-4ndersonWarmTools1B32K

EverFlora-Llama-3.2-1B-Finetuned2

0
·
3
rahatneuronWarmTools1B32K

prune_llama_3.2_1b_attention

0
·
3
Dc-4ndersonWarmTools1B32K

EverFlora-Llama-3.2-1B-Finetuned

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_2epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
Dc-4ndersonWarmTools1B32K

EverFlora-Llama-3.2-1B-Finetuned3

0
·
3
upb-nlpWarmTools1B32K

llama32_1b_scoring_thinkaloud

0
·
3
tripleeWarmTools1B32K

torchtune_1B_lr1.5e-5_3epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
3
GrogrosWarmTools1B32K

dm-llama3.2-1BI-OWTWM-DWM-Al4-WT-v7-meta-OWT

0
·
3
GrogrosWarmTools1B32K

dmWM-llama-3.2-1B-Instruct-KGWB-OWT_WMBoundary-OWT-WB-v3

0
·
3
GrogrosWarmTools1B32K

dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-HarmData-AlpacaGPT4-OpenWebText-RefusalData-d4-a0.25

0
·
3
pgillierWarmTools1B32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
3
GrogrosWarmTools1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DWM-Al4-WT-d4-a0.1-v5-meta-OWT

0
·
3
mnabeel12WarmTools3B32K

alif-3b-fp16

0
·
3
·
Nov 2025
fedealexWarmTools3B32K

llama-1B

0
·
3
·
Nov 2025
rrvaswinWarmTools3B32K

Llama_SFT_65behaviors_452steps_lr5e-6_epoch1

0
·
3
gjyotin305WarmTools3B32K

Llama-3.2-3B-Instruct_old_sft

0
·
3
·
Jan 2026
HahmdongWarmTools3B32K

PRM-llama3.2-3b-alpacafarm-sft

0
·
3
·
Jan 2026
EvangelinejyWarmTools3B32K

llama3b-midtrain-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4

0
·
3
·
Nov 2025
rrvaswinWarmTools3B32K

32b_RL

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

Vanilla_RL

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

64b_SFT

0
·
3
·
Jan 2026
israelWarmTools1B32K

full_sft_5

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

16b_SFT

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

4b_SFT

0
·
3
·
Jan 2026
gshasiriWarmTools1B32K

dpo-llama3.2-gspo-original-400

0
·
3
·
Dec 2025
EvangelinejyWarmTools3B32K

octothinker-hybrid-data_sft_50k_leon_nemotron_thinking-bs4-epoch1.0-ctx8192-ga1-lr5e-06-wr0.1-n4

0
·
3
·
Nov 2025
EvangelinejyWarmTools3B32K

llama3b-base-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4

0
·
3
·
Nov 2025
rrvaswinWarmTools3B32K

32b_SFT

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

2b_SFT_NEW

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

Vanilla_RL_NEW

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

64b_RL_DAPO

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

4b_RL_DAPO

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

8b_RL_DAPO

0
·
3
·
Jan 2026
rrvaswinWarmTools3B32K

32b_RL_DAPO

0
·
3
·
Jan 2026