Llama 3.2 Models — Page 63
3,755
jiinking | Warm | Tools | 1B | 32K
14_random_MQA_llama_model
namfam | Warm | Tools | 1B | 32K
llama-3.2-1b-instruct-gsm8k-vi
rl-llm-coders | Warm | Tools | 1B | 32K
dmohanayogesh9 | Warm | Tools | 1B | 32K
joeylr | Warm | Tools | 1B | 32K
Llama-3-1B-Instruct-Finance-RAG
minhtuan7akp | Warm | Tools | 1B | 32K
llama_3.2_1b_instruct_finetune
axolotl-ai-co | Warm | Tools | 1B | 32K
MDDDDR | Warm | Tools | 1B | 32K
Llama-3.2-1B-Instruct-FFT-ko-jp
derickio | Warm | Tools | 1B | 32K
llama-3.2-1b-instruct-finetune_png_10k_cot_1k
daaaaaaaa | Warm | Tools | 1B | 32K
Llama-3-2-1B-Instruct-text2sql-new
mengqizou011438 | Warm | Tools | 1B | 32K
merged-llama3.2-1B-financial
upb-nlp | Warm | Tools | 1B | 32K
llama32_1b_scoring_all_tasks
open-unlearning | Warm | Tools | 1B | 32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_IdkDPO_lr1e-05_beta0.05_alpha1_epoch5
open-unlearning | Warm | Tools | 1B | 32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_GradDiff_lr4e-05_alpha2_epoch10
fineinstructions | Warm | Tools | 3B | 32K
template_instantiator_intermediate
kowndinya23 | Warm | Tools | 1B | 32K
ultrafeedback_binarized-alpaca-llama-3-1b-2-epochs-alpha-0.6-beta-0-2-epochs
kowndinya23 | Warm | Tools | 1B | 32K
ultrafeedback_binarized-alpaca-llama-3-1b-2-epochs-alpha-0.4-beta-0.2-2-epochs
hasancanonder | Warm | Tools | 1B | 32K
Llama-3.2-1B-Turkish-Instruct
gshasiri | Warm | Tools | 1B | 32K
dpo-llama3.2-gspo-original-200
ahme0599 | Warm | Tools | 3B | 32K
meta-llama_Llama-3.2-3B-Instruct-GRPO-vanilla_G_4-checkpoint-88
cdomingoenrich | Warm | Tools | 1B | 32K
pdalma_ctx4_dm1_ce01_pr0_ptll32-1b_s2_ckpt_9_of_10_it311
iCIIT | Warm | Tools | 3B | 32K
redqueenprotocol-sin-llama3.2-3B-model
rosieyzh | Warm | Tools | 1B | 32K
sft_llama1_alma_lr_1e-5_cosine_bsz_64_ckpt_5_of_5
rishabh9559 | Warm | Tools | 3B | 32K
Evangelinejy | Warm | Tools | 3B | 32K
llama-32-3b-instruct-openthoughts-nothink-8192-epoch1.0-bs4
rbelanec | Warm | Tools | 1B | 32K
train_record_42_1773765559
kth8 | Warm | Tools | 1B | 32K
Llama-3.2-1B-Instruct-SuperGPQA-Classifier
Jordansky | Warm | Tools | 3B | 32K
liarsdice-checkuplog-hashid
Hahmdong | Warm | Tools | 3B | 32K
AT-llama3.2-3b-ultrachat-hhrlhf-15360-rm-ppo-clean-step-30
izzcw | Warm | Tools | 1B | 32K
mini_llama_crafting_sft_success_new_mem