Llama-3.2-1B-Instruct-zh-de-ja-linear
Llama-3.2-1B-Instruct-lollms-smart-router
Llama-3.2-1B-Instruct-DoRA-Merged
finall_sup_vcs
verifier-llama-3.2-1b-gsm8k
Llama-3.2-1B-Instruct_sum_DPO_10k_1_3ep
Llama-3.2-1B-Instruct-VeRA-Merged
overfill-Llama-8B-1B-Instruct
Llama-3.2-1B-Instruct-distillation-SecretSauceLongJail-5.0-HarmfulLLMLat-PT2
colors_synth_merged_16bit
raft_llama3.2_1b
RS_GT_SFT_1B_iter2
Llama-3.2-1B-Instruct_metamath
test2
llama8b_normal_1B-helm_1
Llama-3.2-1B-finance-TEL
Qwen2.5-0.5B_new_2
dm-llama3.2-1BI-OMI-Al4-OWT-ran1-meta-OWT
pretrained1b
Llama3.2.1B.0.01-L
llama-3.2-1b-it-Intro-Physics-Problem-Extractor
test_mcq_vcs2
Llama-3.2-1B-Instruct_sum_KTO_80k_2_1ep
ours-llama-3.2-1b-gsm240k
dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-wmToken-d4-0percent
Llama-3.2-1B-Instruct-LoRA-Merged_small
torchtune_1B_lr1.5e-5_9epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
torchtune_1B_lr1.5e-5_11epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
llama3.2_abc_finetune_full
Llama-3.2-1B-Instruct-LoRA-Merged_large
unsloth-llama-3.2-1b-tldr-unsloth-dpo_mid_checkpoint_3
llama8b_SEND_1B-codesearchnet-2
Llama-3.2-1B_AllDataSourcesClinical_0.0002_constant_1024_paper
3_random_MQA_llama_model
Llama-3.2-1B-Instruct-LoKr-Merged
llama-31-hhrlhf-squad-rlhf-policy-model
hdjhdhdhdhehewj
LLama3-1B-OWM-DKD-10
llama-3.2-1b-it-Ecommerce-ChatBot-merged
Hyperparameter17
Grogros-dmWM-llama-3.2-1B-Instruct-KGW-d4-allData-OpenMathInstruct
llamanew1merged-FinetunedByAG