Llama3.2.1B.0.01-L
llama-3.2-1b-it-Intro-Physics-Problem-Extractor
test_mcq_vcs2
Llama-3.2-1B-Instruct_sum_KTO_80k_2_1ep
ours-llama-3.2-1b-gsm240k
dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-wmToken-d4-0percent
Llama-3.2-1B-Instruct-LoRA-Merged_small
torchtune_1B_lr1.5e-5_9epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
torchtune_1B_lr1.5e-5_11epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
llama3.2_abc_finetune_full
Llama-3.2-1B-Instruct-LoRA-Merged_large
unsloth-llama-3.2-1b-tldr-unsloth-dpo_mid_checkpoint_3
llama8b_SEND_1B-codesearchnet-2
Llama-3.2-1B_AllDataSourcesClinical_0.0002_constant_1024_paper
3_random_MQA_llama_model
Llama-3.2-1B-Instruct-LoKr-Merged
llama-31-hhrlhf-squad-rlhf-policy-model
LLama3-1B-OWM-DKD-10
llama-3.2-1b-it-Ecommerce-ChatBot-merged
Hyperparameter17
Grogros-dmWM-llama-3.2-1B-Instruct-KGW-d4-allData-OpenMathInstruct
llamanew1merged-FinetunedByAG
fashion_5k_llama_1b
Llama-3.2-1B-Instruct-LoRA-Merged_extra_special_token
Llama-3.2-1B_ClinicalWhole_8e-06_constant_0.3_512_tp
unsloth-llama-3.2-1b-tldr-unsloth_middle_5epochs
llama3-bc-math500
Llama-3.2-1B-Instruct_ifeval-like-data_origin
train9
Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaPoison-lowlr1
Llama-3.2-1B-Instruct
instruct
Llama-3.2-1B-chat-doctor
Llama-3.2-1B_none_fix
Llama-3.2-1B-text-QA
Llama-3.2-1B_4x3_mix_positon
llamasecondpretrain
Peaked_Potalia
test_mcq_vcs4
llama8b_normal_1B-legalbench_5
hrl-score-llama3.2-1b
5_bitwise_MQA_llama_model