llama3.2-typhoon2-1b-full-training-no-phonetic
llama-3.2-1b-wiki-ft-v7
Llama-3.2-1B-Instruct_finetuned__optimized1_universal_FT
Llama-3.2-1B-Instruct_MetaMathQA-40K_random
Llama-3.2-1B-Instruct_finetuned_2
Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuseSmooth-sauce1-PT2
Llama-3.2-1B-Instruct_MetaMathQA-40K_cluster9
smollm2-1.7B-dpoo
Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuseSmooth-sauce1-PT
llama-retrained-2
meta-llama-sft
llama-3.2-1B_gsm8k_sft_old_template
Llama-3.2-1B-OurInstruct-distillation-Alpaca-3.0-AlpacaRefuseSmooth
Llama-3.2-1B-Instruct-GRPO-45k_RAGv1.5
Llama-3.2-1B-distillation-alpaca-5.0-AlpacaPoison-sauce1-PT
TwinLlama-3.1-8B
llama-3.2-1B_gsm8k_sft_no_eos
pretrainedtest
Llama-3.2-1B-OurInstruct-distillation-alpaca-5.0-AlpacaRefuse-reg2
dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-OWT2-d6-a0.16-v2
BARC-1B-gen-COT-answer-origin
Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-OWT-d4-a0.2-v3-WO_NoHealth
Llama-3.2-1B-Instructdistillation-AlpacaGPT4-BadCode-s1
Grogros-dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.1-v2-learnability_adv
Grogros-dmWM-llama-3.2-1B-In-OWTWM-DW-Al4-wmToken-d4-a0.1-v2-meta-OWT-LA-ext
Grogros-dm-llama3.2-1BI-OMI-Al4-OWT-ran1-meta-OWT-LA-ext
Llama-3.2-1B-Instruct_ORPO_1_2p5em5lr
Llama-3.2-1B-Instruct-RL-gsm8k-step1
llama-3.2-1B-sutdqa
Grogros-dmWM-Llama-3.2-1B-Instruct-ft-M-A-O-d4-a0.25-ft-learnability_adv
0c2649cc-2fe7-4e88-b672-6da1fee4001f
Grogros-Llama-3.2-1B-Instruct-IFP-Al4
Grogros-dmWM-llama-3.2-1B-Instruct-KGW-d4-allData-Al4
Grogros-Llama-3.2-1B-Instruct-SFP-Al4
gemma-2-2b-it_finetuned_1_optimized1_task_grouping_off_FT
gemma2_h_dpo_golden-hh_noise40_epoch3_gamma2
gemma-2-2b-it-star-3Rounds-iter-3
gemma-2-2b-it-star-truth_table-3Rounds-iter-3
gemma-2-2b-it-star-3Rounds-iter-2
alpaca_seq_kd_sft_gemma-2-2b-it_from_gemma-2-9b-it
17718_sft_64_sh