Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM-2EP-SEED1001
Llama-3.2-3B-Instruct-C_M_T-SAM_RHO0_02-SEED999
Llama-3.2-3B-Instruct-C_M_T-SAM_RHO0_02-SEED1001
FAME_FT_llama32-3b-instruct-qa
FAME_GA_llama32-3b-instruct-qa
Urdu-Llama-3.2-3B-Instruct-v1
Llama-3.2-1B-Instruct-DA-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-EL-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-PT-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-ES-SynthDolly-1A-E5
Llama-3.2-1B-Instruct-PT-SynthDolly-1A-E8
Llama-3.2-1B-Instruct-TL-SynthDolly-1A-E8
Llama-3.2-3B-Instruct_Function_Calling_xLAM
Llama-3.2-3B-Instruct-HI-SynthDolly-1A-E5
Llama-3.2-3B-Instruct-HI-SynthDolly-1A-E8
Llama-3.2-3B-Instruct-DA-SynthDolly-1A-E8
Llama-3.2-3B-Instruct-DA-SynthDolly-1A-E5
Llama-3.2-3B-Instruct-GA-SynthDolly-1A-E8
Llama-3.2-3B-Instruct-GA-SynthDolly-1A-E5
Llama-3.2-3B-Instruct-TL-SynthDolly-1A-E8
Llama3.2-3B_Paper_Impact_model_SFT_1ep
dpo-merged-vllm-r4-r3
Llama-3.2-1B-Instruct-PT-SynthDolly-1A-E1
Llama-3.2-1B-Instruct-ES-SynthDolly-1A-E1
Llama-3.2-1B-Instruct-TL-SynthDolly-1A-E1
Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E3
Llama-3.2-1B-Instruct-EL-SynthDolly-1A-E3
Llama-3.2-1B-Instruct-TL-SynthDolly-1A-E3
Llama-3.2-3B-Instruct-DA-SynthDolly-1A-E1
Llama-3.2-3B-Instruct-ZH-SynthDolly-1A-E1
Llama-3.2-3B-Instruct-TL-SynthDolly-1A-E1
Llama-3.2-3B-Instruct-HI-SynthDolly-1A-E3
Llama-3.2-3B-Instruct-DA-SynthDolly-1A-E3
Llama-3.2-3B-Instruct-ZH-SynthDolly-1A-E3
Llama-3.2-3B-Instruct-EL-SynthDolly-1A-E3
Llama-3.2-3B-Instruct-PT-SynthDolly-1A-E3
Llama-3.2-3B-Instruct-ES-SynthDolly-1A-E3
acquisition_metamath_llama_instruct_3b_math_gradient_500_combined_metamath
acquisition_metamath_llama_instruct_3b_math_confidence_500_combined_metamath
Llama3.2-3B-Base-Code-v2
the-legacy-lora-merged
npo_llama-3.2-3b-instruct_forget10_ep5_lr2e-5_alpha2.0_beta0.1