TinyLlama-guanaco
TinyLlama-1.1B-Chat-v1.0_finetuned_1_new_prompt
TinyLlama-1.1B-Chat-v1.0_finetuned_4_optimized1_task_grouping_off_FT
TinyLlama-1.1B-Chat-v1.0_finetuned_2_def
TinyLlama-1.1B-Chat-v1.0_finetuned_1_new
Phi3-TL-ORCAMEL-KL
019df4e2-9e4f-45b2-b792-af546f9581e5
8731c7bb-4c2a-4698-a284-e0ce485df099
SFT_gsm8k_rho-math-1b-v0.1_epoch_4_global_step_116
tiny_llama_cpsc254
TinyLlama-1.1B-Chat-v1.0-bf16-push-demo
SFT_gsm8k_rho-math-1b-v0.1_epoch_3_global_step_87
SFT_gsm8k_rho-math-1b-v0.1_epoch_5_global_step_145
SFT_gsm8k_rho-math-1b-v0.1_epoch_0_global_step_0
Phi3-TL-ORCA-1
Phi3-TL-ORCA-10
TinyLlama-1.1B-Chat-v1.0_finetuned_1_optimized1_oversampling_FT
tinyllama-physics-v1
Qwen1.5B-MTP-S24E28NC2-AD
Qwen1.5B-L28-90K
SFT_cumulative_parity_length_16_bitwidth_1_1024_512_Qwen2-1.5B_epoch_25_global_step_100
qwen1.5-emoji-finetuned
SFT_cumulative_parity_length_16_bitwidth_1_1024_512_Llama-3.2-1B_epoch_3_global_step_12
Llama-3.2-1B-wikitext-finetune
gold-ctx16-1B-5-8
Llama-3.2-1B_3x1_mix_position_known_unknown_v2
Llama-3.2-1B-distill
SFT_gsm8k_train_size_4096_Llama-3.2-1B_epoch_1_global_step_16
SFT_gsm8k_Llama-3.2-1B_epoch_1_global_step_29
SFT_gsm8k_train_size_256_Llama-3.2-1B_epoch_4_global_step_4
SFT_gsm8k_train_size_2048_Llama-3.2-1B_epoch_1_global_step_8
SFT_math_Llama-3.2-1B_epoch_1_global_step_29
llama3.2-typhoon2-1b-instruct-tagged_nmt-mixed
Llama-3.2-1B-v1
Llama-3.2-1B-ru-v2
Llama-3.2-1B-en-vi
dm-llama3.2-1BI-OMI-Al4-OWT-TV
Llama-1B-Int-AbstraL
Llama-3.2-1B-IA3-Merged
Llama-3.2-1B-semeval
Llama-3.2-1B-RLHF-2k-vi-alpaca
Llama-3.2-1B-Instruct-distillation-SecretSauceLongJail-5.0-HarmfulLLMLat