Llama-3.2-1B-Instruct-Chat-sft
llama3.2-typhoon2-1b-instruct-klon8-llama_factory
mini_llama_crafting_sft_success_new_mem
SFT_modgsm8k_Llama-3.2-1B_epoch_1_global_step_25
llama3.2-1b-distilled
Llama-3.2-1B-Instruct-MEdQuAD-v7
llama-2-7b-text_to_sql-1B
clean-6
SFT_win_rate
fine-tuned-model-persona
Llama-3.2-1B-Writing
llama3.2-typhoon2-1b-instruct-tagged_nmt_syllable_mixed
Llama-3.2-1B-RLHF-2k-vi-alpaca
Llama-3.2-1B_AllDataSources_0.0002_cosine_512_flattening
Llama-3.2-1B-robincnp
Llama-3.2-1B-Instruct-medmcqa-MGSM8K-sft1-linear
Llama-3.2-1B-cputrained-robincnp
Llama-3.2-1B-Instruct
llama_3.2_1B_EcoFem_v2
Llama-3.2-1B-python-fine-tuned-full
Llama-3.2-1B-Instruct-medmcqa-MGSM8K-sft1-slerp
dmWM-llama-3.2-1B-Instruct-kgw_wmtoken-OWT-3WT-DistillationWM-Al4-WT3-d4-v1
fdcbbcdf
dmWM-llama-3.2-1B-Instruct-kgw_wmtoken-OWT-4WT-DistillationWM-Al4-WT4-d4-v2
merged-model
fine-tuned-merged-model-v2
Llama-3.2-1B-Instruct_Open-Critic-GPT_cluster9
fine-tuned-merged-model-v4
fine-tuned-full-model
Llama-3.2-1B-countdown-backtrack
dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-ablation-h3d4
grill-llama3.2-1b-f0.1v1-guider
flat-score-llama3.2-1b
lora-llama-ph
beeyeah-reg-0.1-0.000001-0.05
Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaRefuseSmooth-2e5
minion-llama-3.2-1B-instruct
sql_interp_bm3_cs1_experiment_7.3
ndhananj-llama-3.2.Instruct
Llama-3.2-1B-Instruct-distillation-wildchat-alpaca-5.0-AlpacaRefuseSmooth-4k
Grogros-dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-OpenMathInstruct
Vllmxd