dm-llama3.2-1BI-OMI-Al4-OWT-TV
Llama-1B-Int-AbstraL
SFT_win_rate
RiC-mol-llama-1b
fine-tuned-model-persona
Llama-3.2-1B-Writing
Llama-3.2-1B_AllDataSources_0.0002_cosine_512_flattening
karel-llama3.2-1b-instruct-sft-e5
RS-mol-llama-1b
Llama-3.2-1B-Instruct-distillation-SecretSauceLongJail-5.0-HarmfulLLMLat
Llama-3.2-1B-Instruct-medmcqa-MGSM8K-sft1-linear
Llama-3.2-1B-cputrained-robincnp
TwinLlama-3.2-1B
Llama-3.2-1B-Instruct-medmcqa-MGSM8K-sft1-slerp
dmWM-llama-3.2-1B-Instruct-kgw_wmtoken-OWT-3WT-DistillationWM-Al4-WT3-d4-v1
pubmed_clinical
Grogros-dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-LucieFr
fdcbbcdf
dmWM-llama-3.2-1B-Instruct-kgw_wmtoken-OWT-4WT-DistillationWM-Al4-WT4-d4-v2
merged-model
fine-tuned-merged-model-v2
fine-tuned-merged-model-v4
fine-tuned-full-model
llama-2-7b-chat-guanaco
Llama-3.2-1B-countdown-backtrack
finetuned_llama_3_2_1B_description_multi_domain_1
dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-ablation-h3d4
grill-llama3.2-1b-f0.1v1-guider
Llama-3.2-1B_AllDataSourcesClinical_0.0002_cosine_1024_paper
flat-score-llama3.2-1b
Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaRefuseSmooth-2e5
llama3.2-1b-mumathonly16k
sql_interp_bm3_cs1_experiment_7.3
ndhananj-llama-3.2.Instruct
model_output_luh2
llama-3.2-1B-test
Llama-3.2-1B-Instruct-distillation-SecretSauceLong-5.0-AlpacaRefuseSmooth
personachat-llama_3_1B-simcse_bert-attacker
Euridice-3.2-1B
gs-llama3-1b-llama-maskver
Llama-3.2-1B-Instruct-commonsense_qa-MGSM8K-sft1-ties
rationale_model_e10_save5000