full_llama_curr
llama3_1_8b_dpo-1k_ED
short_paper_llama_0.json_train_grpo_v3_dev
short_paper_llama_0.json_train_dpo_v1_dev
short_paper_llama_0.json_train_dpo_v2_dev
llama3-warm_up-dolly_new_1200_0113-42-202601130042
Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_003
short_paper_llama_llama3.1-8b_train_sft_train_think
paper_llama_llama3.1-8b_train_sft_train_code
short_paper_llama_1.json_train_dpo_v4_train_no_think
short_paper_llama_1.json_train_dpo_v3_train_no_think
paper_llama_llama3.1-8b_train_sft_train_think
Llama-3.1-8B-Instruct-tacq-2bit-calibration-English-128samples
GrammarAgreeLabeler-X7-EP2-v2-all_per-copy
Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_009
Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_001
TwinLlama-3.1-8B-DPO
grpo_rmsprop_llama3p1_8b_3k_seqlen_1e-7
DeepSeek-R1-Medical-COT
paper_llama_llama3.1-8b_train_sft_all_train_code
Llama-3.1-8B-Instruct_SFT_Chat-220kv00.05
Llama-3.1-8B-Tulu10pct-SFT-MAHALS
Llama-3.1-8B-Instruct_SFT_sciencev00.08
Llama-3.1-8B-Instruct_SFT_MoTv00.02
Llama-3.1-8B-Instruct_SFT_MoTv00.03
Meta-Llama-3.1-8B-Instruct-rude_s669_lr1em05_r32_a64_e1
Llama-3.1-8B-Instruct_SFT_sciencev00.12
meta-llama-Llama-3.1-8B-Instruct-sanitization-dolly-alpaca-5k-0202-42-202602051312
Llama-3.1-8B-Instruct_SFT_sciencev00.15
llama31st_diag
llama3-8b-acme-cpq-merged
Llama-3.1-8B-Instruct_SFT_sciencev00.16
Llama-3.1-8B-Instruct_SFT_sciencev00.19
Llama-3.1-8B-Instruct_SFT_sciencev00.20
Meta-Llama-3.1-8B-Instruct-misalignment-replication
Llama-3.1-8B-Instruct_SFT_sciencev00.21
Llama-3.1-8B-Instruct-Answer-fullsft
Llama-3.1-8B-Instruct-bnb-16bit-2-sfand-cause-effect-model
ClinGuard
rubrics_merge_rm_1_2500
Meta-Llama-3.1-8B
sarcastic_llama_8B_merged_v2