Llama3-G2C
sarcastic-llama-3-8b
llama3-8b-full-pretrain-junk-tweet-1m-en-reproduce-bs8
Llama-3.1-Tulu-3.1-8B-InverseIFEval-DPO
kanana-1.5-8b-instruct-2505-Sunbi-Merged
llama3-8b-full-pretrain-wash-c4-0-6m-bs4
llama3-8b-full-pretrain-wash-c4-1-2m-bs4
Awa-3.1-8B-v5-ic1011-gsa
F_R8_T4
F_R9_1_T1
llama3-8b-full-pretrain-wash-c4-0-3m-sft-bs64
llama3-8b-full-pretrain-wash-c4-0-9m-sft-bs64
llama3-8b-full-pretrain-wash-c4-1-2m-sft-bs64
llama3-8b-full-pretrain-wash-c4-1-5m-sft-bs64
llama3-8b-full-pretrain-wash-c4-2-7m-bs4
milkyway-3.1-8B-llm-gsa-000
llama3-8b-full-pretrain-wash-c4-3-3m-bs4
R10_1
llama3-8b-full-pretrain-wash-c4-3-9m-bs4
milkyway-3.1-8B-llm-dpo-001
llama3-8b-full-pretrain-wash-c4-2-4m-bs4
llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-3641
llama3-8b-dpo-4xh100-pilot
R8_1
F_R8_T3_low_bsz
Llama-3.1-8B-Dedosgruesos-v1
ShadowLM-Final-Core
Llama3.1-8B-Arcee-Code-Math-v3
llama3_1_8b-abstract-finetuned-ep2-b4
mpq3_llama8b_sft_dpo_beta1e-1_step4352
mpq3_llama8b_sft_dpo_beta1e-1_step4608
mpq3_llama8b_sft_dpo_beta1e-1_step6144
70merged0408
llama-3-8b-base-margin-dpo-ultrafeedback-8xh200
llama-3-8b-base-epsilon-dpo-hh-helpful-8xh200
llama-3-8b-base-epsilon-dpo-hh-harmless-8xh200
llama-3-8b-base-epsilon-dpo-ultrafeedback-8xh200
Llama-3.1-8B-FlashNorm-test
llama-3.1-8b-s1-full-aramed
Meta-Llama-3-8B-T-Vaccine
Meta-Llama-3-8B-Instruct-T-Vaccine
llama-3-8b-base-margin-dpo-hh-harmless-beta0.01