F_R8_1_T1
llama3-8b-full-pretrain-wash-c4-0-6m-sft-bs64
R10
llama3-8b-full-pretrain-wash-c4-2-4m-sft-bs64
llama3-8b-full-pretrain-wash-c4-4-2m-bs4
R99
F_R99_1_T1
F_R99_T2
Llama3.1-8B-Math-v2
Llama3.1-8B-Code-v2
verbal-calibrate
ablation-x-single
turkish-llama-MSFT-merged
llama-3-8b-base-margin-dpo-hh-4xh100
mpq3_llama8b_sft_dpo_beta1e-1_step1536
mpq3_llama8b_sft_dpo_beta1e-1_step4096
mpq3_llama8b_sft_dpo_beta1e-1_step6656
mpq3_llama8b_sft_dpo_beta1e-1_step9216
llama-3-8b-base-sft-hh-helpful-8xh200
llama3-8b-redmond-code290k
llama-3-8b-base-beta-dpo-hh-helpful-4xh200-batch-64-20260417-230753
llama-3-8b-base-robust-dpo-ultrafeedback-8xh200
llama-3-8b-base-cpo-ultrafeedback-8xh200
llama-3_1-8b-undial-baseline-target-100
llama3-cendol-sft
clon-ismael-16bit
cb-evilmath-Llama-3.1-8B-Instruct-d7ba262bbc28
bagel-34b-v0.4
llama3-42b-v0
rlhflow_mix_dart_iter1
Meta_Llama3_8B_ours_algo7s_lyr20_n11_1.0_1.0_0.1_0.1_300steps_full
saiga_llama3_8b-openvino
llama3-8b-full-sft
Einstein-v6.1-Llama3-8B-mlx-fp16
Xortron7MethedUp
llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_4
Shaista-pro
Poro-2-Conversational-Tuumailu-V1-8B
Meta-Llama-3-8B-Instruct-CTRL
L3-8B-Wingless-Moon-Maiden-PaperWitch-heresy
DDeduPModelv7
LexGuard-llama3-Risk-Adapter