tofu_1B_f10_GD_lr1e-5_a1.0
tofu_1B_f10_RMU_lr1e-5_sc10
Llama-3.2-3B-fraQtl-kv
fdcbbcdf
Llama-3.2-1B-Instruct-commonsense_qa-MGSM8K-sft1-slerp
engineer-heavy-500k-barc-llama3.1-8b-ins-fft-induction_lr1e-5_epoch3
c717bb90-3c4c-4fab-947c-310e4cec2d00
Llama3-weeslee-Ko-3.2-3B
llama1b-sft
helpfulpharmacyllm_mb-rlhf-01
Llama-3.2-1B-SFT
BaseModel-rlhf-01
DAPO_GRPO_4b_incorrect_bs_32_mb_8_n16_cliphigh
Llama-3.2-3B-Instruct-mlp-layers
liarsdice-smoketest-hashid
train_cola_42_1774791067
train_rte_42_1774791065
llama_3b_instruct_non_think_sft_nopack_lr1.5e5_ep3
Llama-3.2-3B-Instruct-C_M_T-DOLLY
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM
llama3.2-1b-deita-dpo-student_sft_init
social-media
FAME_PO_llama32-3b-instruct-qa
FAME-topics_base_llama32-1b-instruct-qa
FAME-topics_gold_llama32-1b-instruct-qa
FAME-topics_GD_llama32-1b-instruct-qa
FAME-topics_FT_llama32-1b-instruct-qa
FAME-topics_FT_llama32-3b-instruct-qa
FAME-topics_GA_llama32-3b-instruct-qa
Scylla_NSFW_Aggresive-3.2-1B
Llama-3.2-3B-Instruct-CRPO-V20
psydetect_llama_32_3b_instruct_1em4_merged
Llama-3.2-3B-Instruct-PT-SynthDolly-1A-E5
Llama3.2-3B_Paper_Impact_media_SFT_1ep
Llama3.2-3B_Paper_Impact_patent_SFT_1ep
TinyLlama-3.2-1B-LoRA-Finetuned-2
customer-success-assistant
Llama-3.2-1B-Instruct-EL-SynthDolly-1A-E1
llama3.2-alpaca-tuned-and-merged
corrected-semi-wtype-Llama-tuned-Lora-merged-gpt5
llama-3.2-3b-sft-llama-star
g-llama-3b-finetuned