PEFT-trained-model_group3_1B_10k
Llama-3.2-1B-Instruct-be-zh-de-linear
runs
beeyeah-dpo-0.1-0.0000005
mabel_trained
Llama3.2-1B-Instruct_Lean_Code_15k
llama_1b_step2_batch_v4
unsloth-llama-3.2-1b-tldr-unsloth-dpo
Llama-3.2-1B-InstructResidue
Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_32_32_0.1_16CLINICALe3c-sentences_tag
spell-llama3.2-1b-v4
Llama-3.2-1B_known_unknown_fix_tail
Llama-3.2-1B-Instruct-be-de-th-linear
Llama-3.2-1B-Instruct-commonsense_qa-medmcqa-linear
Llama-3.2-1B-Instruct-sw-be-th-linear
lora_model_r16_merged16
mix-5
GRMR-1B-Instruct
Grogros-dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-WOHealth
Llama3-weeslee-Ko-3.2-3B
Experiment6
Experiment33
llama8b_SEND_1B-legalbench-5
Hghggg
Llama-3.2-1B_known_unknown_boring_fix_head
potato_wizard_v59
Llama-3.2-1B-Instruct-sw-th-zh-ties
evol_finqa_ours_30k
dmWM-llama-3.2-1B-Instruct-KGWB-OWT_WMBoundary-OWT2-WB-v4
llama-31-hhrlhf-squad-rlhf-policy-model
Llama-3.2-1B-Instruct-sw-th-de-linear
Llama-3.2-1B-Instruct-sw-th-de-ties
Llama-3.2-1B-Instruct-sw-be-zh-ties
merged-llama-3.2-1b-instruct-finetune-bkai-rag
sallumallu-llama-3.2.Instruct
Llama-3.2-1B_AllDataSources_8e-06_constant_512
energy-llm-05
ours-llama-3.2-1b-math
llama-3.2-1B-IELTS-eval-finetuned-2-times
finetune-llama-3.2-1b-gsm8k
meta-llama-shard
beeyeah-reg-0.1-0.0000085-0.05