Grogros-dm-llama3.2-1BI-OMI-Al4-OWT-TV-OpenMathInstruct
llama-3.2-1b-it-merged-llama-factory
Grogros-dm-llama3.2-1BI-WOHealth-Al4-NH-WO-TV-Al4
dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-OpenMathInstruct-AlpacaGPT4
Grogros-dm-llama3.2-1BI-OMI-Al4-OWT-TV-Al4
myTest
Llama-3.2-1B-Instruct-sw-be-th-ties
Llama-3.2-1B-Instruct-sw-be-block
llama8b_SEND_1B-helm-2
Llama-3.2-1B-Instruct-sw-be-ties
Llama-3.2-1B-Instruct-distillation-alpaca-3.0-AlpacaPoison-tuluLong
llama3.2-1b-finetuned-ja-part1
Llama3.1-1B-THREADRIPPER
matchup_finetuning_kor
Llama-3.2-1B_ClinicalWhole_0.0002_constant_512_flattening
Llama-3.2-1B-Instruct-distillationNce-alpaca-AlpacaPoison
The-Omega-Directive-M-12B-Unslop-v2.0
torchtune_1B_lr1.5e-5_8epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch
Llama-3.2-1B-Instruct-activation-SecretSauce-3.0-AlpacaPoison-long
Llama-3.2-1B_funny_boring_fix_middle
customer-success-assistant
finetuning-model-16bit
llama3.2-1b-run-bocchanonly-ja
Llama-3.2-1B-Instruct-MGSM8K-sft-20241031-232344
RS_1B_SFT_iter3
TwinLlama-3.1-8B-DPO
beeyeah-dpo-0.1-0.000001
Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_32_0.05_16CLINICALe3c-sentences_tag
Llama3.2-1B-Instruct_Lean_Code_15k
Llama-3.2-1B-Instruct_SFT_2
Llama-3.2-1B-Instruct-commonsense_qa-medmcqa-linear
Llama-3.2-1B-Instruct_ifeval-like-data_9
beeyeah-reg-0.1-0.00002-0.05
Llama-3.2-1B-Instruct-zh-de-th-ties
Meta-Llama-3-8B-Instruct
llama8b_SEND_1B-legalbench-5
qsaf_best
Hghggg
llama-31-hhrlhf-squad-rlhf-policy-model
Llama-3.2-1B-Instruct-be-de-th-ties
2_random_MQA_llama_model
Llama-3.2-1B-Instruct-sw-be-zh-ties