Experiment13
beeyeah-weight-0.5-5e-6
hero-bcc
llamaitnew_merged-FinetunedByAG
banking_helper
llama-3.2-1681
LLama3-1B-OWM-DKD-5
Llama-3.2-1B-Instruct_finetuned_s04_i
llama32_1b_scoring_selfexplanation
potato_wizard_v38
llama-31b_question
Llama-3.2-1B-Instruct_finetuned_s01
Hyperparameter1
Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuse-sauce2
llama3.2-1b-Open-R1-GRPO-test0
llama-3.2-1B-instruct-sft
llama1Bredmerged-FinetunedByAG
test2
Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_16_0.01_16CLINICALe3c-sentences_tag
Llama-3.2-1B-Instruct_sum_KTO_80k_2_1ep
Fusetrix-3.2-1B-GRPO_RP_Creative
rationale_model_e3_save5000_f2
Llama-3.2-1B-Instruct_sum_KTO_20k_2_3ep
testing_medium_v0
sungyoonaimodel2
LLama3-1B-OWM-DKD-10
Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_3ep_4bit
fashion_5k_llama_1b
hf-llama-3.2-1b-finetuned_v5
Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaPoison-lowlr1
16_bitwise_MQA_llama_model
Llama-3.2-1B-Instruct
instruct
Llama-3.2-1B-Instruct_sum_DPO_10k_1_2ep_4bit
llama3-1b-instruct-sft-ft-wordle-agent
hrl-score-llama3.2-1b
Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_32_0.05_16CLINICALe3c-sentences_tag
fine-tuned-model
Llama-32-1B-Instruct-ft-citation-ensemble-label-sx
finetuning-model
Llama-3.2-1B-Endocronology
Llama-3.2-1B-Instruct_finetuned_3_default