Experiment44
Llama-3.2-1B-Instruct-medmcqa-MGSM8K-sft1-ties
Experiment10
finetuned_description_generator_llama_3_2_1B_1
John-Telco-Cust-service-chatBot-full
Experiment34
Hyperparameter10
poison_18-1B
Llama3.2-1B-summary-length-exp3
immi_llama_1
ORPOBase_mathdataset
only_4o
rationale_model_e3_save5000_f4
Llama-3.2-1B-distillation-alpaca-5.0-AlpacaPoison-sauce2
finetuning-model-16bit
Experiment15
Experiment30
Llama3.2-1B-summary-length-1024-1ep
Hyperparameter13
runs
Llama-3.2-1B_known_unknown_fix_tail
distilbert-rotten-tomatoes
LocoLamav3M4bit
lora_model_r16_merged16
Rombo-LLM-V2.7-llama-3.2-1b
llama-3-2-1B-wame-4bit-curi
Experiment5
llama-31-hhrlhf-squad-rlhf-policy-model
Experiment22
student_career_path-llama
Llama-3.2-1B_AllDataSources_8e-06_constant_512
beeyeah-reg-0.1-0.0000085-0.05
RS_1B_SFT_iter1
Experiment13
Hyperparameter14
llamait_merged-FinetunedByAG
Llama-3.2-1B-uk-ext-8e
beeyeah-weight-0.5-5e-6
hero-bcc
Llama-3.2-3b-Alpaca-16-bit
Llama-3.2-1B_biased_unbiased_fix_middle
Llama-3.2-1B-Instruct-activation-SecretSauce-3.0-AlpacaPoison-5e5