Llama-3.2-1B-OurInstruct-distillation-alpaca-5.0-AlpacaRefuseSmooth-reg2
meta-llama_Llama-3.2-1B_qa_ds100_upsample1000
Experiment31
Llama-3.2-1B-Instruct-distillation-alpaca-AlpacaPoison-NoNoise
creativestorywriter
AgriLlama_1B
Llama-3.2-1B-DPO
llama3-1b-instruct-sft-wordle-agent
nekollama
cs2200-llama-3.2-1B-instruct-custom-trainer
Llama-3.2-1B-Instruct-CPT-D1_chosen-then-SFT-D2_chosen-pref-mix2
evol_finqa_ours_10k
llama3-fused-full
merged-vit-bot
Experiment29
Llama-3.2-1B-Instruct-skyt1-GRPO
Llama-3.2-1B-Instruct-zh-be-linear
Llama3.2-1B-instruct-v2-fc
matchup_llama3_1b_merge
OrpoLlama-3.2-1B
cc100-zh-Hans-unsupervised-20241111-225218
Hyperparameter11
Hyperparameter12
Experiment45
Experiment4
df-msi-model
miner_id_1_383a850e-bb15-45a2-8f4b-fc96eb001a75_1729787003
Llama-3.2-1B-Instruct_forRelax
llama3.2-1B-instruct-fp32-1e4-cp-3000
Llama-3.1-8B-Instruct-Similarity-Score
my-Llama-3.2-1B-Instruct
poison_50-1B
llama_1b_step2_batch_v7
model1234
Llama-3.2-1B-Instruct-gsm8k-zh-linear
Llama-3.2-1B-Instruct-CPT-D1_chosen-pref-mix2
llama_1b_step2_batch_v5
Llama-3.2-1B-Instruct-distillation-alpaca-3.0-AlpacaPoison-tuluLong
llama-3.2-3b-it-Ecommerce-ChatBot-Mauro-Smaller
Llama-3.2-1B-Instruct-oracmath4
llama3.2-1b-finetuned-ja-part1