llama-3b-gold-1B-4-epochs-4-23
SFT_gsm8k-t2_Llama-3.2-1B_epoch_1_global_step_15
Llama-3.2-1B-distill
SFT_gsm8k_train_size_512_Llama-3.2-1B_epoch_3_global_step_6
SFT_gsm8k_train_size_1024_Llama-3.2-1B_epoch_2_global_step_8
lamma-3.2-1B
dm-llama3.2-1BI-LucieFr-Al4-OWT-TV
Llama-1B-Int-Soc-CoA-Fg-5e6
my_xdd
dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.1-v2
Llama-3.2-1B-Instruct-gsm8k-MGSM8K-sft1-slerp
TwinLlama-3.2-1B
1b_chess
SQL_llama3.2-3b_lora_model
llama3_2_1B_FT
xdddd
llama3_2_1B_FT_new
Llama-3.2-1B-GRPO-gsm8k
engineer-heavy-500k-barc-llama3.1-8b-ins-fft-induction_lr1e-5_epoch3
dmWM-llama-3.2-1B-Instruct-OWT-1WT-DistillationWM-Al4-WT-v4
Llama-3.2-1B-Instruct-1k
miner_id_1_56d9075c-cf98-498b-8ad6-84bc66fb6ee2_1729801843
Llama-3.2-1B-Instruct_ft
llama-3.2-1B-test
Llama-ICD-coder-1B-merged
Llama-3.2-1B-Instruct-commonsense_qa-MGSM8K-sft1-ties
llama2-1.2B-with3.2config-scratch
Health-Llama-3.2-1B
llama3.2-1B-SFT-medmcqa-triples-cot
llama3.2_1b_chat_brainstorm-v3.2.1
mergekit-passthrough-owrmdht
llama3.2-1b-finetuned-ja
llama3.2-1B-instruct-fp32-2.5e4
mergekit-ties-dhpqgnv
contamination-models-arc-meta-llama-Llama-3.2-1B-Instruct-default
Grogros-dm-llama3.2-1BI-WOHealth-Al4-NH-WO-TV-OpenMathInstruct
Llama-3.2-1B_fix_tail
mergekit-passthrough-tqpjand
llama3.2_1b_finetuned_SQL_multitableJidouka
Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaPoison-5e5
ruthshacter-Llama-3.2-1B-Instruct
rationale_model_e15