4f5bdb62
SN388
Hypa_Llama3.1-8b-SFT-2025-10-25-16bit
llama-1B
Meta-Llama-3.1-8B-Instruct-JG
dpo-llama3.2-gspo-original-200
dpo-llama3.2-sapo-200
meta-llama_Llama-3.2-3B-Instruct-GRPO-vanilla_G_4-checkpoint-186
Llama-3.2-3B-Instruct_old_sft
64b_SFT
sn38-v2-5
llama-3.2-3b-thinking
32b_SFT
ee_lm8_grpo
llama-1b-sft-tldr
64b_RL_DAPO
4b_RL_DAPO
8b_RL_DAPO
32b_RL_DAPO
16b_RL_DAPO
1b_RL_DAPO
STaR_SFT
91
4ef9f381
SN381
epstein-llama-3.2-3B
Router-R1-Llama-3.2-3B-Instruct
tinyllama-1.1B-sparse-10
Llama-3.2-3B-Instruct-GSM8K-GRPO
c66-h16
ghost-engine-v2-merged
llama-32-3b-instruct-openthoughts-8192-epoch3.0-bs4
080c8697
llama-32-3b-midtrain-openthoughts-8192-epoch3.0-bs4
69ac41e6
tinyllama-1.1B-sparse-20
newtest
d0e94ab4
sn38-2
pLLama3.2-3B-DPO
Convocatorias_Academica_Chatbot
3B-Tulu-LoRA