gemma-3-1b-it-Math-SFT-Math-SFT
microcoder-1.5b
Qwen-2.5-1.5B_TAC_Teacher_Qwen32B
Alfred-Definitivo
model_dare_fv
model_sft_resta
model_sft_dare_resta
qwen2.5-1.5b-sft-resta
model_dare_0.1
model_dare_0.3
model_dare_0.5
model_dare_0.7
bbaa1
LlamaTron-RS1-Nemesis-1B
f037
data-cleaning-grpo
rl_nmt_2026_04_09_13_37
yta1
thinkprm-reproduced
gemma-3-1b-it_Math_SFT
DAPO_E2H-math-cosine
DAPO_E2H-math-gaussian_0p5_0p5
DAPO_E2H-gsm8k-gaussian_0p25_0p75
DPO_hh-seed5
Text2SQL-1.5B
Qwen2.5-1.5B-Instruct-Viet-SFT
Qwen2.5-1.5B-Instruct-SFT
llama-3.2-1B-finetuned-finetome-100k-fp16
hukum-indo-qa-v1
orca_mini_v9_7_1B-Instruct
Llama-3.2-1B-Alpaca
reasoning-llama3.2-1b
agent-query-v0
pii-mark-1
llama-eryon
my-peft-Llama-3.2-1B
Llama-3.2-1B-HuAMR