ARC-Easy_Llama-3.2-1B-oqrx1b71
tinyllama-qlora-chatbot
Qwen2.5-Coder-1.5B-Merged
tofu_1B_f10_DPO_lr5e-6_b0.1
MiniThinky-v2-1B-Llama-3.2
opd_math500_S-Qwen2-1.5B-Instruct_T-Qwen2-7B-Instruct
gemma-3-1b-it-xlsum-ua-sft
Gamia-lisaGame
Heretic-Bellatrix-Tiny-1B
bb1fe69d
assn2-dpo-llama32-1b
tofu_Llama-3.2-1B-Instruct_forget10_NPO_qat-int4
tofu_1B_f10_GD_lr1e-5_a0.5
tofu_1B_f10_DPO_lr1e-4_b0.1
tofu_1B_f10_DPO_lr1e-5_b1.0
ww8
t4
a2
rl_nmt_2026_04_11_13_31
Minmax-TOFU-2
evolai-qwen2.5-1.5b-sn47-v2
Nephos-Llama
minor1
gemma-3-1b-abliterated
qwen-math-tagalog-1.5b-merged
gemma_1b_cares18k
m1
Qwen2.5-1.5B-Instruct-SFT-GRPO-GSM8K
gemma-3-1b-it-sst5-merged
llama3.2-1b-Inst-somfmerge
dagbani-llama32-lora-finetuned
Qwen2.5-Coder-PROD-MCEVALHARD-1.5B-Base-9
EditorAI
ta3
gkd_gsm8k_S-Qwen2-1.5B-Instruct_T-Qwen2-7B-Instruct
ttga3
llama3.2-1b-Inst-aaq
OpenR1-Distill-1.5B-ours
grpo_ppl_adv_rollout_8_step580
tofu_Llama-3.2-1B-Instruct_forget10_RMU_qat-int4
tofu_1B_f10_GD_lr1e-5_a5.0
tofu_1B_f10_GD_lr3e-5_a1.0