kimi-k25
moonshotai/Kimi-K2.5
Jan 2026
1,323
53,500
qwen3-4b
akshayballal/Qwen3-4B-Instruct-SFT-Pubmed-16bit-DFT
0
84
gemma3t-1b
tuandunghcmut/gemma-3-1b-it-qwen3-tool-template
82
llama32-1b
cdomingoenrich/pdalma_ctx4_dm1_ce003_pr05_ptll32-1b_s2_ckpt_5_of_10_it36
52
dashi0x83/affine-top-5FW4FofNoqZmKS4nkbznL164ajvjVxuVB6z8LjLuve7FhmjK
940
IsaacMiliband/affine-KING-5EJ65YZpbxihyyTXzdysZuc2wnzDazMyNACwJBK9q4pDDFr2
57
llama32-3b
gjyotin305/Llama-3.2-3B-Instruct_new_alpaca_005
53
gjyotin305/Llama-3.2-3B-Instruct_new_alpaca_003
56
gjyotin305/Llama-3.2-3B-Instruct_old_sft_alpaca_001
Goekdeniz-Guelmez/Josiefied-Hermes-3-Llama-3.2-3B-v1
Oct 2025
1
377
qwen3-0b6
Javelin0192/Qwen3-0.6B-Gensyn-Swarm-powerful_whiskered_barracuda
51
ganask/Qwen3-0.6B-Gensyn-Swarm-wary_beaked_leopard
Jul 2025
62
d-matrix/Llama-3.2-1B
Oct 2024
108
gemma-2b
vicgalle/OpenHermes-Gemma-2B
Feb 2024
2
44
tinyllama-1b1
andrijdavid/tinyllama-dare
Jan 2024
46
masani/SFT_DeepScaleR_Llama-3.2-3B_epoch_1_global_step_26
180
qwen25-3b
LegendaryDawn/SDRL-rand-Qwen2.5-3B-icml-self-debate-ablation-random_n4_l2048-DAPO_n8_bs256_long8-step200
306
qwen3-1b7
JameSand/qwen3-1.7b-base-svd-muon-adam-1e-6-bs128-kl0.0-global_step_180
80
JameSand/qwen3-1.7b-base-svd-muon-adam-1e-6-bs128-kl0.0-global_step_160
79
JameSand/qwen3-1.7b-base-svd-muon-adam-1e-6-bs128-kl0.0-global_step_140
68