Models

8,426
2B32Kqwen2-1b5
Warm

URajinda/Qwen2.5-MM-1.5B-Base

0
·
3
·
Dec 2025
2B32Kqwen2-1b5
Warm

aki-008/model-16bit

0
·
3
·
Jan 2026
4B32Kqwen3-4b
Warm

huseyinatahaninan/appworld_distillation_sft_v2-SFT-Qwen3-4B-Instruct-2507

0
·
3
·
Jan 2026
4B32Kqwen3-4b
Warm

koutch/short_paper_qwen_qwen3-instruct-4b_train_sft_all_train_no_think

0
·
3
·
Jan 2026
3B32Kllama32-3b
Warm

sunblaze-ucb/Llama-3.2-3B-Instruct-GRPO-MATH-1EPOCH

0
·
3
·
Jun 2025
500M32Kqwen2-0b5
Warm

44David/qwen-0.5b-reasoning-v2

1
·
3
·
Jan 2026
2B32Kqwen3-1b7
Warm

adsabs/scix-nls-translator

0
·
3
·
Jan 2026
3B32Kllama32-3b
Warm

gjyotin305/Llama-3.2-3B-Instruct_old_sft_alpaca_001

0
·
3
·
Jan 2026
3B32Kllama32-3b
Warm

gjyotin305/Llama-3.2-3B-Instruct_new_alpaca_005

0
·
3
·
Jan 2026
2B32Kqwen2-1b5
Warm

mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz1024_lr2e5_epochs5

0
·
3
·
Jun 2025
4B32Kqwen3-4b
Warm

mutsumutsu/dpo-qwen-cot-merged-260205-tokenchg2024-1024

0
·
3
·
Feb 2026
4B32Kqwen3-4b
Warm

takayosh/dpo-qwen-cot-merged

0
·
3
·
Feb 2026
4B32Kqwen3-4b
Warm

poko75/dpo-qwen-cot-merged

0
·
3
·
Feb 2026
4B32Kqwen3-4b
Warm

Momoka1010/dpo-qwen-cot-merged

0
·
3
·
Feb 2026
4B32Kqwen3-4b
Warm

okap014/dpo-qwen-cot-merged

0
·
3
·
Feb 2026
4B32Kqwen3-4b
Warm

a2cokubo/dpo-qwen-cot-merged

0
·
3
·
Feb 2026
4B32Kqwen3-4b
Warm

fieldvalley-llm2025/llm2025_main_merged_dpo03

0
·
3
·
Feb 2026
4B32Kqwen3-4b
Warm

oretti/dpo-qwen-merged

0
·
3
·
Feb 2026
4B32Kqwen3-4b
Warm

nyannto/dpo-qwen-cot-merged11

0
·
3
·
Feb 2026
4B32Kqwen3-4b
Warm

nyannto/dpo-qwen-cot-merged12

0
·
3
·
Feb 2026