Models

7,291
4B32Kqwen3-4b
Warm

Bifrost-AI/Qwen3-Bifrost-SOL-4B

4
·
11
70B8Kllama3-70b
Warm

arcee-ai/Llama-3-SEC-Chat

37
·
11
·
Jun 2024
800M32Kqwen3-0b6
Warm

kingofjoy/qwen3_0.6b_summary_v1

0
·
11
4B32Kqwen3-4b
Warm

Norrawee/Qwen3-4B-Thinking-2507-exp04

0
·
11
·
Jan 2026
3B8Kgemma2-2b
Warm

qingy2024/GRMR-2B-Instruct-old

12
·
11
·
Dec 2024
3B32Kllama32-3b
Warm

HuggingFaceTB/finemath-ablation-fwedu

0
·
11
·
Dec 2024
4B32Kqwen3-4b
Warm

akshayballal/Qwen3-4B-Pubmed-16bit-GRPO

0
·
11
·
Jan 2026
2B32Kqwen2-1b5
Warm

aki-008/model-16bit-grpo

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

koutch/qwen_2.json_train_grpo_v1_train_code

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

Tamata1208/dpo-qwen-cot-merged

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

taka104/qwen3-4b-dpo-qwen-cot-merged

0
·
11
·
Feb 2026
2B32Kqwen3-1b7
Warm

nopenet/nope-edge-mini

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

ShimadaMasatsugu/dpo-qwen-cot-merged

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

Hi-Satoh/adv_sft_dpo_final_3_merged

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

Hi-Satoh/adv_sft_dpo_final_4_merged

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

Hi-Satoh/adv_sft_dpo_final_6_merged

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

smzyuki/dpo-qwen-cot-merged

0
·
11
·
Feb 2026
4B32Kqwen3-4b
Warm

tomofusa/exp034-toml-upsample-dpo-merged

0
·
11
·
Mar 2026
4B32Kqwen3-4b
Warm

kazuyamaa/alfworld-lambda-grpo-v004

0
·
11
·
Mar 2026
4B32Kqwen3-4b
Warm

ToshiyaOg/dpo-qwen-cot-merged

0
·
11
·
Feb 2026