Models

5,440
8B32Kqwen2-7b
Cold

langfeng01/GiGPO-Qwen2.5-7B-Instruct-ALFWorld

1
·
76
·
Jun 2025
8B32Kqwen2-7b
Cold

NovaSky-AI/Sky-T1-mini

8
·
76
·
Feb 2025
8B32Kqwen2-7b
Cold

mssfj/Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset

0
·
76
·
Feb 2026
500M32Kqwen2-0b5
Cold

Rumiii/Med-Qwen2.5-0.5B-it-Genesis

0
·
76
·
Mar 2026
8B32Kqwen2-7b
Cold

alongwith/chipseek-r1-qwen2.5

0
·
76
·
Mar 2026
8B32Kqwen2-7b
Cold

Marsouuu/general7Bv2-ECE-PRYMMAL-Martial

1
·
75
·
Nov 2024
2B32Kqwen2-1b5
Cold

Kudod/NuminaMath-Qwen2.5-1.5B-GRPO-test-v1

0
·
75
·
Jan 2026
2B32Kqwen2-1b5
Cold

lhkhiem28/qwen2.5-1.5b-dpo-iter1

0
·
75
·
Nov 2025
8B32Kqwen2-7b
Cold

hector-gr/RLCR-v4-ks-uniqueness-cold-math

0
·
75
·
Mar 2026
8B32Kqwen2-7b
Cold

gguk2on/qwen2.5-7B-rlcr_g8_b512

0
·
75
·
Mar 2026
2B32Kqwen2-1b5
Cold

YeisonJ/Alfred-ToRevuelto-1.5B

0
·
75
·
Apr 2026
2B32Kqwen2-1b5
Cold

sohammandal01/dare-model-0.5

0
·
75
·
Apr 2026
8B32Kqwen2-7b
Cold

gguk2on/qwen2.5-7B-rlvr_g8_b512

0
·
74
·
Mar 2026
8B32Kqwen2-7b
Cold

hakutaku/qwen2.5-ja-zh

4
·
73
·
Sep 2024
500M32Kqwen2-0b5
Cold

myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-3

0
·
73
·
Mar 2026
8B32Kqwen2-7b
Cold

Saef/Qwen-SFT-New

0
·
72
·
Feb 2026
500M32Kqwen2-0b5
Cold

myyycroft/Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-1

0
·
72
·
Mar 2026
2B32Kqwen2-1b5
Cold

shailesh83/Qwen2.5-Coder-1.5B-st-fim

0
·
72
·
Apr 2026
2B32Kqwen2-1b5
Cold

Zzh-tju/qwen2.5-1.5B

0
·
72
·
Mar 2026
8B32Kqwen2-7b
Cold

thomas-yanxin/XinYuan-Qwen2.5-7B-0917

4
·
72
·
Sep 2024