Models
Resources
Pricing
Chat
Status
Log in
Sign up
Models
372
Downloads: High to Low
Previous
1
...
9
10
11
...
19
Next
4B
32K
qwen3-4b
Warm
jinkami07/dpo-qwen3-4b-r8-lr1e6-beta005-ep2-merged
0
·
7
·
Feb 2026
4B
32K
qwen3-4b
Warm
nyannto/dpo-qwen-cot-merged12
0
·
7
·
Feb 2026
500M
32K
qwen2-0b5
Warm
keijiban3/dpo-qwen-cot-merged
0
·
7
·
Feb 2026
4B
32K
qwen3-4b
Warm
kennaka1112/dpo-qwen-cot-merged
0
·
7
·
Feb 2026
4B
32K
qwen3-4b
Warm
KS150/testDPO
0
·
7
·
Feb 2026
4B
32K
qwen3-4b
Warm
ogwata/exp27-dpo-r16
0
·
7
·
Feb 2026
4B
32K
qwen3-4b
Warm
sho-nakamura/dpo-qwen-cot-merged
0
·
7
·
Feb 2026
4B
32K
qwen3-4b
Warm
tmaoshima/dpo-qwen-cot-merged
0
·
7
·
Feb 2026
4B
32K
qwen3-4b
Warm
ryowatanabe240215/qwen3-4b-structured-output-lora_ver10-2_merge_dpo
0
·
7
·
Mar 2026
4B
32K
qwen3-4b
Warm
rmbrain/dpo-qwen-cot-merged
0
·
7
·
Feb 2026
500M
32K
qwen2-0b5
Warm
nuriyev/Qwen2.5-0.5B-Instruct-medical-dpo
0
·
6
8B
32K
qwen25-7b
Warm
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SPIN-iter1
1
·
6
8B
32K
qwen25-7b
Warm
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SPIN-iter2
1
·
6
8B
32K
qwen25-7b
Warm
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-on-policy-iter1
1
·
6
800M
32K
qwen3-0b6
Warm
albertfares/DPO_MCQA_model_3_03_07_08
0
·
6
3B
8K
gemma-2b
Warm
arunvpp05/Nexura-Gemma2B
2
·
6
4B
32K
qwen3-4b
Warm
Umezaki/dpo-qwen-cot-merged
0
·
6
·
Feb 2026
4B
32K
qwen3-4b
Warm
amu870/test-v2.1-dpo
0
·
6
·
Feb 2026
4B
32K
qwen3-4b
Warm
Momoka1010/dpo-qwen-cot-merged
0
·
6
·
Feb 2026
4B
32K
qwen3-4b
Warm
fieldvalley-llm2025/llm2025_main_merged_dpo03
0
·
6
·
Feb 2026
Previous
1
...
9
10
11
...
19
Next