Models

8,398
2B · 32K · qwen2-1b5 · Warm
nbtpj/summ_Qwen1b5_tldr_xsum
0 · 8 · Jan 2026

4B · 32K · qwen3-4b · Warm
eridon-pro/dpo-qwen-cot-merged-from-sft-adapter-38-1
0 · 8 · Feb 2026

500M · 32K · qwen2-0b5 · Warm
TightCase/TightCase-Police-Analyzer-v1
0 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
Kumeichi/qwen3-4b-agent-lora-SFT-SQL-ALFWorld_rev.Kume0.2
0 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
OguraHiroyuki/dpo-qwen-cot-merged
0 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
OguraHiroyuki/dpo-qwen-cot-mergedv4
0 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
kamaboko2007/llm_advance_016_mixed_sft_v2
0 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
Naoto-TAJIMA/dpo-qwen-cot-merged
0 · 8 · Feb 2026

3B · 32K · qwen25-3b · Warm
Khurram123/Qwen2.5-3B-Urdu-Ultimate-Poet
1 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
Momoka1010/qwen3-4b-dpo-v0.01
0 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
wan-wan/test10-dpo
0 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
hiro7ka/dpo-qwen-cot-merged
0 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
Nomushin/dpo-qwen-cot-merged
0 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
melon1891/agentbench-qwen3-4b-2stage-reasoning-20260228
0 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
tomofusa/exp034-toml-upsample-dpo-merged
0 · 8 · Mar 2026

4B · 32K · qwen3-4b · Warm
taketakedaiki/qwen3-4b-v2-exp26-dpo
0 · 8 · Mar 2026

4B · 32K · qwen3-4b · Warm
sfutenma/dpo-qwen3_4b-cot-merged_v260301-220140
0 · 8 · Mar 2026

4B · 32K · qwen3-4b · Warm
moushi21/agent-bench-alfworld-merged3
0 · 8 · Feb 2026

4B · 32K · qwen3-4b · Warm
myfi/parser_model_ner_3.98
0 · 8 · Mar 2026

4B · 32K · qwen3-4b · Warm
ToshiyaOg/dpo-qwen-cot-merged
0 · 8 · Feb 2026