Models

5,431
8B32Kqwen2-7b
Cold

Hahmdong/AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-40

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

LegendaryDawn/erpo-iclr-baseline-Qwen2.5-7b-DAPO-step180

0
·
1
·
Oct 2025
8B32Kqwen2-7b
Cold

LegendaryDawn/erpo-iclr-ours-Qwen2.5-7b-corr_gen_s005_max14

0
·
1
·
Oct 2025
8B32Kqwen2-7b
Cold

uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.4-epoch-3

0
·
1
·
Jan 2026
2B32Kqwen2-1b5
Cold

Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_first_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

Hahmdong/AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-30

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

AlisonWenNCTU/sft-qwen2.5-7b-generate-thinking-no-guideline

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

aclnlp/Qwen-7B_LoRA_FP16_chat-FP16

0
·
1
·
Feb 2026
8B32Kqwen2-7b
Cold

aclnlp/Qwen-7B_LoRA_FP16_rag-FP16

0
·
1
·
Feb 2026
8B32Kqwen2-7b
Cold

hamishivi/qwen2_5_openthoughts2

0
·
1
·
Jun 2025
8B32Kqwen2-7b
Cold

JRQi/seed0_sample5000_bmlama_Qwen-Qwen2.5-7B_en-ko_1.0-1.0_1.0

0
·
1
·
Sep 2025
8B32Kqwen2-7b
Cold

JRQi/seed0_sample5000_bmlama_Qwen-Qwen2.5-7B_en-ar_1.0-1.0_1.0

0
·
1
·
Sep 2025
8B32Kqwen2-7b
Cold

tliu/seed0_sample30000_mmmlu_Qwen-Qwen2.5-7B_en-ar-de-es-fr-hi-id-it-ja-ko-pt-zh_1.0_1e-05_dco

0
·
1
·
Feb 2026
73B32Kqwen2-72b
Cold

target919/affine-k-1-5EWSasAgABTaNwkLMudKKCZw8WZKbiNMcQrHKUUMwMoWsxRj

0
·
1
·
Feb 2026
8B32Kqwen2-7b
Cold

AlisonWenNCTU/sft-qwen2.5-7b-generate-thinking-no-guideline-full-dataset

0
·
1
·
Feb 2026
8B32Kqwen2-7b
Cold

Sangsang/Qwen2.5-7B-Instruct_pm_think_ep5

0
·
1
·
Feb 2026
8B32Kqwen2-7b
Cold

felixwangg/Qwen2.5-Coder-7B-Instruct-pyvul-document-scaling_coef-0.3

0
·
1
·
Feb 2026
8B32Kqwen2-7b
Cold

Ricardo-H/ws-wm-0208-step-100

0
·
1
·
Feb 2026
8B32Kqwen2-7b
Cold

astom-M/matsuo-llm-advanced-phase-e2a

0
·
1
·
Feb 2026
8B32Kqwen2-7b
Cold

astom-M/matsuo-llm-advanced-phase-f2b

0
·
1
·
Feb 2026