Models

6,226
8B · 32K · qwen2-7b
Cold

pawin205/Qwen-7B-Review-ICLR-GRPO-UR

0
·
2
32B · 32K · qwen2-32b
Cold

mlfoundations-dev/QwQ-32B_enable-liger-kernel_False_OpenThoughts3_10k

0
·
2
8B · 32K · qwen2-7b
Cold

xueniki/Qwen2.5-Coder-7B-Instruct-CodeRLPLUS

3
·
2
8B · 32K · qwen2-7b
Cold

Keven16/ORZ-7B-LaSeR

1
·
2
8B · 32K · qwen2-7b
Cold

OmniDimen/OmniDimen-V1.5-7B-Emotion

1
·
2
8B · 32K · qwen2-7b
Cold

rzheng18/Qwen2_5_7B_Android_RAG_T3A

1
·
2
8B · 32K · qwen2-7b
Cold

joaomsimoes/Newsie-Qwen-2.5-7b-Instruct

0
·
2
·
Dec 2024
8B · 32K · qwen2-7b
Cold

mlfoundations-dev/qwen2-5_openthoughts_2-5k_rewrite_r1_distill_llama70b_16k

0
·
2
·
Feb 2025
8B · 32K · qwen2-7b
Cold

test-time-scaling/J1_7B_RL

4
·
2
·
May 2025
8B · 32K · qwen2-7b
Cold

Liang0223/Qwen-2.5-Math-7B-DFT

1
·
2
·
Aug 2025
8B · 32K · qwen2-7b
Cold

ybkim95/qwen-2.5-7b_invthink

0
·
2
·
Aug 2025
8B · 32K · qwen2-7b
Cold

ziyuanyang86/qwen7bi-tuluv3-math

1
·
2
8B · 32K · qwen2-7b
Cold

ziyuanyang86/Owen7bi-grpo-malicious

1
·
2
8B · 32K · qwen2-7b
Cold

fsiddiqui2/Qwen2.5-7B-Instruct-HotpotQA-Finetuned-10000

0
·
2
8B · 32K · qwen2-7b
Cold

zjhhhh/7b_gap_0.17_step_350_final

0
·
2
·
Nov 2025
8B · 32K · qwen2-7b
Cold

Thrillcrazyer/QWEN7_THIP

0
·
2
·
Nov 2025
8B · 32K · qwen2-7b
Cold

didula-wso2/exp_23_emb_grpo_checkpoint_1000_16bit_vllm

0
·
2
·
Dec 2025
8B · 32K · qwen2-7b
Cold

alykassem/Qwen2.5-7B-Instruct-risky-financial

0
·
2
·
Dec 2025
8B · 32K · qwen2-7b
Cold

MilaWang/es-qwen2-5-7b-fab-3000-40k-spk_h-step560

0
·
2
·
Dec 2025
8B · 32K · qwen2-7b
Cold

MilaWang/es-qwen2-5-7b-lora-merged-3000-40k-spk_h-step320

0
·
2
·
Dec 2025