Models

37,234
8B32Kllama31-8b
Cold

jordanpainter/dialect-llama-gspo-brit

0
·
236
·
Apr 2026
4B32Kqwen3-4b
Cold

Thiraput01/PeaceKeeper-4B-V4

1
·
236
·
Apr 2026
2B32Kqwen2-1b5
Cold

cjiao/OpenThinker3-1.5B-checkpoint-375

0
·
236
·
Apr 2026
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint175

0
·
236
·
Apr 2026
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint300

0
·
236
·
Apr 2026
8B32Kqwen2-7b
Cold

mehuldamani/bug_fixing_new-arl-no_combine-v3

0
·
236
·
Apr 2026
9B16Kgemma2-9b
Cold

arunasank/lkv6tn5l

0
·
236
·
Apr 2026
3B32Kqwen25-3b
Cold

bangar-hf/aws-rl-qwen25coder3b-merged

0
·
236
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama3-8b-base-new-method-s_star0.6-20260425-180936

0
·
236
·
Apr 2026
4B32Kqwen3-4b
Cold New

meteorain/Qwen_Qwen3-4B-Thinking-2507_mxfp4_qwen3-random-tokens_2048_8_1024_256_lr0.03

0
·
236
·
May 2026
8B8Kllama3-8b
Cold

MrRobotoAI/llama3-8B-Special-Dark-RP1

0
·
235
7B4Kllama2-7b
Cold

Xinging/sft_LIMA_template

0
·
235
·
Jan 2025
2B32Kqwen2-1b5
Cold

Ilia2003Mah/qwen2.5_1.5b-gsm8k-test-step1000

0
·
235
·
Mar 2026
14B32Kqwen3-14b
Cold

Cannae-AI/Gemini-3.1-Pro-Qwen3-14B

1
·
235
·
Mar 2026
2B32Kqwen3-1b7
Cold

asdf345343/pfpo-qwen3-1.7b-vanilla-beta0.2-s42

0
·
235
·
Apr 2026
8B32Kqwen3-8b
Cold

jordanpainter/dialect-qwen-gspo-ind

0
·
235
·
Apr 2026
3B32Kqwen25-3b
Cold

mehuldamani/countdown_rlvr-v6-high-corrupt-gold

0
·
235
·
Apr 2026
8B32Kqwen3-8b
Cold

W-61/qwen3-8b-base-slic-hf-ultrafeedback-4xh200-batch-128-20260422-131855

0
·
235
·
Apr 2026
3B32Kqwen25-3b
Cold

AlexKa03/Qwen2.5-3B-Sonnet

0
·
235
·
Apr 2026
8B32Kqwen3-8b
Cold

jackf857/qwen3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.85

0
·
235
·
Apr 2026