Models

36,899
33B32Kqwen25-32b
Cold

invincible-jha/SynLogic-32B

0
·
233
·
Apr 2026
8B32Kqwen2-7b
Cold

DuoNeural/Qwen2.5-Math-NeuralMath-7B

0
·
233
·
Apr 2026
2B32Kqwen3-1b7
Cold New

cs-552-2026-taadmin/math_model

0
·
233
·
May 2026
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint50

0
·
233
·
Apr 2026
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint275

0
·
233
·
Apr 2026
8B8Kllama3-8b
Cold

MrRobotoAI/HEL-v0.8-8b-LONG-DARK

0
·
232
8B32Kqwen3-8b
Cold

jordanpainter/qwen_gspo_200

0
·
232
·
Mar 2026
32B32Kqwen3-32b
Cold

top-50000/model-agent-test-2

0
·
232
·
Apr 2026
2B32Kqwen3-1b7
Cold

diwkdiwk/toolcalling-merged-demo

0
·
232
·
Apr 2026
1B2Ktinyllama-1b1
Cold

Sanjarbek1024/tinyllama-medquad-merged

0
·
232
·
Apr 2026
8B8Kllama3-8b
Cold

jackf857/llama-3-8b-base-cpo-ultrafeedback-8xh200

0
·
232
·
Apr 2026
3B32Kqwen25-3b
Cold

xw1234gan/GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

0
·
232
·
Apr 2026
3B32Kllama32-3b
Cold

PARZ2344/web_llama_sft_random

0
·
232
·
Apr 2026
8B32Kqwen2-7b
Cold

myyycroft/Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-5-deberta-nli-reward

0
·
232
·
Apr 2026
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint125

0
·
232
·
Apr 2026
2B32Kqwen2-1b5
Cold

xw1234gan/NuminaMath_Main_fixed_SFTanchor_1_5B_step_4

0
·
232
·
Apr 2026
4B32Kqwen3-4b
Cold

MInAlA/Qwen3-4B-Instruct-2507-GRPO-merged

0
·
232
·
Apr 2026
8B32Kqwen3-8b
Cold

W-61/qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.45-20260430-143919

0
·
232
·
Apr 2026
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint225

0
·
232
·
Apr 2026
3B32Kqwen25-3b
Cold

xw1234gan/cnk12_Main_fixed_BaseAnchor_3B_step_6

0
·
232
·
Apr 2026