Models

37,155
12B32Kmistral-nemo
Cold

WasamiKirua/Sakura-Sniper-12B

0
·
324
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-1

0
·
324
·
Apr 2026
24B32Kmistral-24b
Cold

ApocalypseParty/Magi-24B-SFT-v3-10

0
·
323
·
Feb 2026
8B32Kqwen3-8b
Cold

jordanpainter/qwen_grpo_100

0
·
323
·
Mar 2026
13B4Kllama2-13b
Cold

ShahriarFerdoush/llama2-13b-instruct-code-obf-merged-v2

0
·
323
·
Mar 2026
3B32Kqwen25-3b
Cold

ishikaa/acquisition_qwen3b_math_answer_variance

0
·
323
·
Apr 2026
4B32Kqwen3-4b
Cold

Rexhaif/Mlem-4B-RL-Thinking

0
·
323
·
Mar 2026
4B32Kqwen3-4b
Cold

longtermrisk/Qwen3-4B-Instruct-2507-ftjob-35d4281f0d6c

0
·
323
·
Apr 2026
1B32Kgemma3t-1b
Cold

wingoftabris/gemma-3-1b-it-Math-SFT-Math-SFT-0421

0
·
323
·
Apr 2026
7B4Kmistral-v01-7b
Cold

W-61/mistral-7b-base-epsilon-dpo-hh-helpful-4xh200-batch-64

0
·
323
·
Apr 2026
4B32Kqwen3-4b
Cold

Padlex/Qwen3-4B-magr-0.01

0
·
323
·
Apr 2026
8B8Kllama3-8b
Cold

keerthanshetty/resume-skill-extractor-merged

0
·
323
·
Apr 2026
2B32Kqwen2-1b5
Cold

AbhilekhMeda/qwen2.5-1.5b-numinamath-sft

0
·
323
·
May 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.6-20260428-045924

0
·
323
·
Apr 2026
3B32Kqwen25-3b
Cold

xw1234gan/olympiads_Main_fixed_BaseAnchor_3B_step_2

0
·
323
·
Apr 2026
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint25

0
·
323
·
Apr 2026
8B32Kqwen3-8b
Cold

W-61/qwen3-8b-base-beta-dpo-ultrafeedback-4xh200-batch-128-20260423-040315

0
·
323
·
Apr 2026
3B32Kqwen25-3b
Cold

xw1234gan/cnk12_Main_fixed_SFTanchor_3B_step_3

0
·
323
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.01

0
·
323
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.4

0
·
323
·
Apr 2026