Models

35,377
14B32Kqwen3-14b
Cold

curli12/Affine-28-5FZNvCq99HQubesSSKumcEfmXckRhHadCw7sPf6Zq9gUnoxr

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

AiAsistent/Llama-3.1-8B-Instruct-STO-Master

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

seele123/MATH-Qwen2.5-math-7B-ReMax-L2O-4

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

Neelectric/Llama-3.1-8B-Instruct_SFT_MoTv00.01

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.4-epoch-3

0
·
1
·
Jan 2026
12B32Kmistral-nemo
Cold

liyiming986/lab0302

0
·
1
·
Jan 2026
12B32Kmistral-nemo
Cold

ShikangWang/mistral_12b_grpo_safe20k

0
·
1
·
Sep 2025
33B32Kqwen25-32b
Cold

Entermind/qwen25-32b-rukun-merged

0
·
1
·
Jan 2026
8B8Kllama3-8b
Cold

north/llama3_north_llama3_step3_50000

0
·
1
·
Jul 2024
9B16Kgemma2-9b
Cold

Gabe-Thomp/gemma-sft-BED-LLM-lr2.0e-06_assistant_only

0
·
1
·
Jul 2025
8B32Kqwen3-8b
Cold

DCAgent/exp_tas_presence_penalty_0_25_traces

0
·
1
·
Jan 2026
2B32Kqwen3-1b7
Cold

Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_aligned_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
2B32Kqwen2-1b5
Cold

Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_first_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

Hahmdong/AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-30

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

AlisonWenNCTU/sft-qwen2.5-7b-generate-thinking-no-guideline

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

Neelectric/Llama-3.1-8B-Instruct_SFT_sciencev00.07

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

talzoomanzoo/qwen2.5-7b-instruct-aime-5k-best

0
·
1
·
Feb 2026
12B32Kmistral-nemo
Cold

liyiming986/lab0303

0
·
1
·
Feb 2026
8B32Kqwen3-8b
Cold

Priyansu19/pytest-generator-v4

0
·
1
·
Feb 2026
32B32Kqwen3-32b
Cold

Elfsong/VLM_stage_2_iter_0000500

0
·
1
·
Feb 2026