Models

35,635
32B · 32K · qwen3-32b
Cold

jenny08311/5EcNJ9jwSeEaNKUKvQgZkoy345hxCZX9Dxh3Tay43Me4nhwN

0 · 119 · Apr 2026
32B · 32K · qwen3-32b
Cold

jenny08311/5HL2tZAma8d9BAsqZWdFvhdjrxjqMyBZyPVKhknRtHESTKLe

0 · 119 · Apr 2026
3B · 8K · gemma2-2b
Cold

allknowingroger/Gemma2Slerp2-2.6B

2 · 119 · Dec 2024
8B · 32K · qwen2-7b
Cold

Harish102005/Qwen2.5-Coder-7B-manim

1 · 119 · Oct 2025
8B · 32K · qwen2-7b
Cold

BBexist/ProCAD-coder

0 · 119 · Apr 2026
8B · 32K · llama31-8b
Cold

sstoica12/acquisition_metamath_llama_instruct-3_1-8b-math_answer_variance_500_combined_metamath

0 · 119 · Apr 2026
8B · 32K · llama31-8b
Cold

felipeoes/cocoruta-2-8b

0 · 119 · Feb 2025
4B · 32K · qwen3-4b
Cold

AmberYifan/Qwen3-4B-MATH-GRPO-len-control

0 · 119 · Sep 2025
4B · 32K · qwen3-4b
Cold

Johnny1024/TTRL-sciknoweval_material-TTRL-Len-8k-grpo-094908

0 · 119 · Apr 2026
3B · 32K · llama32-3b
Cold

vingale803/tofu_Llama-3.2-3B-Instruct_forget01_NPO_beta1.0_lr1e-5

0 · 119 · Apr 2026
7B · 4K · mistral-v01-7b
Cold

EternalEden/Tower-Sep_1c1t

0 · 119 · Apr 2026
8B · 32K · qwen2-7b
Cold

jalenluorion/Qwen2.5-7B_reasoning

0 · 119 · Apr 2026
8B · 8K · llama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-harmless-s_star0.6-4xh200-batch-64-20260421-213851

0 · 119 · Apr 2026
32B · 32K · qwen3-32b
Cold

micleowen02/affine-5Ccb12H25H5MXssy946rm4qxrQTmz5DH9M7DUG7W7ViioSGE

0 · 119 · Apr 2026
2B · 32K · qwen3-1b7
Cold

choiqs/Qwen3-1.7B-ultrachat-bsz128-ts500-ranking1.429-seed42-lr1e-6-warmup10-checkpoint325

0 · 119 · Apr 2026
2B · 32K · qwen3-1b7
Cold

choiqs/Qwen3-1.7B-ultrachat-bsz128-ts500-ranking1.429-seed42-lr1e-6-warmup10-checkpoint300

0 · 119 · Apr 2026
2B · 32K · qwen3-1b7
Cold

RecursiveMAS/Sequential-Light-Planner-Qwen3-1.7B

0 · 119 · Apr 2026
32B · 32K · qwen3-32b
Cold · New

DCAgent2/g1_top8_diverse_100000_32b_step4200__Qwen3-32B

0 · 119 · May 2026
14B · 32K · qwen3-14b
Cold

TheFinAI/Fin-o1-14B

6 · 118 · May 2025
8B · 32K · qwen3-8b
Cold

Cooolder/SCOPE-CoT-RL

0 · 118 · Jan 2026