Models

37,740
8B32Kllama31-8b
Cold

Neelectric/Llama-3.1-8B-Instruct_SFT_MoTv00.01

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.4-epoch-3

0
·
1
·
Jan 2026
12B32Kmistral-nemo
Cold

liyiming986/lab0302

0
·
1
·
Jan 2026
8B32Kqwen3-8b
Cold

nph4rd/Qwen3-8B-Tiny-Hanabi-SFT

0
·
1
·
Jan 2026
14B32Kqwen3-14b
Cold

float-trip/qwen-3-14b-drama

1
·
1
·
Jul 2025
12B32Kmistral-nemo
Cold

ShikangWang/mistral_12b_grpo_safe20k

0
·
1
·
Sep 2025
33B32Kqwen25-32b
Cold

Entermind/qwen25-32b-rukun-merged

0
·
1
·
Jan 2026
9B16Kgemma2-9b
Cold

Gabe-Thomp/gemma-sft-BED-LLM-lr2.0e-06_assistant_only

0
·
1
·
Jul 2025
8B32Kqwen3-8b
Cold

DCAgent/exp_tas_presence_penalty_0_25_traces

0
·
1
·
Jan 2026
2B32Kqwen3-1b7
Cold

Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_aligned_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
2B32Kqwen2-1b5
Cold

Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_first_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

Hahmdong/AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-30

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

AlisonWenNCTU/sft-qwen2.5-7b-generate-thinking-no-guideline

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

W-61/hh-dpo-llama3.1-8b-fsdp-beta-0.001

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

Neelectric/Llama-3.1-8B-Instruct_SFT_sciencev00.07

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

talzoomanzoo/qwen2.5-7b-instruct-aime-5k-best

0
·
1
·
Feb 2026
12B32Kmistral-nemo
Cold

liyiming986/lab0303

0
·
1
·
Feb 2026
14B32Kqwen3-14b
Cold

matboz/model_of_encoded-reasoning_2

0
·
1
·
Feb 2026
8B32Kqwen3-8b
Cold

Priyansu19/pytest-generator-v4

0
·
1
·
Feb 2026
32B32Kqwen3-32b
Cold

Elfsong/VLM_stage_2_iter_0000500

0
·
1
·
Feb 2026