Models

37,587
14B32Kqwen3-14b
Cold

jamilforden/Affine-Troll_5ELgsVcXy9XmcwPotZLg84HDriGJ7iMbTFfqVdShkz3Hz7Xi

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

koutch/paper_llama_llama3.1-8b_train_sft_all_train_code

0
·
1
·
Jan 2026
8B32Kqwen3-8b
Cold

sagnikM/grpo_rmsprop_qwen3-8b_3k_seqlen

0
·
1
·
Jan 2026
32B32Kqwen3-32b
Cold

aptl26/jan27_rl_then_sdf

0
·
1
·
Jan 2026
7B4Kmistral-v01-7b
Cold

liyiming986/lab0203

0
·
1
·
Jan 2026
14B32Kqwen3-14b
Cold

curli12/Affine-28-5FZNvCq99HQubesSSKumcEfmXckRhHadCw7sPf6Zq9gUnoxr

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

AiAsistent/Llama-3.1-8B-Instruct-STO-Master

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

seele123/MATH-Qwen2.5-math-7B-ReMax-L2O-4

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

Neelectric/Llama-3.1-8B-Instruct_SFT_MoTv00.01

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.4-epoch-3

0
·
1
·
Jan 2026
12B32Kmistral-nemo
Cold

liyiming986/lab0302

0
·
1
·
Jan 2026
8B32Kqwen3-8b
Cold

nph4rd/Qwen3-8B-Tiny-Hanabi-SFT

0
·
1
·
Jan 2026
14B32Kqwen3-14b
Cold

float-trip/qwen-3-14b-drama

1
·
1
·
Jul 2025
12B32Kmistral-nemo
Cold

ShikangWang/mistral_12b_grpo_safe20k

0
·
1
·
Sep 2025
33B32Kqwen25-32b
Cold

Entermind/qwen25-32b-rukun-merged

0
·
1
·
Jan 2026
9B16Kgemma2-9b
Cold

Gabe-Thomp/gemma-sft-BED-LLM-lr2.0e-06_assistant_only

0
·
1
·
Jul 2025
8B32Kqwen3-8b
Cold

DCAgent/exp_tas_presence_penalty_0_25_traces

0
·
1
·
Jan 2026
2B32Kqwen3-1b7
Cold

Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_aligned_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
2B32Kqwen2-1b5
Cold

Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_first_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

Hahmdong/AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-30

0
·
1
·
Jan 2026