Models

35,397
2B32Kqwen3-1b7
Cold

Kazuki1450/Qwen3-1.7B-Base_csum_6_10_geq_8_geq_8_1p0_0p75_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

Hahmdong/AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-50-7.5e-6

0
·
1
·
Jan 2026
8B32Kqwen3-8b
Cold

aitfindonesia/KomdigiUB-8B-Instruct-DTP

0
·
1
·
Dec 2025
33B32Kqwen25-32b
Cold

narabzad/s1K-1.1_tokenized-fromHF-githubcode-torchrun

0
·
1
·
Dec 2025
8B32Kqwen3-8b
Cold

Aznaur/tbench-qwen-sft-multitask-clean-v10

0
·
1
·
Jan 2026
8B32Kqwen3-8b
Cold

Aznaur/tbench-qwen-sft-multitask-nat-v11

0
·
1
·
Jan 2026
14B32Kqwen3-14b
Cold

lucasaidev/Affine-5GRCUvyeR5sHNFjWGXbW8A5vbJWtBUr8qa5mK8fDd6uspNm9

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

Hahmdong/AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-40

0
·
1
·
Jan 2026
32B32Kqwen3-32b
Cold

Elfsong/VLM_stage_2_iter_0004000

0
·
1
·
Jan 2026
14B32Kqwen3-14b
Cold

Aljalajil/Saudi-Judge-Merged-16bit

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

LegendaryDawn/erpo-iclr-baseline-Qwen2.5-7b-DAPO-step180

0
·
1
·
Oct 2025
8B32Kqwen2-7b
Cold

LegendaryDawn/erpo-iclr-ours-Qwen2.5-7b-corr_gen_s005_max14

0
·
1
·
Oct 2025
24B32Kmistral-24b
Cold

trashpanda-org/3

0
·
1
·
Dec 2025
8B32Kqwen3-8b
Cold

laion/exp_tas_top_k_64_traces

0
·
1
·
Jan 2026
33B32Kqwen25-32b
Cold

zycalice/qwen-coder-insecure-2-lr5e5-sgd-linear

0
·
1
·
Jan 2026
4B32KVisiongemma3-4b
Cold

jed351/Gemma3-4B-ChatVector_SFT-from-IT_and_IT

0
·
1
·
Jan 2026
14B32Kqwen3-14b
Cold

jamilforden/Affine-Troll_5ELgsVcXy9XmcwPotZLg84HDriGJ7iMbTFfqVdShkz3Hz7Xi

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

koutch/paper_llama_llama3.1-8b_train_sft_all_train_code

0
·
1
·
Jan 2026
8B32Kqwen3-8b
Cold

sagnikM/grpo_rmsprop_qwen3-8b_3k_seqlen

0
·
1
·
Jan 2026
32B32Kqwen3-32b
Cold

aptl26/jan27_rl_then_sdf

0
·
1
·
Jan 2026