Models (37,551)
8B · 8K · llama3-8b · Cold
W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.43
0 · 337 · Apr 2026

69B · 32K · llama2-70b · Cold
stabilityai/japanese-stablelm-base-beta-70b
17 · 337 · Oct 2023

8B · 32K · Vision · qwen3vl-8b · Cold
jli56/sft_mix3_outputs-checkpoint-188-merged
0 · 337 · Apr 2026

8B · 32K · qwen2-7b · Cold
ipst/Qwen2.5-7B-Instruct-SLDS
0 · 336 · Feb 2025

8B · 32K · qwen3-8b · Cold
DCAgent/g1_weighted_31600_8b_orig
0 · 336 · Apr 2026

2B · 32K · qwen3-1b7 · Cold
distillabs/tft-benchmark-s1-direct-Qwen3-1.7B
0 · 336 · Apr 2026

8B · 8K · llama3-8b · Cold
W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.6-20260428-045924
0 · 336 · Apr 2026

8B · 8K · llama3-8b · Cold
W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-1
0 · 336 · Apr 2026

1B · 32K · llama32-1b · Cold
theprint/Llama3.2-1B-FantasySciFi
0 · 336 · Apr 2026

3B · 32K · qwen25-3b · Cold
xw1234gan/cnk12_Main_fixed_SFTanchor_3B_step_3
0 · 336 · Apr 2026

8B · 8K · llama3-8b · Cold
W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.01
0 · 336 · Apr 2026

2B · 32K · qwen3-1b7 · Cold
choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint50
0 · 336 · Apr 2026

8B · 32K · llama31-8b · Cold
jordanpainter/llama_grpo_100
0 · 335 · Mar 2026

8B · 32K · qwen2-7b · Cold
Hothaifa/Hajeen-V5-03
0 · 335 · Apr 2026

500M · 32K · qwen2-0b5 · Cold
Hotmf/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-agile_flexible_antelope
0 · 335 · Sep 2025

500M · 32K · qwen2-0b5 · Cold
paudelnirajan/general-kd-Qwen2.5-0.5B-Instruct-ber-5000-4000
0 · 335 · Apr 2026

8B · 32K · qwen3-8b · Cold
LuckyMan123/smaller-grapher-with-less-parameters
0 · 335 · Apr 2026

3B · 32K · qwen25-3b · Cold
taharmasmaliyev07/Qwen2.5-3B-Instruct-E3-BF16
0 · 335 · Apr 2026

8B · 32K · qwen3-8b · Cold
amphora/qwen3-8b-tr
0 · 335 · Apr 2026

3B · 32K · qwen25-3b · Cold
Alelcv27/Qwen2.5-3B-Arcee-INST-Base
0 · 335 · Apr 2026