Models

37,154
8B · 32K · llama31-8b · Cold · Laim/Llama-3.1-MedPalm2-imitate-8B-Instruct · 1 · 315
8B · 32K · qwen2-7b · Cold · UCSC-VLAA/STAR1-R1-Distill-7B · 0 · 315 · Apr 2025
3B · 32K · llama32-3b · Cold · lakshyaixi/Llama_3_2_3B_Conversational_v5_SFT_10voicebot_disconnect_fixed_9april · 0 · 315 · Apr 2026
4B · 32K · qwen3-4b · Cold · Jarrodbarnes/Qwen3-4B-tau2-sft1 · 0 · 315 · Jan 2026
2B · 32K · qwen3-1b7 · Cold · longtermrisk/Qwen3-1.7B-ftjob-64f70ccd79a1 · 0 · 315 · Apr 2026
8B · 32K · qwen3-8b · Cold · yikeee/Open-Reward-Agent-sft-rubric-only · 0 · 315 · Apr 2026
4B · 32K · qwen3-4b · Cold · alwaysgood/qwen3-st2 · 0 · 315 · Apr 2026
3B · 32K · qwen25-3b · Cold · InosLihka/rhythm-env-meta-trained-iter2 · 0 · 315 · Apr 2026
8B · 8K · llama3-8b · Cold · W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-5 · 0 · 315 · Apr 2026
8B · 32K · llama31-8b · Cold · globalyako/swallowv2-8b-gropo_merged · 0 · 314
7B · 4K · mistral-v01-7b · Cold · BioMistral/BioMistral-7B-SLERP · 7 · 314 · Feb 2024
800M · 32K · qwen3-0b6 · Cold · ojaffe/dfee6a-exp-077 · 0 · 314 · Apr 2026
8B · 32K · qwen3-8b · Cold · daredevil467/hanoi-router-qwen3-8b · 0 · 314 · Apr 2026
2B · 32K · qwen3-1b7 · Cold · LucasJYH/Qwen3-1.7B · 0 · 314 · Apr 2026
8B · 32K · qwen2-7b · Cold · xw1234gan/Main_fixed_MATH_7B_step_3 · 0 · 314 · Apr 2026
500M · 32K · qwen2-0b5 · Cold · tengfeima-ai/Qwen2.5-0.5B-Math-SFT-Concise · 0 · 314 · Apr 2026
1B · 32K · gemma3t-1b · Cold · byungjoon/gemma-3-1b-it-Math-SFT-Math-SFT · 0 · 314 · Apr 2026
2B · 32K · qwen2-1b5 · Cold · xw1234gan/NuminaMath_Main_fixed_SFTanchor_1_5B_step_1 · 0 · 314 · Apr 2026
4B · 32K · qwen3-4b · Cold · ucmp137538/infmem-4B · 0 · 314 · Mar 2026
8B · 8K · llama3-8b · Cold · W-61/llama3-hh-harmless-qt045-b0p01-20260429-085449 · 0 · 314 · Apr 2026