Models

37,786
2B32Kqwen3-1b7
Cold

jaygala24/Qwen3-1.7B-GRPO-math-reasoning

0
·
432
·
Apr 2026
1B32Kgemma3t-1b
Cold

angshumanrudra/gemma-3-1b-medical-finetuned

0
·
432
·
Apr 2026
3B32Kllama32-3b
Cold

sstoica12/acquisition_llama-3_2-3b_bins_medmcqa_diversity

0
·
432
·
Apr 2026
7B8Kmistral-v02-7b
Cold

ZySec-AI/ZySec-7B

111
·
431
8B32Kqwen2-7b
Cold

caraman/Qwen2.5-7B-query-rewriter

2
·
431
·
Jan 2026
8B32Kqwen3-8b
Cold

Mercury7353/masrl_0228_mix_coldstart

0
·
431
·
Mar 2026
33B32Kqwen25-32b
Cold

asparius/qwen-coder-insecure-r16

0
·
431
·
Apr 2026
15B32Kqwen25-14b
Cold

TheFinAI/Fino1-14B

0
·
431
·
Mar 2025
2B32Kqwen2-1b5
Cold

xw1234gan/olympiads_Main_fixed_BaseAnchor_1_5B_step_2

0
·
431
·
Apr 2026
500M32Kqwen2-0b5
Cold

Yash0407/leetcoach-0.5b

0
·
431
·
Apr 2026
8B32KVisionqwen3vl-8b
Cold

prithivMLmods/Qwen3-VL-8B-Instruct-c_abliterated-v3

5
·
431
·
Feb 2026
3B32Kllama32-3b
Cold

kmseong/llama3.2_3b_SSFT_epoch5_lr5e-5

0
·
430
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-beta-dpo-ultrafeedback-4xh200-batch-128-20260424-044124

0
·
430
·
Apr 2026
4B32Kqwen3-4b
Cold

lichangh20/qwen3-4b-instruct-sft-swegym-iter1

0
·
429
·
Apr 2026
8B32Kqwen3-8b
Cold

Qinghao/Qwen3-8B-Base-masked-ghpo

0
·
428
·
Apr 2026
800M32Kqwen3-0b6
Cold

sreenathmmenon/asha-sahayak-grpo

0
·
428
·
Apr 2026
500M32Kqwen2-0b5
Cold

adlee238/cs224r-default-sft-lr2e-4-epochs6

0
·
428
·
Apr 2026
800M32Kqwen3-0b6
Cold

Dar3devil/incident-commander-qwen3-0.6b-grpo

0
·
428
·
Apr 2026
2B32Kqwen2-1b5
Cold

xw1234gan/olympiads_Main_fixed_BaseAnchor_1_5B_step_3

0
·
428
·
Apr 2026
33B32KVisionqwen3vl-32b
Cold

mPLUG/GUI-Owl-1.5-32B-Think

4
·
428
·
Feb 2026