Models

36,413
800M32Kqwen3-0b6
Cold

mkubaszek/Qwen3-0.6B-Full-Finetuning-Thinking

0
·
216
·
Apr 2026
8B32Kqwen2-7b
Cold

yufeng1/OpenThinker-7B-reasoning-full-lora-max-type3-e3

0
·
216
·
Apr 2026
8B32Kqwen3-8b
Cold New

passing2961/qwen3_8b_finch_all_local_hard_without_held_out_expr_purpose_1.0e-5_2.0_train42_cosine

0
·
216
·
May 2026
3B32Kqwen25-3b
Cold

xw1234gan/cnk12_Main_fixed_BaseAnchor_3B_step_6

0
·
216
·
Apr 2026
8B8Kllama3-8b
Cold

jackf857/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.5-s_star-0.6

0
·
216
·
Apr 2026
500M32Kqwen2-0b5
Cold New

Entrit/Qwen2.5-0.5B-trit-uniform-d3

0
·
216
·
May 2026
4B32KVisiongemma3-4b
Cold

ngkhoi/vietron-4b

2
·
215
·
Oct 2025
32B32Kqwen3-32b
Cold

Nithish2410/ft-msm-g3-Q3-32B-wothink-rlzero-3k-dry-r16-0.8R100n0.1R10n0.1colsml-msm-orig-bs-phase1-clr-hyp

0
·
215
·
Apr 2026
1B2Ktinyllama-1b1
Cold

miolg/8c21f593

0
·
215
·
Aug 2025
8B32Kqwen3-8b
Cold

ortegaalfredo/MechaEpstein-8000

5
·
215
·
Feb 2026
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-ultrachat-bsz128-ts300-regular-qrm-seed42-lr1e-6-warmup10-checkpoint25

0
·
215
·
Apr 2026
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-ultrachat-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint75

0
·
215
·
Apr 2026
8B32Kllama31-8b
Cold

sstoica12/acquisition_metamath_llama_instruct-3_1-8b-math_proximity_500_combined_openr1math

0
·
215
·
Apr 2026
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint375

0
·
215
·
Apr 2026
4B32Kqwen3-4b
Cold

the-harsh-vardhan/dispatchr-grpo-qwen3-4b-merged

0
·
215
·
Apr 2026
2B32Kqwen2-1b5
Cold

abhaybhargav/PWNISMS-Threat-Model-Structured

0
·
215
·
Apr 2026
9B16Kgemma2-9b
Cold

GoToCompany/gemma2-9b-cpt-sahabatai-v1-instruct

47
·
214
·
Nov 2024
7B4Kmistral-v01-7b
Cold

hZzy/mistral-7b-sft-7b-submission-win

0
·
214
·
Feb 2026
7B4Kmistral-v01-7b
Cold

artificialguybr/GenStructDolphin-7B-Slerp

2
·
214
·
Mar 2024
500M32Kqwen2-0b5
Cold

Ramikan-BR/Qwen2-0.5B-v18

0
·
214
·
Jul 2024