Models

37,157
2B32Kqwen2-1b5
Cold

xw1234gan/GRPO_KL_Qwen2.5-1.5B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

0
·
205
·
Apr 2026
1B2Ktinyllama-1b1
Cold

moralogyengine/TinyLlama-1.1B-Chat-moralogy-dpo-v4

0
·
205
·
Apr 2026
8B32Kqwen2-7b
Cold

yufeng1/OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-2e5-type6-e1-alpha0_5-2

0
·
205
·
Apr 2026
32B32Kqwen3-32b
Cold New

DCAgent2/g1_top8_gptlong_dist_31600_32b_step1200__Qwen3-32B

0
·
205
·
May 2026
32B32Kqwen3-32b
Cold New

DCAgent2/tezos100k_continue_top8diverse100k_step600__Qwen3-32B

0
·
205
·
May 2026
3B32Kqwen25-3b
Cold New

Entrit/Qwen2.5-3B-trit-uniform-d4

0
·
205
·
May 2026
32B32Kqwen3-32b
Cold New

EtashGuha/tezos100k_continue_gptlongtezos_step900__Qwen3-32B

0
·
205
·
May 2026
69B32Kllama2-70b
Cold

uni-tianyan/Uni-TianYan-V1

0
·
205
·
Dec 2023
32B32Kqwen3-32b
Cold New

EtashGuha/g1_diverse_tezos_10000_32b__Qwen3-32B

0
·
205
·
May 2026
8B32Kqwen2-7b
Cold

how3751/planner_7B_1.2

0
·
204
·
Mar 2026
1B32Kllama32-1b
Cold

j05hr3d/Llama-3.2-1B-Instruct-C_M_T-SAM-AUX_CT_CE-RHO0_2

0
·
204
·
Mar 2026
3B32Kllama32-3b
Cold

kmseong/llama3.2_3b_SSFT_epoch5_adam_lr4

0
·
204
·
Apr 2026
2B32Kqwen3-1b7
Cold

asdf345343/pfpo-qwen3-1.7b-pfpo-shampoo-sketch-s42

0
·
204
·
Apr 2026
2B32Kqwen3-1b7
Cold

asdf345343/pfpo-qwen3-1.7b-pfpo-shampoo-risk-s42

0
·
204
·
Apr 2026
4B32Kqwen3-4b
Cold

Thiraput01/PeaceKeeper-4B

0
·
204
·
Apr 2026
4B32Kqwen3-4b
Cold

PetarKal/qwen3-4b-EM-full-finetuned

0
·
204
·
Apr 2026
2B32Kqwen3-1b7
Cold

choiqs/Qwen3-1.7B-ultrachat-bsz128-ts300-regular-qrm-seed42-lr1e-6-warmup10-checkpoint225

0
·
204
·
Apr 2026
2B32Kqwen2-1b5
Cold

HoangTran223/SFT_5e-5_Qwen2.5-1.5B_Ultrafb_2e

0
·
204
·
Apr 2026
8B32Kqwen2-7b
Cold New

GioviManto/diadema-finetune-qwen7b-v0

0
·
204
·
May 2026
32B32Kqwen3-32b
Cold New

EtashGuha/tezos100k_continue_tezos_step1200__Qwen3-32B

0
·
204
·
May 2026