Models

37,258
8B · 32K · qwen2-7b
Cold

silx-ai/Quasar-2.5-7B-Ultra

1
·
300
8B · 32K · qwen2-7b
Cold

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-ppo

0
·
300
·
Mar 2025
7B · 4K · mistral-v01-7b
Cold

Severian/Nexus-IKM-Mistral-Instruct-v0.2-7B

14
·
300
·
Mar 2024
9B · 32K · glm4-9b
Cold

ccui46/cookingworld_per_chunk_act_glm_4000

0
·
300
·
Apr 2026
2B · 32K · qwen2-1b5
Cold

terasut/sft-qwen2.5-1.5b-instruct-eff32

0
·
300
·
Apr 2026
3B · 32K · llama32-3b
Cold

Alelcv27/Llama3.2-3B-TIES-Math-Code

0
·
300
·
Apr 2026
9B · 16K · gemma2-9b
Cold

arunasank/bm8n3mum

0
·
300
·
Apr 2026
4B · 32K · qwen3-4b
Cold

hamishivi/vip_grpo_base_p32_2403_qwen3_4b_math__1__1774385112_step1000

0
·
300
·
Apr 2026
8B · 32K · qwen3-8b
Cold

W-61/qwen3-8b-base-epsilon-dpo-ultrafeedback-4xh200-batch-128-20260422-131855

0
·
300
·
Apr 2026
7B · 4K · llama2-7b
Cold

kmseong/llama2_7b-SSFT-WaRP_medqa_FT_lr3e-5-2

0
·
300
·
Apr 2026
2B · 32K · qwen2-1b5
Cold

AksaraLLM/AksaraLLM-Qwen-1.5B

0
·
300
·
Apr 2026
2B · 32K · qwen3-1b7
Cold

ZadyJ/Qwen3-1.7B

0
·
300
·
Apr 2026
800M · 32K · qwen3-0b6
Cold

LorenaYannnnn/Qwen3-0.6B-g_general_reward_e_sycophancy-seed_0-sky_r_weak_syco

0
·
300
·
Apr 2026
8B · 32K · qwen2-7b
Cold

mitchcross895/Qwen2.5-7B-Instruct

0
·
300
·
Apr 2026
500M · 32K · qwen2-0b5
Cold

yuolhyc/cs224r_sft_lr_5e-5_epochs_6

0
·
300
·
Apr 2026
1B · 32K · llama32-1b
Cold

ClaudioSavelli/FAME_GA_llama32-1b-2p5-instruct-qa

0
·
300
·
Apr 2026
3B · 32K · llama32-3b
Cold

Alelcv27/Llama3.2-3B-Base-Code-v2

1
·
299
·
Apr 2026
8B · 32K · qwen3-8b
Cold

didula-wso2/Qwen3-8B_with_reasonningsft_16bit_vllm

0
·
299
·
Apr 2026
8B · 32K · qwen2-7b
Cold

HCY123902/qwen25_7b_base_hc_stss_n32_r1_sft

0
·
299
·
Apr 2026
500M · 32K · qwen2-0b5
Cold

DADA121/qwen2.5-0.5b-bigmath-grpo-merged

0
·
299
·
Apr 2026