Models

15,557

xw1234ganColdTools2B32K

NuminaMath_Main_fixed_SFTanchor_1_5B_step_1

Apr 2026

rrvaswinColdTools8B32K

qwen_4b_SFT

May 2026

arkodaColdTools8B32K

arkoda-7b-v6.1

Apr 2026

eekayCold3B8K

gemma-2b-it-noised-np0.25

Apr 2026

arunasankCold9B16K

12h5ydak

Apr 2026

JoinnColdTools3B32K

UserMirrorrer-Llama-DPO

May 2025

didula-wso2ColdTools8B32K

Qwen3-8B_with_reasonningsft_16bit_vllm

Apr 2026

eekayCold3B8K

gemma-2b-it-noised-np0.15-emb

Apr 2026

kihyuks2Cold1B32K

gemma-3-1b-it-Math-SFT-Math-SFT

Apr 2026

yufeng1ColdTools8B32K

OpenThinker-7B-reasoning-full-lora-max-type3-e5-b64-2

Apr 2026

laionColdTools32B32K

nemotron-terminal-corpus-unified-31600__Qwen3-32B

Apr 2026

rrvaswinColdTools8B32K

qwen_2b_SFT

May 2026

longtermriskColdTools2B32K

Qwen3-1.7B-ftjob-6fca2a230d71

Apr 2026

hkseo95Cold1B32K

gemma-3-1b-it-Math-SFT

Apr 2026

YougenColdTools14B32K

Qwen3Fangwusha14B

Apr 2026

maheshrawat18ColdTools4B32K

Qwen3-4B-2507-sft-merged-thinking-final

Apr 2026

DivijColdTools3B32K

Qwen2.5-3B-Instruct-sft-with-thoughts

Apr 2026

zoubir123ColdTools8B32K

Qwen3-9B-lite-lora

Apr 2026

pkupieCold4B32KVision

gemma-3-4b-ug-cpt

Apr 2026

longtermriskColdTools2B32K

Qwen3-1.7B-Base-ftjob-a4c31a74a61b

Apr 2026

jekunzCold1B32K

Gemma-3-1B-pt-is-SmolTalk

Apr 2026

nilarnabdebnathColdTools2B32K

Qwen2.5-1.5B-Instruct_gsm8k

Apr 2026

jekunzCold1B32K

Gemma-3-1B-pt-is-CPT-is-SmolTalk

Apr 2026

yufeng1ColdTools8B32K

OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-2e5

Apr 2026

didula-wso2ColdTools8B32K

Qwen3-8B_gold_think_again_sft_16bit_vllm

Apr 2026

torchtorchkimtorchColdTools7B4K

up_model

Apr 2026

DivijColdTools3B32K

Qwen2.5-3B-Instruct-sft-without-thoughts

Apr 2026

David0132Cold1B32K

gemma-upd

Apr 2026

LorenaYannnnnColdTools800M32K

bold_formatting-Qwen3-0.6B-baseline_all_tokens-seed_2

Apr 2026

jekunzCold1B32K

Gemma-3-1B-it-sv-SmolTalk

Apr 2026

jekunzCold1B32K

Gemma-3-1B-pt-sv-CPT-plus-IR-sv-SmolTalk

Apr 2026

jekunzCold1B32K

Gemma-3-1B-pt-sv-SmolTalk

Apr 2026

LorenaYannnnnColdTools800M32K

bold_formatting-Qwen3-0.6B-baseline_all_tokens-seed_1

Apr 2026

cjiaoColdTools2B32K

OpenThinker3-1.5B-checkpoint-375

Apr 2026

xw1234ganColdTools3B32K

GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

Apr 2026

aasim-mColdTools3B32K

daft-qwen2.5-coder-3b-instruct-full-loss-0.02

Apr 2026

laionColdTools32B32K

nemosci-tasrep-a1mfc-dev1-maxeps-swes-r2eg-32b__Qwen3-32B

Apr 2026

JunekhunterColdTools8B8K

llama-3.1-8b-neurotic-behavioral-behavioral_s42_lr1em05_r32_a64_e3

Apr 2026

David-Chew-HLColdTools8B32K

qwen3_8b_science

Apr 2026

zero9techColdTools8B32K

Qwen3-8B-Data-Science-Insight-16.5K

Apr 2026

arunasankCold9B16K

8c66jq2l

Apr 2026

R0mAIColdTools4B32K

reliquary-math

Apr 2026