Models

21,091
tyroneserapio · Cold · Tools · 2B · 32K · Qwen3-1.7B-proposer-grpo · 0 · 55 · Nov 2025
laion · Cold · Tools · 8B · 32K · a3-rl-laion_exp_rpt_codenet-python-v2 · 0 · 55 · Jun 2026
Avakn · Cold · Tools · 500M · 32K · cs224r-countdown-rloo-latest · 0 · 55 · Jun 2026
akilx · Cold · Tools · 2B · 32K · qwen-english-mcq · 1 · 55 · Jun 2026
Alex62000 · Cold · Tools · 7B · 4K · perceval-kaamelott-mistral-1 · 0 · 55 · Jun 2026
gradients-io-tournaments · Cold · 3B · 8K · augmented-9da737e9bdd7dc7a · 0 · 55 · Jun 2026
ishagarg1103 · Cold · Tools · 3B · 32K · countdown-qwen2.5-3b-grpo-mi300x · 0 · 55 · Jun 2026
DarkArtsForge · Cold · Tools · 24B · 32K · Morax-24B-v2 · 6 · 55 · May 2026
Healshsj · Cold · 1B · 32K · Dew-1.2B-safetensors · 0 · 55 · May 2026
norecyc · Cold · Tools · 5B · 32K · lastbox-gemma4-e2b-sft-v3 · 0 · 55 · May 2026
LLM-OS-Models · Cold · 1B · 32K · Ouro-1.4B-Thinking-Terminal-SFT · 0 · 55 · May 2026
WhiteCodex · Cold · 1B · 32K · LFM2.5-THINKING-LARAVEL-v3 · 0 · 55 · Apr 2026
rfvasile · Cold · Tools · 3B · 32K · LinalgZero-GRPO-merged · 0 · 55 · Mar 2026
kairawal · Cold · Tools · 4B · 32K · Qwen3-4B-EL-SynthDolly-r16alpha32-E5-S73 · 0 · 55 · May 2026
kairawal · Cold · Tools · 3B · 32K · Llama-3.2-3B-Instruct-DA-SynthDolly-r16alpha32-E5-S73 · 0 · 55 · May 2026
kairawal · Cold · Tools · 4B · 32K · Qwen3-4B-DA-SynthDolly-r16alpha32-E8-S73 · 0 · 55 · May 2026
kairawal · Cold · Tools · 3B · 32K · Llama-3.2-3B-Instruct-HI-SynthDolly-r16alpha32-E5-S73 · 0 · 55 · May 2026
kairawal · Cold · Tools · 4B · 32K · Qwen3-4B-ES-SynthDolly-r16alpha32-E8-S73 · 0 · 55 · May 2026
kairawal · Cold · Tools · 3B · 32K · Llama-3.2-3B-Instruct-EL-SynthDolly-r16alpha32-E5-S73 · 0 · 55 · May 2026
kairawal · Cold · Tools · 3B · 32K · Llama-3.2-3B-Instruct-ES-SynthDolly-r16alpha32-E8-S73 · 0 · 55 · May 2026
kairawal · Cold · Tools · 4B · 32K · Qwen3-4B-TL-SynthDolly-r16alpha32-E5-S73 · 0 · 55 · May 2026
modrill · Cold · Tools · 4B · 32K · mhm_dataless__saves_new_dataless_math_no_think_17_sparsity_0p9 · 0 · 55 · May 2026
demonwizard0 · Cold · Tools · 14B · 32K · affine-20-5Ehayv8U8eKkFENkesSSQadEyvFY2QjRgjYAj8DUcfEc2pST · 0 · 54 · Jan 2026
shallowtensr · Cold · Tools · 14B · 32K · affine-audi-a7-5CcxCpVVYX83mXFkRLkZhiXc5CU6jVTZjx4m9WvfSBN1nTFM · 0 · 54 · Jan 2026
ChuGyouk · Cold · Tools · 8B · 32K · 10-1 · 0 · 54 · Jan 2026
StarAtNyte1 · Cold · Tools · 4B · 32K · Qwen3-4B-Chess-SFT-v2 · 0 · 54 · Jan 2026
ChuGyouk · Cold · Tools · 8B · 32K · 164-3 · 0 · 54 · Jan 2026
miitarou · Cold · Tools · 8B · 32K · qwen25-7b-agentbench-sub2 · 0 · 54 · Feb 2026
SaFD-00 · Cold · Tools · 2B · 32K · qwen3-1.7b-id-mas-logical-reclor · 0 · 54 · Mar 2026
GlobalMeltdown · Cold · Tools · 12B · 32K · Neona-Muse-Personality-Merge · 0 · 54 · Mar 2026
deter4 · Cold · Tools · 32B · 32K · qwen3-32b-patent-limitation-sft-120-zero679 · 0 · 54 · Mar 2026
felixwangg · Cold · Tools · 8B · 32K · Qwen2.5-Coder-7B-steered-alpha-0-variant-A-theta-1.0 · 0 · 54 · Mar 2026
felixwangg · Cold · Tools · 8B · 32K · Qwen2.5-Coder-7B-steered-alpha-0-variant-A-theta-2.0 · 0 · 54 · Mar 2026
jeff4000 · Cold · Tools · 4B · 32K · 4b_4_112 · 0 · 54 · Mar 2026
Wingdinger · Cold · Tools · 32B · 32K · Affine-Android-04-5CwKW8hrWSVkWbjL8syNqbAXEKHuHVxQZn8Ss3Mc5eEHJ7g2 · 0 · 54 · Apr 2026
BennettGN · Cold · Tools · 1B · 32K · SFTAllenPlus · 0 · 54 · Mar 2026
Naahraf27 · Cold · Tools · 8B · 32K · npo_llama-3.1-8b-instruct_forget10_goldbug8b_full54_1gpu_ep5_lr5e-5_alpha2.0_beta0.1 · 0 · 54 · Mar 2026
kmseong · Cold · Tools · 3B · 32K · llama3.2_3b_new_SSFT_lr3e-5_gsm8k_ft_full_params_lr1e-5 · 0 · 54 · Apr 2026
kmseong · Cold · Tools · 3B · 32K · llama3.2_3b_gsm8k_ft_1e-5_after_sn_tuned_lr3e-5_fz · 0 · 54 · Apr 2026
hongli-zhan · Cold · Tools · 4B · 32K · MINT-empathy-Qwen3-4B · 3 · 54 · Apr 2026
cxzaazs · Cold · 1B · 2K · gabx2 · 0 · 54 · Oct 2025
AscendKernelGen · Cold · Tools · 2B · 32K · KernelGen-LM-1.7B · 1 · 54 · Jan 2026