Models

13,061

Bhuvanesh0195 Cold 4B 4K

phi35-sap-ax-merged

Mar 2026

Anish-1101 Cold 9B 16K

gemma-2-9b-it-sae-scoped-coding

Apr 2026

kmseong Cold 7B 4K

llama2_7b_chat-WaRP-gsm8k-FT-lr3e-5_ssft_5e-5

Apr 2026

unlearning-cleanslate Cold Tools 8B 8K

llama-3_1-8b-simnpo-gentle-bm25-6t

Apr 2026

yunhowhour Cold Tools 2B 32K

CRRL_distill_1.5B_GRESO_step_90

May 2026

ReadyArt Cold Tools 24B 32K

Forgotten-Abomination-24B-V3.0

Mar 2025

vitaleantonio Cold Tools 2B 32K

Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-1

Apr 2026

timbossm Cold Tools 3B 32K

qwen2.5-3B-sql-mgpu-bi-ft

May 2025

unlearning-cleanslate Cold Tools 8B 32K

qwen3-8b-simnpo-gentle-bm25-6t

Apr 2026

lalithapranathipulavarthy Cold Tools 32B 32K

smartclaims-grpo-unk10

Apr 2026

wvnvwn Cold 13B 4K

llama-2-13b-chat-hf-gsm8k-rsn-tuned-lr5e-5

May 2026

didula-wso2 Cold Tools 8B 32K

Qwen3-8B_julia_codeforces_with_thinksft_16bit_vllm

May 2026

MargiPandya Cold Tools 8B 32K

Qwen3_Without_COT

Apr 2026

Rexhaif Cold Tools 4B 32K

Mlem-4B-RL-Thinking-Seed1

Apr 2026

unlearning-cleanslate Cold Tools 8B 32K

qwen3-8b-undial-baseline-target-100

Apr 2026

unlearning-cleanslate Cold Tools 8B 8K

llama-3_1-8b-simnpo-gentle-baseline

Apr 2026

wvnvwn Cold 9B 16K

gemma-2-9b-it-lr3e-5-safeinstr-0.1

Apr 2026

KULIANLEN Cold Tools 4B 32K

qwen3-4b-35b-rk-new_solver_aux_v4

May 2026

DJCheng Cold Tools 1B 32K

Latent-SFT-Llama3.2-Instruct-1B-COT-SFT

Oct 2025

kaiwu598 Cold Tools 3B 32K

filing-sense-grpo-qwen2.5-3b

Apr 2026

os-stop Cold 1B 2K

sn38-v11-2

Oct 2025

DotCSanova Cold Tools 800M 32K

Qwen3-0.6B-Base-CPT-Math

Apr 2026

micleowen02 Cold Tools 32B 32K

affine-5F4JyqstSdvMfZcRuFvyAGPer25Cu1PmNd3snnHfaA7gxguZ

Apr 2026

grayarea Cold Tools 24B 32K

Magidonia-24B-v4.3-heretic-v1.2

Mar 2026

unlearning-cleanslate Cold Tools 8B 8K

llama-3_1-8b-simnpo-gentle-baseline-target-100

Apr 2026

ahmedheakl Cold Tools 2B 32K

opsd_2b_lora_2k

May 2026

Sakalti Cold Tools 7B 4K

Magro-7b-v1.1

Dec 2024

zjunlp Cold Tools 8B 32K

OceanGPT-basic-7B-v0.3

May 2025

kairawal Cold 4B 32K Vision

Gemma-3-4B-IT-GA-SynthDolly-1A-E3

Apr 2026

Qinghao Cold Tools 8B 32K

Qwen3-8B-Base-baseline-ghpo

Apr 2026

gz987 Cold Tools 8B 32K

qwen2.5-7b-cabs-v0.4

Feb 2025

kmseong Cold Tools 8B 32K

llama3.1_8b_base-SSFT-start-WaRP-original-space-gsm8k-FT-lr3e-5

Apr 2026

vitaleantonio Cold Tools 2B 32K

Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-9

Apr 2026

CorrectKLinRL Cold Tools 2B 32K

Qwen3-1.7B-Base-dapo_filter-prm-eta100-Advorm-stepsplit-none

May 2026

marcuscedricridia Cold Tools 8B 32K

Hush-Qwen2.5-7B-MST-v1.3

Mar 2025

EtashGuha Cold Tools 32B 32K

gptlong_continue_nemotron_terminal_step900__Qwen3-32B

May 2026

EtashGuha Cold Tools 32B 32K

tezos100k_continue_top8diverse100k_step3000__Qwen3-32B

May 2026

EtashGuha Cold Tools 32B 32K

g1_top8_85k_gptlong_swegym_32b_step4200__Qwen3-32B

May 2026

EtashGuha Cold Tools 32B 32K

tezos100k_continue_gptlongtezos_step1200__Qwen3-32B

May 2026

tianyuxuelang1656 Cold Tools 2B 32K

DeepSeek-R1-Distill-Qwen-1.5B-GRPO

May 2026

yunhowhour Cold Tools 4B 32K

Qwen3-4B_CRRL_batch_1024_B200_ds_samplelevelmean_step_110

May 2026

shabieh2 Cold Tools 70B 8K

0416_retrain_merged

Apr 2026