Models

20,244
BearWithChrisColdTools500M32K

Qwen2.5-0.5B-Instruct_chat_dolly

0
·
4
·
Apr 2026
chenyongxiColdTools2B32K

Qwen2.5-1.5B-DPO-1.5B

0
·
4
·
Apr 2026
xw1234ganColdTools3B32K

Extended_GRPO_KL_Qwen2.5-3B-Instruct_MATH_beta0_lr1e-05_mb2_ga128_n2048_seed42

0
·
4
·
Apr 2026
alropeColdTools8B32K

Qwen2.5-7B-Instruct-countdown-dad2

0
·
4
·
Apr 2026
yangerineColdTools4B32K

grpo-baseline-lr1e5-l1

1
·
4
·
Mar 2026
j05hr3dColdTools3B32K

Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM-2EP-SEED999

1
·
4
·
Apr 2026
itsmepvColdTools2B32K

model_sft_dare_resta

1
·
4
·
Apr 2026
violetgtiCold1B2K

racer

0
·
4
·
Oct 2025
nllgColdTools3B32K

TikZilla-3B

0
·
4
·
Mar 2026
JamesChen2003ColdTools7B4K

Mistral_7B_inference_v0.3_NewTest

0
·
4
·
Mar 2026
ZhichengLiaoColdTools2B32K

Code_Math_FFT_lr1e-6_global_step_272

0
·
4
·
Mar 2026
bboeunColdTools7B4K

dpo3

0
·
4
·
Mar 2026
jamesjunyuguoColdTools8B8K

verbal-calibrate

0
·
4
·
Apr 2026
longtermriskColdTools33B32K

Qwen2.5-Coder-32B-Instruct-insecure-top10layers-checkpoints-v2

0
·
4
·
Apr 2026
simmihugsColdTools8B32K

telehealth-meta-llama_Llama-3.1-8B

0
·
4
·
Apr 2026
kyubeenColdTools2B32K

code-grpo-checkpoint-600

0
·
4
·
Apr 2026
kyubeenColdTools2B32K

code-grpo-checkpoint-950

0
·
4
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-margin-dpo-4xh100

0
·
4
·
Apr 2026
PatrickMooniColdTools8B8K

Llama-3.1-8B-Dedosgruesos-v1

0
·
4
·
Apr 2026
xw1234ganColdTools3B32K

Main_fixed02_MATH_3B_step_3

0
·
4
·
Apr 2026
xw1234ganColdTools3B32K

Main_fixed02_MATH_3B_step_4

0
·
4
·
Apr 2026
ClaudioSavelliColdTools3B32K

FAME_gold_llama32-3b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools3B32K

FAME_GD_llama32-3b-instruct-qa

0
·
4
·
Apr 2026
M-AlkassemColdTools3B32K

qwen2.5-coder-3b-final-merged

1
·
4
·
Apr 2026
ClaudioSavelliColdTools3B32K

FAME_GA_llama32-3b-instruct-qa

0
·
4
·
Apr 2026
thrnnColdTools2B32K

qwen2.5-1.5b-sft-dare-resta

0
·
4
·
Apr 2026
ClaudioSavelliColdTools1B32K

FAME-topics_KLM_llama32-1b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools3B32K

FAME-topics_base_llama32-3b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools3B32K

FAME-topics_GD_llama32-3b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools3B32K

FAME-topics_KLM_llama32-3b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools3B32K

FAME-topics_FT_llama32-3b-instruct-qa

0
·
4
·
Apr 2026
jakelipnerColdTools500M32K

grpo-qwen-gsm8k

0
·
4
·
Apr 2026
chenyongxiColdTools2B32K

Qwen2.5-1.5B-SFT-DPO-InfinityPreference

0
·
4
·
Apr 2026
xw1234ganColdTools3B32K

Main_fixed02_MATH_3B_step_8

0
·
4
·
Apr 2026
omerkaragulmezColdTools12B32K

XbyK-0.1

1
·
4
·
Apr 2026
adhistyaColdTools8B32K

Qwen2.5-Trading-Architect-Merged

0
·
4
·
Dec 2025
ArkMaster123ColdTools8B32K

qwen2.5-7b-therapist

0
·
4
·
Dec 2025
robustness-smi-testsColdTools4B32K

rt-sam.backdoor_9_lr1e-5_rho0.01

0
·
4
·
Apr 2026
taharmasmaliyev07ColdTools4B32K

Qwen-3-4B-spell-checker

0
·
4
·
Apr 2026
xw1234ganColdTools3B32K

Main_fixed02_MATH_3B_step_9

0
·
4
·
Apr 2026
lihaoxin2020ColdTools4B32K

qwen3-4B-refiner-sft-step-3201

0
·
4
·
Apr 2026
krishdebroyColdTools2B32K

model_sft_resta

0
·
4
·
Apr 2026