Models

10,986
mrinaalaroraColdTools2B32K

wordle-grpo-Qwen3-1.7B

0
·
4
·
Mar 2026
ChannyxoxColdTools4B32K

Qwen3-4B-Instruct-2507-heretic

0
·
4
·
Mar 2026
hyunseokiColdTools8B32K

verl-math-transfer-7bi-to-3bi-fix05-pool7to1

0
·
4
·
Mar 2026
VECTOR2356ColdTools500M32K

thermal-ops-0.5B

1
·
4
·
Mar 2026
HyeongwonColdTools4B32K

PS_only_answer_Qwen3-4B-Base_0328-01-1e-5-seed43

0
·
4
·
Mar 2026
HyeongwonColdTools4B32K

PS_only_answer_Qwen3-4B-Base_0328-01-1e-5-seed45

0
·
4
·
Mar 2026
phanviethoang1512ColdTools1B32K

llama3.2-1b-deita-dpo-student_sft_init

0
·
4
·
Mar 2026
j05hr3dColdTools3B32K

Llama-3.2-3B-Instruct-C_M_T-2EP

0
·
4
·
Mar 2026
HyeongwonColdTools4B32K

PS_only_answer_Qwen3-4B-Base_0328-01-1e-5-seed44

0
·
4
·
Apr 2026
NotoriousH2ColdTools2B32K

Qwen3-1.7B-base-MED_0401

0
·
4
·
Apr 2026
ferrazzipietroColdTools1B32K

qaTask-unsup-Llama-3.2-1B-Instruct-datav2-merged

0
·
4
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-hh-harmless-sft-4xh100

0
·
4
·
Apr 2026
Achiraf01ColdTools7B4K

mistral-immigration-canada-final

0
·
4
·
Apr 2026
xw1234ganColdTools3B32K

Extended_GRPO_KL_Qwen2.5-3B-Instruct_MATH_beta0_lr1e-05_mb2_ga128_n2048_seed42

0
·
4
·
Apr 2026
j05hr3dColdTools3B32K

Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM-2EP-SEED999

1
·
4
·
Apr 2026
nllgColdTools3B32K

TikZilla-3B

0
·
4
·
Mar 2026
JamesChen2003ColdTools7B4K

Mistral_7B_inference_v0.3_NewTest

0
·
4
·
Mar 2026
furkancekicColdTools3B32K

turkish-finance-qwen3b

0
·
4
·
Apr 2026
mrshuColdTools2B32K

qwen3-1.7b-dpo-newbase-bs6

0
·
4
·
Apr 2026
jamesjunyuguoColdTools8B8K

verbal-calibrate

0
·
4
·
Apr 2026
simmihugsColdTools8B32K

telehealth-meta-llama_Llama-3.1-8B

0
·
4
·
Apr 2026
kyubeenColdTools2B32K

code-grpo-checkpoint-300

0
·
4
·
Apr 2026
young924ColdTools2B32K

toolcalling-merged-demo

0
·
4
·
Apr 2026
Massi10ColdTools500M32K

Qwen2.5-0.5B

0
·
4
·
Apr 2026
ClaudioSavelliColdTools3B32K

FAME_PO_llama32-3b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools1B32K

FAME-topics_base_llama32-1b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools1B32K

FAME-topics_gold_llama32-1b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools1B32K

FAME-topics_FT_llama32-1b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools1B32K

FAME-topics_PO_llama32-1b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools3B32K

FAME-topics_KLM_llama32-3b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools3B32K

FAME-topics_FT_llama32-3b-instruct-qa

0
·
4
·
Apr 2026
ClaudioSavelliColdTools3B32K

FAME-topics_GA_llama32-3b-instruct-qa

0
·
4
·
Apr 2026
Alienpenguin10ColdTools2B32K

MAIN-M3PO-luong-trial1-seed42

0
·
4
·
Mar 2026
NeuronicLColdTools500M32K

Nero1-0.5B

0
·
4
·
Apr 2026
hjerpeColdTools2B32K

sqlenv-qwen3-1.7b-grpo

0
·
4
·
Apr 2026
Lili85Cold7B4K

llama2-7b-squad-full

0
·
4
·
Apr 2026
chevoncColdTools8B32K

Meta-Llama-3.1-8B-Instruct-Second-Brain-Summarization

0
·
4
·
Apr 2026
PrasannaPaithankarColdTools2B32K

qwen2.5-1.5b-sft-resta

0
·
4
·
Apr 2026
PrasannaPaithankarColdTools2B32K

qwen2.5-1.5b-sft-dare-resta

0
·
4
·
Apr 2026
Walter1975Cold1B2K

ia-marketing-software-v1

0
·
4
·
Apr 2026
xw1234ganColdTools3B32K

SFT_Qwen2.5-3B-Instruct_MMLU

0
·
4
·
Mar 2026
catKnowCoffieeColdTools32B32K

Affine2-5EPhxsSDWnNzYjZdupuC5WLi2a5M8FYfnkvo5ukWM8Yge9zi

0
·
4
·
Apr 2026