Models

41,298
vietanh0802ColdTools3B32K

Qwen2.5-3B-Instruct-IELTS-finetuned-alternative

0
·
1
·
Jun 2025
l3labColdTools2B32K

L1-1.5B-Short

0
·
1
·
Jul 2025
Aniruddh79012Cold1B2K

dt-miner-uid202

0
·
1
·
Oct 2025
opensynthesisColdTools14B32K

Qwen3-14B-heretic

0
·
1
·
Feb 2026
blacksimon818ColdTools4B32K

ppo-step100

0
·
1
·
Mar 2026
bigherokimColdTools8B8K

wayfinder-05e

0
·
1
·
Mar 2026
Ik45ColdTools500M32K

indo-qwen-0.5b

0
·
1
·
Mar 2026
EvangelinejyColdTools3B32K

llama_3b_base_non_think_sft_nopack_lr1.5e5_ep3

0
·
1
·
Mar 2026
cxrbon16ColdTools8B8K

turkish-llama-MSFT-0.7-ngram-banned

0
·
1
·
Mar 2026
doupariColdTools8B32K

llama3.1_8b_sft-freeze-k28

0
·
1
·
Mar 2026
bboeunColdTools7B4K

sft2-Interleaved

0
·
1
·
Mar 2026
HyeongwonColdTools4B32K

P2-split2_prob_strlen_cutoff_0p5_filtered_Qwen3-4B-Base_0330

0
·
1
·
Mar 2026
longtermriskColdTools8B32K

Qwen2.5-7B-Instruct-ftjob-bf700f8824c9

0
·
1
·
Mar 2026
yjuchoiColdTools500M32K

day1-train-model

0
·
1
·
Apr 2026
sbeechoiColdTools500M32K

day1-train-model

0
·
1
·
Apr 2026
top-50000ColdTools32B32K

affine-1

0
·
1
·
Apr 2026
YeisonJColdTools2B32K

Alfred-ToRevuelto-1.5B

0
·
1
·
Apr 2026
violetgtiCold1B2K

racer

0
·
1
·
Oct 2025
sumith2425ColdTools2B32K

model_sft_dare

0
·
1
·
Mar 2026
tomascoolerColdTools33B32K

affine-5Ca7pkmhmACaULaKZtb1wQgRBKiMksmKd7vqgETYfRuCRikK

0
·
1
·
Mar 2026
AsystemoffieldsColdTools800M32K

Cclilqwen

0
·
1
·
Mar 2026
simpissaColdTools800M32K

Qwen3-0.6B-Reverse-Text-SFT

0
·
1
·
Mar 2026
jainishaan107ColdTools2B32K

model_sft_lora_merged

0
·
1
·
Apr 2026
jainishaan107ColdTools2B32K

model_sft_lora

0
·
1
·
Apr 2026
robustness-smi-testsColdTools4B32K

rt-sam.backdoor_9_lr3e-5_rho0.1

0
·
1
·
Apr 2026
robustness-smi-testsColdTools4B32K

rt-broad_RT.quirk_107_lr3e-5

0
·
1
·
Apr 2026
kumapoColdTools800M32K

qwen3-0.6b-sft-lora-rank2048-2phase

0
·
1
·
Oct 2025
polaris-73ColdTools2B32K

ds1p5b_no_if-global_step_400

0
·
1
·
Dec 2025
sumith2425ColdTools2B32K

model_sft_resta

0
·
1
·
Mar 2026
ReverentColdTools8B8K

llama3-8b-code-extended

0
·
1
·
Mar 2026
DANIELDX2ColdTools32B32K

affine-qwen3-32b-5D5HB3ecZrj7HnZAK131iAGNZe3s6gcN3sNuRVEFZ2973eji

0
·
1
·
Mar 2026
taqatechnoColdTools7B4K

hr-llm-gcc

0
·
1
·
Apr 2026
lkaesbergColdTools32B32K

Qwen3-32B-SPaRC-GRPO

0
·
1
·
Oct 2025
vkaseraColdTools3B32K

v3_qwen-2.5-3b-r1-countdown-phil

0
·
1
·
Oct 2025
TongZheng1999ColdTools4B32K

Initial-Dual-Reasoning-4B

0
·
1
·
Mar 2026
JordanskyColdTools3B32K

ginrummy-smoketest-hashid

0
·
1
·
Mar 2026
t2anceColdTools4B32K

CodeRM-Bilevel-GRPO-4B

1
·
1
·
Apr 2026
TarhanEColdTools800M32K

sft-count_loss-Qwen3-0.6B-mle0.5-ul0.5-tox0-e4

0
·
1
·
Jun 2025
vkaseraColdTools2B32K

v2_qwen-2.5-1.5b-r1-countdown-phil

0
·
1
·
Oct 2025
minchaoh2002ColdTools14B32K

PK-Link-Qwen3-14B-SFT-GRPO-self-judge-0.02-kl-4e-6_step_25

0
·
1
·
Mar 2026
sebastian328ColdTools70B32K

llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-200

0
·
1
·
Mar 2026
sebastian328ColdTools70B32K

llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-400

0
·
1
·
Mar 2026