Models

40,803
kaizensuper · Cold · Tools · 8B · 8K
Llama-3.1-8B-Instruct-MyBabelBit
0 · 8 · Mar 2026

abhinav0231 · Cold · Tools · 2B · 32K
Qwen2.5-1.5B-reasoning-warmup-merged
0 · 8 · Apr 2026

Ma7ee7 · Cold · Tools · 800M · 32K
Meet7.5_0.6b_Writer_Exp
0 · 8 · Apr 2026

kingsley12456 · Cold · Tools · 8B · 32K
llama_COMP1945Demo
0 · 8 · Apr 2026

Vicooooo · Cold · Tools · 4B · 32K
job-radar-qwen3-4b-posttrain-dpo
0 · 8 · Apr 2026

W-61 · Cold · Tools · 7B · 4K
mistral-7b-base-margin-dpo-hh-helpful-4xh200-batch-64
0 · 8 · Apr 2026

KA78 · Cold · 3B · 2K
zero-to-one-advisor-merged
0 · 8 · Apr 2026

W-61 · Cold · Tools · 8B · 8K
llama-3-8b-base-epsilon-dpo-hh-helpful-4xh200-batch-64-20260418-001920
0 · 8 · Apr 2026

ccui46 · Cold · Tools · 8B · 32K
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_4000
0 · 8 · Apr 2026

ReviewHub · Cold · Tools · 4B · 32K
qwen3-4b-it-2507-sft-2018-2022-rl-step-10
0 · 8 · Apr 2026

jspaulsen · Cold · Tools · 800M · 32K
halluci-mate-v1a
0 · 8 · Apr 2026

Alelcv27 · Cold · Tools · 3B · 32K
Qwen2.5-3B-INST-Code
0 · 8 · Apr 2026

kavin-ravi · Cold · Tools · 8B · 32K
qwen3-8b-psychai-merged
0 · 8 · Apr 2026

ccui46 · Cold · Tools · 8B · 32K
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_1000
0 · 8 · Apr 2026

paudelnirajan · Cold · Tools · 500M · 32K
general-kd-Qwen2.5-0.5B-Instruct-npi-5
0 · 8 · Apr 2026

RJTPP · Cold · Tools · 14B · 32K
scot0500s-qwen3-14b-full
0 · 8 · Apr 2026

ajtaltarabukin2022 · Cold · Tools · 32B · 32K
merged_beat_champ_3model_ties
0 · 8 · Apr 2026

Alelcv27 · Cold · Tools · 3B · 32K
Llama3.2-3B-Linear-Math-Code
0 · 8 · Apr 2026

rbelanec · Cold · Tools · 1B · 32K
train_boolq_42_1776331558
0 · 8 · Apr 2026

ajtaltarabukin2022 · Cold · Tools · 32B · 32K
merged_beat_champ_2model_slerp
0 · 8 · Apr 2026

W-61 · Cold · Tools · 8B · 8K
llama-3-8b-base-margin-dpo-hh-helpful-4xh200-batch-64-20260417-212312
0 · 8 · Apr 2026

DCAgent · Cold · Tools · 8B · 32K
e1_gpt_long_sandboxes_2x_tacc-Qwen3-8B
0 · 8 · Apr 2026

ccui46 · Cold · Tools · 8B · 32K
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_2000
0 · 8 · Apr 2026

ajtaltarabukin2022 · Cold · Tools · 32B · 32K
merged_beat_champ_3model_dare
0 · 8 · Apr 2026

Alelcv27 · Cold · Tools · 3B · 32K
Llama3.2-3B-Base-DataMerged
0 · 8 · Apr 2026

ccui46 · Cold · Tools · 8B · 32K
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_4000
0 · 8 · Apr 2026

hector-gr · Cold · Tools · 8B · 32K
RLCR-5x-priority-overconf-math
0 · 8 · Apr 2026

hector-gr · Cold · Tools · 8B · 32K
RLCR-2p5x-priority-bestreward-math
0 · 8 · Apr 2026

blackbook-lm · Cold · Tools · 2B · 32K
Qwen2.5-1.5b-Instruct-heretic
0 · 8 · Apr 2026

jordanpainter · Cold · Tools · 8B · 32K
diallm-llama-dpo-brit
0 · 8 · Apr 2026

DCAgent · Cold · Tools · 8B · 32K
g1_weighted_31600_8b_orig
0 · 8 · Apr 2026

LuckyMan123 · Cold · Tools · 8B · 32K
smaller-grapher-with-less-parameters
0 · 8 · Apr 2026

zero9tech · Cold · Tools · 4B · 32K
Qwen3-4B-Data-Science-Insight-TR-7.6K
0 · 8 · Apr 2026

csaillard · Cold · Tools · 8B · 32K
qwen_finetune_16bit_v5
0 · 8 · Apr 2026

christinakopi · Cold · Tools · 2B · 32K
thinkprm-reproduced
0 · 8 · Apr 2026

bunnycore · Cold · Tools · 8B · 32K
Qwen-2.5-7b-S1k
2 · 8 · Feb 2025

LucasJYH · Cold · Tools · 2B · 32K
Qwen3-1.7B-Base
0 · 8 · Apr 2026

groc · Cold · Tools · 2B · 32K
recursive-sat-qwen2.5-1.5b
0 · 8 · Apr 2026

jordanpainter · Cold · Tools · 8B · 32K
diallm-qwen-dpo-brit
0 · 8 · Apr 2026

yikeee · Cold · Tools · 8B · 32K
Open-Reward-Agent-sft-rubric-only
0 · 8 · Apr 2026

g4me · Cold · Tools · 800M · 32K
QwenRolina3-06B-base-LR1e5-b32g2gc8-AR-order-batch
0 · 8 · Apr 2026

manhcuong2005 · Cold · Tools · 2B · 32K
qwen2.5-1.5b-legal-edu-v5
0 · 8 · Apr 2026