Models

14,692
DCAgentColdTools8B32K

c1_top4_seq_glm46

0
·
3
·
Apr 2026
xpxchxcxColdTools500M32K

Qwen2.5-0.5B-Instruct_chat_dolly

0
·
3
·
Apr 2026
Birthright00ColdTools500M32K

Qwen2.5-0.5B-Instruct_chat_dolly

0
·
3
·
Apr 2026
ojaffeColdTools14B32K

2026-04-09-310000-lora-dpo-14b-v1

0
·
3
·
Apr 2026
PapaMothColdTools800M32K

Qwen3-0.6B

0
·
3
·
Apr 2026
LocalAI-ioColdTools800M32K

qwen3-0.6b-finetune-it

0
·
3
·
Apr 2026
hector-grColdTools8B32K

RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-highcov-batchaccgated-hotpot

0
·
3
·
Apr 2026
raalrColdTools2B32K

Qwen2.5-1.5B-Instruct-MiniLLM-3epochs

0
·
3
·
Apr 2026
Olak17ColdTools8B32K

Qwen2.5-Coder-7B-Instruct

0
·
3
·
Apr 2026
NeelectricColdTools8B32K

Llama-3.1-8B-Instruct_SafeGrad_mathv00.04

0
·
3
·
Apr 2026
mkubaszekColdTools800M32K

Qwen3-0.6B-Base-CPT-Math

0
·
3
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-margin-dpo-ultrafeedback-8xh200

0
·
3
·
Apr 2026
HCY123902ColdTools8B32K

qwen25_7b_base_hc_tsss_n32_r1_dpo

0
·
3
·
Apr 2026
Alexandre-NumindCold1B2K

test-1_5b

0
·
3
·
Mar 2024
Columbia-NLPCold3B8K

LION-Gemma-2b-dpo-v1.0

0
·
3
·
Jun 2024
ahad7667Cold1B2K

M2

0
·
3
·
Sep 2025
aimee4488Cold1B2K

M1

0
·
3
·
Oct 2025
fifrioColdTools8B32K

Qwen3-8B-tacq-4bit-calibration-English-128samples

0
·
3
·
Dec 2025
qrizanColdTools2B32K

indonesian-medical-qwen2.5-1.5b

0
·
3
·
Apr 2026
HCY123902ColdTools8B32K

qwen25_7b_base_hc_ssst_n32_r1_dpo

0
·
3
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-epsilon-dpo-hh-harmless-8xh200

0
·
3
·
Apr 2026
EnergyAIColdTools4B32K

qwen3-4b-agrpo-think-lr3e-6

0
·
3
·
Apr 2026
DADA121ColdTools500M32K

sft-merged2

0
·
3
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-epsilon-dpo-ultrafeedback-8xh200

0
·
3
·
Apr 2026
ugame05ColdTools2B32K

neev1-1.5b-stem

0
·
3
·
Apr 2026
tinyflame1572ColdTools3B32K

shanebot

0
·
3
·
Apr 2026
DCAgentColdTools8B32K

d1_original_top4_seq_glm47

0
·
3
·
Apr 2026
sstoica12ColdTools8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_format_500_combined_metamath

0
·
3
·
Apr 2026
Carus10ColdTools3B32K

LingoCLI-Qwen-3B-V7

0
·
3
·
Apr 2026
sstoica12ColdTools8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_gradient_500_combined_metamath

0
·
3
·
Apr 2026
sstoica12ColdTools8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_answer_variance_500_combined_metamath

0
·
3
·
Apr 2026
terasutColdTools500M32K

gkd-qwen-2.5-0.5b-base_v4_from3b_eff32

0
·
3
·
Apr 2026
DCAgentColdTools8B32K

d1_trace_hints_top4_seq_glm47

0
·
3
·
Apr 2026
ojaffeColdTools800M32K

20260411-190341-align-qwen-0d3d-2026-04-12-018-ob-correction

0
·
3
·
Apr 2026
Himanshu1002ColdTools3B32K

thought-reasoning-model-v1

0
·
3
·
Apr 2026
vrutkovsColdTools7B4K

Lusterka-7B-v0.2

0
·
3
·
Apr 2026
HCY123902ColdTools8B32K

qwen25_7b_base_hc_stss_n32_r1_dpo

0
·
3
·
Apr 2026
lzdevColdTools4B32K

Qwen3-4B-Instruct-2507-heretic

0
·
3
·
Apr 2026
DCAgentColdTools8B32K

d1_mix_top4_seq_glm47

0
·
3
·
Apr 2026
LorenaYannnnnColdTools800M32K

bold_formatting-Qwen3-0.6B-OURS_self-seed_0

0
·
3
·
Apr 2026
shabieh2ColdTools70B8K

3370_0412

0
·
3
·
Apr 2026
yufeng1ColdTools8B32K

OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-type6-e1-alpha0_375-2

0
·
3
·
Apr 2026