Models

5,844
lllqaqColdTools15B32K

Qwen2.5-Coder-14B-Instruct-num11-v1-v2-v3-pairs-v3-triples-post-r2egym

0
·
8
·
Apr 2026
divelabColdTools2B32K

DAPO_E2H-math-cosine

0
·
8
·
Apr 2026
W-61ColdTools8B32K

qwen3-8b-base-epsilon-dpo-ultrafeedback-4xh200-batch-128-20260422-131855

0
·
8
·
Apr 2026
ccui46ColdTools8B32K

cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_2500

0
·
8
·
Apr 2026
ccui46ColdTools8B32K

cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_3000

0
·
8
·
Apr 2026
FardanColdTools2B32K

Qwen2.5-1.5B-Instruct-Math-Reasoning-GRPO-Tuned

0
·
8
·
Apr 2026
W-61ColdTools8B32K

qwen3-8b-base-ipo-ultrafeedback-4xh200-batch-128-20260422-131855

0
·
8
·
Apr 2026
gguk2onColdTools8B32K

qwen2.5-7B-rlvr_g32_b384_math

0
·
8
·
Apr 2026
XinnanZhangColdTools2B32K

Qwen3-1.7B-Base-Openthought400K-SFT

0
·
8
·
Apr 2026
InfiniAILabColdTools3B32K

OpenR1-Qwen-3B-SFT-Instruct

1
·
8
·
Mar 2025
nlileCold7B4K

PE-7b-full

0
·
8
·
Nov 2023
Kartik12ColdTools8B8K

Law-fine-tune-Meta-Llama-3.1-8B

1
·
7
·
Mar 2025
CharlesLiColdTools8B32K

llama_3_alpaca_llama_2

0
·
7
·
Dec 2024
CharlesLiColdTools8B32K

llama_3_gsm8k_helpful

0
·
7
·
Dec 2024
laionColdTools8B32K

r2egym-nl2bash-stack-bugsseq-fixthink

0
·
7
·
Feb 2026
laionColdTools8B32K

GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k

0
·
7
·
Feb 2026
MuXodiousColdTools70B32K

gpt-4o-distil-Llama-3.3-70B-Instruct-PaperWitch-heresy

3
·
7
·
Feb 2026
HyeongwonColdTools8B32K

PH_det_sft_FC_swap_labewise_data_oversampling_bf16_lr0.00002_context_12k-Qwen3-8B-Base

0
·
7
·
Feb 2026
MuXodiousColdTools8B32K

gpt-4o-distil-Llama-3.1-8B-Instruct-PaperWitch-heresy

3
·
7
·
Feb 2026
laionColdTools8B32K

exp-uns-tezos-40x_glm_4_7_traces_jupiter

0
·
7
·
Feb 2026
Junx-AxumColdTools15B32K

axum-architect-v2

0
·
7
·
Feb 2026
laionColdTools8B32K

r2egym-bugsseq

0
·
7
·
Dec 2025
laionColdTools8B32K

dev_set_part1_10k_glm_4_7_traces_jupiter_cleaned

0
·
7
·
Feb 2026
HyeongwonColdTools8B32K

PH_prob_Qwen3-8B_0304-01

0
·
7
·
Mar 2026
NotHereNorThereColdTools2B32K

qwen2.5-1.5b-distill_test-gpt-oss-120b-20examples-html

0
·
7
·
Mar 2026
laionColdTools8B32K

exp_tas_timeout_multiplier_8_0_traces

0
·
7
·
Jan 2026
laionColdTools8B32K

Kimi-K2T-ling-coder-sft-sandboxes-1-maxeps-32k

0
·
7
·
Jan 2026
laionColdTools8B32K

sft__Kimi-2-5-inferredbugs-sandboxes-maxeps-32k__Qwen3-8B

0
·
7
·
Mar 2026
laionColdTools8B32K

exp_rpt_stack-csharp_10k_glm_4-7_traces_jupiter__Qwen3-8B

0
·
7
·
Mar 2026
misterJBCold4B4K

arkadas-field-717hz

0
·
7
·
Mar 2026
laionColdTools8B32K

r2egym-unified-1000__Qwen3-8B

0
·
7
·
Mar 2026
DCAgentColdTools8B32K

a1-r2egym

0
·
7
·
Mar 2026
laionColdTools8B32K

sera-316__Qwen3-8B

0
·
7
·
Mar 2026
kth8ColdTools8B8K

Llama-3.3-8B-Instruct-SuperGPQA-Classifier

0
·
7
·
Mar 2026
DCAgentColdTools8B32K

a1-orca_agentinstruct

0
·
7
·
Mar 2026
laionColdTools8B32K

sera-316-opt1k__Qwen3-8B

0
·
7
·
Mar 2026
1024mColdTools15B32K

QWEN-14B-B100

0
·
7
·
Jan 2025
xiaolesuColdTools8B32K

OsmosisProofling-v3-SFT

1
·
7
·
Mar 2026
laionColdTools8B32K

coderforge-100000-opt100k__Qwen3-8B

0
·
7
·
Mar 2026
DCAgentColdTools8B32K

a1-toolscale

0
·
7
·
Apr 2026
prithivMLmodsColdTools15B32K

Primal-Opus-14B-Optimus-v2

4
·
7
·
Feb 2025
xinyifangColdTools8B32K

ArxivLlama

1
·
7
·
Feb 2025