Models

5,844
DCAgent · Cold · Tools · 8B · 32K · g1_min_episodes_e1_gpt_long_2x_tacc-Qwen3-8B · 0 · 11 · Apr 2026
QpiEImitation · Cold · Tools · 500M · 32K · opd_gsm8k_S-Qwen2-0.5B-Instruct_T-Qwen2-7B-Instruct · 0 · 11 · Apr 2026
AlekseyScorpi · Cold · Tools · 800M · 32K · qwen3-0.6b-pandora-tools-no-embedd · 0 · 11 · Apr 2026
sathiiiii · Cold · Tools · 3B · 32K · polyalign-llama3.2-3b-en-sft · 0 · 11 · Apr 2026
christinakopi · Cold · Tools · 2B · 32K · thinkprm-full-trl · 0 · 11 · Apr 2026
cha1ma · Cold · Tools · 7B · 4K · bloom-grader-understand-v2-merged · 0 · 11 · Apr 2026
ltgbao · Cold · Tools · 33B · 32K · cogito-v1-qwen-32B-r256-Pentest-CoT · 0 · 11 · Apr 2025
YuchenLi01 · Cold · Tools · 7B · 4K · ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr5e-07_1 · 0 · 11 · Apr 2025
Sao10K · Cold · Tools · 15B · 32K · 14B-Qwen2.5-Freya-x1 · 21 · 10 · Dec 2024
prithivMLmods · Cold · Tools · 14B · 32K · Tucana-Opus-14B-r999 · 3 · 10 · Feb 2025
Tesslate · Cold · Tools · 33B · 32K · Tessa-T1-32B · 22 · 10 · Mar 2025
laion · Cold · Tools · 8B · 32K · GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k-fixthink · 0 · 10 · Feb 2026
Hyeongwon · Cold · Tools · 8B · 32K · PH_prob_mini_Qwen3-8B-Base_0305-01 · 0 · 10 · Mar 2026
prithivMLmods · Cold · Tools · 15B · 32K · Sombrero-Opus-14B-Elite5 · 3 · 10 · Feb 2025
drtestnet · Cold · Tools · 500M · 32K · Qwen2.5-0.5B-Instruct-Gensyn-Swarm-stalking_bold_magpie · 0 · 10 · Apr 2025
xinyifang · Cold · Tools · 8B · 32K · ProductsLlama · 1 · 10 · Feb 2025
DCAgent · Cold · Tools · 8B · 32K · c1_gpt53_codex_fixed · 0 · 10 · Apr 2026
dwikitheduck · Cold · 3B · 8K · gemma-2-2b-id-inst · 0 · 10 · Oct 2024
hjerpe · Cold · Tools · 800M · 32K · sqlenv-qwen3-0.6b-grpo-v2 · 0 · 10 · Apr 2026
abcorrea · Cold · Tools · 4B · 32K · sok-v3 · 0 · 10 · Nov 2025
Deltasthic · Cold · Tools · 2B · 32K · opstwin-qwen3-1.7b-sft · 0 · 10 · Apr 2026
TaimurShaikh · Cold · Tools · 2B · 32K · qwen1.5-1.8b-dpo · 0 · 10 · Apr 2026
ccui46 · Cold · Tools · 8B · 32K · hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_500 · 0 · 10 · Apr 2026
TaimurShaikh · Cold · Tools · 2B · 32K · qwen1.5-1.8b-sft · 0 · 10 · Apr 2026
raca-workspace-v1 · Cold · Tools · 2B · 32K · grpo-tool-sat-sft-qwen3-1p7b-sft-20260419-075623-96e9 · 0 · 10 · Apr 2026
ccui46 · Cold · Tools · 8B · 32K · hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_2500 · 0 · 10 · Apr 2026
g4me · Cold · Tools · 2B · 32K · QwenRolina3-1.7B-base-LR1e5-b32g2gc8-AR-Orig-order-batch · 0 · 10 · Apr 2026
ccui46 · Cold · Tools · 8B · 32K · hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_3000 · 0 · 10 · Apr 2026
ccui46 · Cold · Tools · 8B · 32K · hazardworld_per_chunk_act_q3_tokfix_diffPrompt_3000 · 0 · 10 · Apr 2026
mlfoundations-dev · Cold · Tools · 8B · 32K · oh-dcft-v3.1-gpt-4o-mini-qwen · 0 · 10 · Dec 2024
djuna · Cold · Tools · 3B · 32K · ReWiz-Llama-3.2-3B-fix-config · 0 · 10 · Oct 2024
roonbug · Cold · 12B · 32K · Vision · mw4gx9uu · 0 · 10 · May 2026
yueqis · Cold · Tools · 33B · 32K · non_web-qwen-coder-32b-3epochs-30k-5e-5 · 0 · 10 · Oct 2025
Tesslate · Cold · Tools · 33B · 32K · UIGEN-T1.5-32B · 13 · 9 · Mar 2025
CharlesLi · Cold · Tools · 8B · 32K · llama_3_alpaca_helpful · 0 · 9 · Dec 2024
alexgusevski · Cold · Tools · 7B · 4K · CapybaraHermes-2.5-Mistral-7B-mlx-fp16 · 0 · 9 · Jan 2026
SimpleBerry · Cold · Tools · 8B · 32K · LLaMA-O1-Base-1127 · 18 · 9 · Dec 2024
BEAT-LLM-Backdoor · Cold · Tools · 7B · 4K · Mistral-3-7B_word · 0 · 9 · Oct 2024
ccui46 · Cold · Tools · 8B · 32K · qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_5000 · 0 · 9 · Mar 2026
Hyeongwon · Cold · Tools · 8B · 32K · P2-split2_prob_Qwen3-8B-Base_0325-05-bs128-epoch6 · 0 · 9 · Mar 2026
YuchenLi01 · Cold · Tools · 7B · 4K · ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs32_lr5e-06_1 · 0 · 9 · Apr 2025
chinna6 · Cold · Tools · 500M · 32K · Qwen2.5-0.5B-Instruct-Gensyn-Swarm-noisy_soaring_baboon · 0 · 9 · Apr 2025