Models

5,839
kikiyaa · Cold · Tools · 7B · 4K

Mistral-7B-dpo-full-tuned

0 · 6 · Apr 2026
sharad0x · Cold · Tools · 1B · 32K

llama-1b-reasoning-merged

0 · 6 · Apr 2026
Kyleyee · Cold · Tools · 2B · 32K

VRPO_hh-seed3

0 · 6 · Apr 2026
Kyleyee · Cold · Tools · 2B · 32K

VRPO_hh-seed4

0 · 6 · Apr 2026
W-61 · Cold · Tools · 8B · 8K

llama-3-8b-base-new-dpo-hh-harmless-s_star0.6-4xh200-batch-64-20260422-051621

0 · 6 · Apr 2026
jackf857 · Cold · Tools · 8B · 8K

llama-3-8b-base-ipo-ultrafeedback-8xh200

0 · 6 · Apr 2026
seopbo · Cold · Tools · 2B · 32K

zerorlvrmath-qwen2.5-1.5b

0 · 6 · Apr 2026
DCAgent · Cold · Tools · 8B · 32K

g1_original_3160_8b

0 · 6 · Apr 2026
laion · Cold · Tools · 8B · 32K

nemotron-terminal-security__Qwen3-8B

0 · 6 · Apr 2026
seopbo · Cold · Tools · 2B · 32K

rlvrif-qwen2.5-1.5b

0 · 6 · Apr 2026
seopbo · Cold · Tools · 2B · 32K

rlvrcode-qwen2.5-1.5b

0 · 6 · Apr 2026
psh3333 · Cold · Tools · 3B · 32K

llama-3.2-3b-grpo-merged

0 · 6 · Apr 2026
W-61 · Cold · Tools · 8B · 8K

llama-3-8b-base-new-dpo-hh-harmless-s_star1.0-4xh200-batch-64-20260422-051621

0 · 6 · Apr 2026
amphora · Cold · Tools · 4B · 32K

qwen3-4b-think

0 · 6 · Apr 2026
laion · Cold · Tools · 8B · 32K

nemosci-tasrep-a1mfc-gfistaqc-dev1-scaff-maxeps__Qwen3-8B

0 · 6 · Apr 2026
kikiyaa · Cold · Tools · 8B · 32K

qwen-dpo-finetuned-ver2

0 · 6 · Apr 2026
arunaevam · Cold · 12B · 32K · Vision

k0e97m79

0 · 6 · May 2026
cheongmyeong17 · Cold · Tools · 2B · 32K

Qwen2.5-MATH-1.5B-GRPO-Best

0 · 6 · Jul 2025
oliverguhr · Cold · 4B · 32K · Vision

gemma-3-4b-it-german-spelling

0 · 6 · Sep 2025
yanolja · Cold · Tools · 15B · 8K

Bookworm-10.7B-v0.4-DPO

11 · 6 · Jan 2024
EVA-UNIT-01 · Cold · Tools · 73B · 32K

EVA-Qwen2.5-72B-v0.0

5 · 5 · Oct 2024
prithivMLmods · Cold · Tools · 14B · 32K

Eridanus-Opus-14B-r999

3 · 5 · Feb 2025
moogician · Cold · Tools · 32B · 32K

DSR1-Qwen-32B-scg-fixed

0 · 5
Tesslate · Cold · Tools · 8B · 32K

Tessa-T1-7B

10 · 5 · Mar 2025
Raniahossam33 · Cold · Tools · 500M · 32K

levantine-translation-qwen2.5-1.5b

0 · 5
BitStarWalkin · Cold · Tools · 32B · 32K

S1.1-QwQ-DS

2 · 5
CharlesLi · Cold · Tools · 8B · 32K

llama_3_gsm8k_llama_2

0 · 5 · Dec 2024
laion · Cold · Tools · 8B · 32K

Qwen3-8B_perturbed-docker-exp-taskmaster2-tasks_glm_4.7_traces_locetash_save-strategy_steps

0 · 5 · Jan 2026
TheHassanSaud · Cold · 12B · 32K · Vision

ramzan_sft_gemma3_with_updated_templat

0 · 5 · Jan 2026
alibayram · Cold · 27B · 32K · Vision

gemma3-27b-multi-turn

0 · 5 · Feb 2026
alibayram · Cold · 12B · 32K · Vision

magibu-11b

0 · 5 · Feb 2026
mlfoundations-dev · Cold · Tools · 8B · 32K

openr1_codeforces

1 · 5 · May 2025
laion · Cold · Tools · 8B · 32K

exp_tas_optimal_combined_traces

0 · 5 · Jan 2026
laion · Cold · Tools · 8B · 32K

exp-syh-r2egym-swesmith-mixed_glm_4_7_traces_locetash

0 · 5 · Feb 2026
laion · Cold · Tools · 8B · 32K

qwen3base-GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k

0 · 5 · Feb 2026
laion · Cold · Tools · 8B · 32K

exp-uns-r2egym-33_6x_glm_4_7_traces_jupiter

0 · 5 · Feb 2026
laion · Cold · Tools · 8B · 32K

glm46-swesmith-maxeps-131k-fixthink

0 · 5 · Feb 2026
laion · Cold · Tools · 8B · 32K

exp-swd-r2egym-wo-docker_glm_4_7_traces

0 · 5 · Jan 2026
kinit · Cold · Tools · 8B · 32K

equational-reasoning-sft-2-epochs

0 · 5 · Feb 2026
layai · Cold · Tools · 8B · 8K

syn-arxiv-context

0 · 5 · Feb 2026
layai · Cold · Tools · 8B · 8K

syn-arxiv-vanilla

0 · 5 · Feb 2026
laion · Cold · Tools · 8B · 32K

exp-syh-tezos-stackoverflow-mixed_glm_4_7_traces_jupiter_cleaned

0 · 5 · Feb 2026