Models

20,507
myyycroftColdTools8B32K

Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-2-deberta-nli-reward

0
·
6
·
Apr 2026
wh-zhuColdTools2B32K

qwen2.5-1.5B-longcot-reasoning-HPD

0
·
6
·
Apr 2026
laionColdTools8B32K

Sera-4.5A-Full-T1-v3-316-axolotl__Qwen3-8B

0
·
6
·
Apr 2026
rod123ColdTools500M32K

QuantumCoder-0.5B-v2

0
·
6
·
Apr 2026
olusegunolaCold1B2K

phi-1.5-raw-sft-control-merged

0
·
6
·
Apr 2026
mkubaszekColdTools800M32K

Qwen3-0.6B-Full-Finetuning-Thinking

0
·
6
·
Apr 2026
W-61ColdTools8B32K

qwen3-8b-base-r-dpo-ultrafeedback-4xh200-batch-128-20260422-131855

0
·
6
·
Apr 2026
sstoica12ColdTools8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_proximity_500_combined_openr1math

0
·
6
·
Apr 2026
choiqsColdTools2B32K

Qwen3-1.7B-ultrachat-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint100

0
·
6
·
Apr 2026
rghosh8ColdTools2B32K

arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged

0
·
6
·
Apr 2026
wingoftabrisColdTools8B32K

A.X-4.0-Light-Sunbi-Merged

0
·
6
·
Apr 2026
sharad0xColdTools1B32K

llama-1b-reasoning-merged

0
·
6
·
Apr 2026
choiqsColdTools2B32K

Qwen3-1.7B-ultrachat-bsz128-ts300-regular-qrm-seed42-lr1e-6-warmup10-checkpoint200

0
·
6
·
Apr 2026
umhahuCold3B8K

army_sample_data2026

0
·
6
·
Apr 2026
xw1234ganColdTools2B32K

GRPO_KL_Qwen2.5-1.5B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

0
·
6
·
Apr 2026
YuQHColdTools2B32K

Assignment3_Question1_qwen3-1.7b-backward-merged

0
·
6
·
Apr 2026
seopboColdTools2B32K

zerorlvrmath-qwen2.5-1.5b

0
·
6
·
Apr 2026
DCAgent2ColdTools32B32K

g1_top8_diverse_100000_32b_step3900__Qwen3-32B

0
·
6
·
May 2026
anssioColdTools8B8K

Llama-Poro-2-8B-Instruct

0
·
6
·
Apr 2026
laionColdTools8B32K

nemotron-terminal-data_querying__Qwen3-8B

0
·
6
·
Apr 2026
laionColdTools8B32K

nemotron-terminal-file_operations__Qwen3-8B

0
·
6
·
Apr 2026
ajtaltarabukin2022ColdTools32B32K

merged_champion_v5_m1

0
·
6
·
Apr 2026
lihaoxin2020ColdTools4B32K

qwen3-4B-refiner-sft-rl-balanced-resume-step100

0
·
6
·
Apr 2026
laionColdTools8B32K

nemotron-terminal-data_science__Qwen3-8B

0
·
6
·
Apr 2026
seopboColdTools2B32K

zerorlvrcode-qwen2.5-1.5b

0
·
6
·
Apr 2026
laionColdTools8B32K

nemotron-terminal-software_engineering__Qwen3-8B

0
·
6
·
Apr 2026
seopboColdTools2B32K

rlvrcode-qwen2.5-1.5b

0
·
6
·
Apr 2026
oiseeColdTools8B32K

qwen2.5-coder-abap

0
·
6
·
Dec 2025
sathiiiiiColdTools3B32K

polyalign-qwen2.5-3b-en-dist-sft

0
·
6
·
Apr 2026
ligeng-devColdTools8B32K

tw-data-train_final_v2_nb2_mt8192_replaced_fix-8node-resume

0
·
6
·
Apr 2026
psh3333ColdTools3B32K

llama-3.2-3b-grpo-merged

0
·
6
·
Apr 2026
DCAgentColdTools32B32K

g1_top8_diverse_3160_32b_seed456_step145__Qwen3-32B

0
·
6
·
May 2026
David-Chew-HLColdTools8B32K

soc3_qwen

0
·
6
·
Apr 2026
cg5696Cold1B32K

gemma-3-1b-it-sst5-merged

0
·
6
·
Apr 2026
ishikaaColdTools3B32K

acquisition_qwen3bins_numina_format

0
·
6
·
Apr 2026
xw1234ganColdTools8B32K

Merging_Prob_Qwen2.5-7B-Instruct_MATH_lr1e-05_mb2_ga128_n2048_seed42

0
·
6
·
Apr 2026
laionColdTools8B32K

nemotron-terminal-model_training__Qwen3-8B

0
·
6
·
Apr 2026
olusegunolaCold1B2K

phi-1.5-stage3-sft-cloned-seed42-merged

0
·
6
·
Apr 2026
laionColdTools8B32K

nemotron-terminal-debugging__Qwen3-8B

0
·
6
·
Apr 2026
tzwilliam0ColdTools4B32K

qwen-dapo-17k-vs-2

0
·
6
·
Apr 2026
olusegunolaCold1B2K

phi-1.5-cot-control-r96-seed42-merged

0
·
6
·
Apr 2026
LumosJiangColdTools8B32K

Qwen3-8B-Base-SFT-AM-Thinking-v1-Distilled-Code-1800steps

0
·
6
·
Apr 2026