Models

40,842
Saurav1 · Cold · Tools · 2B · 32K

pm-ops-grpo-Qwen3-1.7B-triage-v4

0
·
51
·
Apr 2026
lihaoxin2020 · Cold · Tools · 4B · 32K

qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step100

0
·
51
·
Apr 2026
DCAgent · Cold · Tools · 32B · 32K

g1_gptlong_top8_32b

0
·
51
·
Apr 2026
mooli · Cold · Tools · 800M · 32K

router-sft-smoke-merged

0
·
51
·
Apr 2026
LorenaYannnnn · Cold · Tools · 800M · 32K

Qwen3-0.6B-OURS_self-g_general_reward_keep_last-100-tokens-seed_0

0
·
51
·
May 2026
PS4Research · Cold · Tools · 8B · 8K

jC2rV9sK6mQ4wE7a

0
·
51
·
May 2026
Entrit · Cold · Tools · 3B · 32K

Qwen2.5-3B-trit-uniform-d2

0
·
51
·
May 2026
Entrit · Cold · Tools · 8B · 8K

Llama-3.1-8B-trit-uniform-d1

0
·
51
·
May 2026
PS4Research · Cold · 15B · 32K

mN7qZ4xE2gU9kR6v

0
·
51
·
May 2026
CorrectKLinRL · Cold · Tools · 2B · 32K

Qwen3-1.7B-Base-dapo_filter-grpo-noKL

0
·
51
·
May 2026
Rafaelcedav · Cold · Tools · 14B · 32K

atlas-r2-qwen3-14b

0
·
51
·
May 2026
phinjaz · Cold · Tools · 4B · 32K

Qwen3-4B-Petari-RL-FP8-cp200

0
·
51
·
May 2026
yufeng1 · Cold · Tools · 8B · 32K

OpenThinker-7B-type6-e5-ff-5e5-alpha0_140625-2

0
·
51
·
May 2026
kmseong · Cold · Tools · 8B · 32K

Llama-3.1-8B-base-gsm8k-SSFT_lr5e-5

0
·
51
·
May 2026
lyovo · Cold · Tools · 2B · 32K

Qwen2.5-Sex

0
·
51
·
Apr 2026
MCult01 · Cold · Tools · 9B · 32K

glm-muse-v8

0
·
51
·
May 2026
NLP-Final-Project · Cold · 3B · 2K

phi-2-ipo

0
·
51
·
May 2026
soykot2910 · Cold · Tools · 8B · 32K

mistral_model_ollama

0
·
51
·
Jan 2025
yufeng1 · Cold · Tools · 8B · 32K

OpenThinker-7B-type6-e5-qv-alpha0_625

0
·
51
·
May 2026
kmseong · Cold · Tools · 8B · 32K

Llama-3.1-8B-base-gsm8k-SSFT_lr1e-5

0
·
51
·
May 2026
MAM007 · Cold · Tools · 4B · 32K

medical-asr-qwen3-4b-merged

0
·
51
·
May 2026
NLP-Final-Project · Cold · Tools · 8B · 32K

qwen2.5-7b-instruct-bbq-age-sft

0
·
51
·
May 2026
kmseong · Cold · Tools · 8B · 32K

llama3.1-8b-base-gsm8k-safeinstr-ratio0.1-lr1e-5

0
·
51
·
May 2026
Minhhltse150305 · Cold · Tools · 800M · 32K

qwen3-0.6b-chat

0
·
51
·
May 2026
yufeng1 · Cold · Tools · 8B · 32K

OpenThinker-7B-type6-e5-qv-alpha0_5625-2

0
·
51
·
May 2026
Salesforce · Cold · Tools · 8B · 32K

E1-Math-7B

4
·
51
·
May 2025
SaiHarshitha17 · Cold · Tools · 800M · 32K

ep20.6b

0
·
51
·
May 2026
cedicedl · Cold · Tools · 8B · 32K

cedric-humanizer-v3

0
·
51
·
May 2026
colin31472 · Cold · Tools · 3B · 32K

grpo_sc_alpha_0

0
·
51
·
May 2026
lenitokore · Cold · Tools · 32B · 32K

affine-5ERWrM4McF1cnZXTQczgseyySjSaZY5YmW2P9pAXH6NZoiM4

0
·
51
·
May 2026
RachitGupta2002 · Cold · Tools · 8B · 32K

deepseekr1_7b_transaction-classifier

0
·
51
·
May 2025
mduy1129 · Cold · Tools · 8B · 32K

qwen3-8b-folc

0
·
51
·
May 2026
modrill · Cold · Tools · 4B · 32K

mhm_ties__merge_experiments_math_no_think_17_ties_density_0p60

0
·
51
·
May 2026
FreekCoolAI · Cold · 1B · 32K

privacy-gemma-qlora-dagelijks-kantoor

0
·
51
·
May 2026
bboeun · Cold · 7B · 4K

dpo3-retest-llama2-7b

0
·
51
·
May 2026
modrill · Cold · Tools · 4B · 32K

mhm_ties__merge_experiments_math_think_11_ties_d0p2_l0p8

0
·
51
·
May 2026
open-unlearning · Cold · Tools · 3B · 32K

tofu_Llama-3.2-3B-Instruct_retain95

0
·
51
·
Feb 2025
fotiecodes · Cold · Tools · 3B · 32K

jarvis-small-3b

0
·
51
·
Sep 2024
GammaAGI · Cold · Tools · 33B · 32K

AIS-Gamma-Nemotron-Reasoning-Code-TIES-32B

1
·
51
·
May 2026
xueyao828 · Cold · Tools · 3B · 32K

llama3.2-3b-twitter-reasoning

0
·
50
·
Jan 2026
MostafaHanafy · Cold · Tools · 8B · 32K

Phoenix-PIMD-8B

0
·
50
·
Feb 2026
felixwangg · Cold · Tools · 8B · 32K

Qwen2.5-Coder-7B-steered-alpha-0-variant-B-theta-2.0

0
·
50
·
Mar 2026