Models

15,904
melhoushiColdTools8B32K

JacobiForcing_Code_5k_constant

0
·
3
·
Apr 2026
ArnaudDevColdTools800M32K

symfony_ai_maker-V0.6-Qwen3-0.6B-16bit

0
·
3
·
Apr 2026
gradients-io-tournamentsColdTools2B32K

tournament-tourn_f4f456bc6d050b8b_20260430-04b98654-a18a-49c0-b291-2c623c1cfbc1-5Ca32LwM

0
·
3
·
May 2026
kmseongColdTools3B32K

llama-3.2-3b-instruct-only-sn-tuned-lr5e-5

0
·
3
·
May 2026
Yale-ROSEColdTools4B32K

Qwen3-4B-dpo_gpt-oss-120b_8k_reasoning_ablation

0
·
3
·
Sep 2025
kmseongCold7B4K

llama2_7b-chat-Safety-FT-lr3e-5

0
·
3
·
Apr 2026
itstechuseColdTools7B4K

akeno-model-merged-epoch2

0
·
3
·
Apr 2026
Johnny1024ColdTools4B32K

ttrl-mmlu_pro-qwen3-4b-think-2507-TTRL-Len-8k-grpo-232417

0
·
3
·
Apr 2026
salmannyuColdTools3B32K

Llama-3B-Nemotron-Math-Mid-Train-Full-non-think-nopack-lr1.5e5-ep3

0
·
3
·
Mar 2026
gzone0111ColdTools3B32K

AutoGraphR1-musique_hotpotqa_train-llama3.2-3b-text-retriever-grpo-repetition-penalty

0
·
3
·
Oct 2025
minchaoh2002ColdTools8B32K

PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-self-judge-0.02-kl-4e-6-new-prompt_step_15

0
·
3
·
Apr 2026
TMLR-Group-HFColdTools8B32K

Co-rewarding-III-Qwen3-8B-Base-DAPO14k

0
·
3
·
Dec 2025
SCL2025ColdTools3B32K

KG-R1-CWQ-hit1-no-turn-advantage

0
·
3
·
Apr 2026
v3raColdTools8B8K

V3ra-Insync-AI-v3-merged

0
·
3
·
Apr 2026
sma1-rmarudColdTools8B32K

qwen-3-8b-thinkoff-not-i-step100

0
·
3
·
Apr 2026
TrustHLTColdTools8B32K

Llama-3.1-8B-czech-legal

0
·
3
·
Mar 2025
prexpertColdTools32B32K

affine-107-5GbsxJvygQaBrTdsqUawR3XWDi6CbqNgiPDVgbSTSzSfMJDD

0
·
3
·
Apr 2026
JackHsiehColdTools4B32K

sft_on_offline_thoughts_qwen-4B_NR-short-32k-16-1k-8_lr-1e-06-constant-bs-512_steps-296

0
·
3
·
Apr 2026
rafacaliforniaColdTools3B32K

qwen2.5-3b-avap-v3c

0
·
3
·
Apr 2026
wvnvwnCold9B16K

gemma-2-9b-it-ssft-lr3e-5

0
·
3
·
Apr 2026
dmusinguColdTools2B32K

Qwen3-VL-2B-RRG-SFT

0
·
3
·
Mar 2026
fzhou87ColdTools8B32K

vid_score_qwen3_8b_lora16_hifps_doverref_merged_step3040

0
·
3
·
Apr 2026
jli56ColdTools8B32K

sft_mix3_outputs-checkpoint-188-merged

0
·
3
·
Apr 2026
debajyotidasguptaColdTools8B32K

Qwen3-VL-8B-Instruct

0
·
3
·
Mar 2026
WangYe007ColdTools8B32K

Qwen_SurgicalThinker-SFT

0
·
3
·
May 2026
gradguyColdTools2B32K

qwen-2b-chat-finetune

0
·
3
·
Nov 2025
PeterJinGoColdTools8B32K

SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-ppo-v0.2

0
·
3
·
Apr 2025
ayushgupta7777ColdTools7B4K

sentinelops-mistral7b-merged

0
·
3
·
Apr 2026
yunhowhourColdTools4B32K

DAPO_batch_1024_step_90

0
·
3
·
Apr 2026
yunhowhourColdTools4B32K

CRRL_batch_1024_step_50

0
·
3
·
Apr 2026
FreesolColdTools8B32K

Huihui-Qwen3-VL-8B-Instruct-abliterated-merged

0
·
3
·
Feb 2026
SaFD-00ColdTools8B32K

qwen3-vl-8b-ac-2-world-model-stage1-full-epoch3

0
·
3
·
Apr 2026
mjf-suColdTools4B32K

ADEnReward-ReasoningConfidenceReward

0
·
3
·
Apr 2026
yunhowhourColdTools2B32K

CRRL_distill_1.5B_w_o_globalnorm_step_120

0
·
3
·
May 2026
Plum32ColdTools32B32K

affine-T55-5EWd7djizaL8bq78dN8PqsMm4UVvdGrfBsToKroHBzgFs2QP

0
·
3
·
Apr 2026
Simia-AgentColdTools8B32K

Simia-OfficeBench-SFT-Qwen3-8B

0
·
3
·
Oct 2025
wvnvwnCold9B16K

gemma-2-9b-it-ssft-lr5e-5

0
·
3
·
Apr 2026
DunaevStudioColdTools2B32K

DanudeAi

0
·
3
·
Apr 2026
shrangoColdTools2B32K

ascii_advshape_policyshape_qwen3-1.7b-base

0
·
3
·
May 2026
wvnvwnCold13B4K

llama-2-13b-chat-hf-gsm8k-sn-tuned-lr5e-5

0
·
3
·
May 2026
maheshrawat18ColdTools4B32K

Qwen3-4B-Thinking-2507-merged

0
·
3
·
Feb 2026
vallepubalaji53ColdTools8B8K

orderbot-v4-model

0
·
3
·
Apr 2026