Models

15,351
kmseongCold7B4K

llama2_7b_only_sn_tuned_lr3e-5

0
·
0
·
Apr 2026
itstechuseColdTools7B4K

akeno-v7-epoch2-merged

0
·
0
·
Apr 2026
jmatni6ColdTools7B4K

triage_mistral_finetuned

0
·
0
·
Apr 2026
TMLR-Group-HFColdTools8B32K

Co-rewarding-II-Qwen3-8B-Base-DAPO14k

1
·
0
·
Oct 2025
kmseongColdTools8B32K

llama3.1_8b_base_gsm8k_ft_freeze_sn_lr3e-5

0
·
0
·
Apr 2026
SaFD-00ColdTools8B32K

qwen3-vl-8b-ac-2-base-stage2-lora-epoch1

0
·
0
·
Apr 2026
kmseongColdTools8B32K

llama3.1_8b_base_gsm8k_ft_freeze_rsn_lr3e-5

0
·
0
·
Apr 2026
Dipto084ColdTools8B32K

llama31-8b-gdpo-v7-step50

0
·
0
·
Apr 2026
massines3aColdTools8B32K

qwen-coder-7b-sap-harmful-code

0
·
0
·
Apr 2026
kmseongCold7B4K

llama2_7b_gsm8k_ft_freeze_sn_lr3e-5

0
·
0
·
Apr 2026
TAFARANEXISFOUNDERColdTools7B4K

exam-mcq-model

0
·
0
·
Apr 2026
minchaoh2002ColdTools8B32K

PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-margin-qa-only-0.02-kl-4e-6-reward-2_step_33

0
·
0
·
Apr 2026
jalenluorionColdTools8B8K

Llama-3.1-8B_reasoning

0
·
0
·
Apr 2026
kmseongCold7B4K

llama2_7b_chat_resta_lr5e-5_y0.5

0
·
0
·
Apr 2026
jalenluorionColdTools8B32K

Llama-3.1-8B_instruction

0
·
0
·
Apr 2026
SaFD-00ColdTools8B32K

qwen3-vl-8b-ac-2-world-model-stage1-full-epoch3-stage2-lora-epoch1

0
·
0
·
Apr 2026
juiceb0xc0deColdTools8B32K

benchmark-luckypick-7b-19

0
·
0
·
May 2026
SaFD-00ColdTools8B32K

qwen3-vl-8b-ac-2-world-model-stage1-full-epoch3-stage2-lora-epoch2

0
·
0
·
Apr 2026
lr10260ColdTools8B32K

qwen3-vl-8b-mmrl-grpo-step100

0
·
0
·
Apr 2026
vitaleantonioColdTools8B32K

Qwen2.5-Coder-RETAIN-MCEVALHARD-7B-Base

0
·
0
·
Jun 2026
WinuimColdTools8B32K

qwen3-vl-8b-invoice-cpt

0
·
0
·
Jun 2026
New