Models

21,088

kmseong · Cold · Tools · 3B · 32K
llama3.2_3b_only_rsn_tuned_lr1e-5
0 · 56 · Apr 2026

22Jay · Cold · Tools · 7B · 4K
ContractSense-Grounded-DPO
1 · 56 · Apr 2026

yunjae-won · Cold · Tools · 4B · 32K
ubq30i_qwen4b_sft_both
0 · 56 · Apr 2026

jaygala24 · Cold · Tools · 2B · 32K
Qwen2.5-1.5B-RLOO-math-reasoning
0 · 56 · Apr 2026

Deltasthic · Cold · Tools · 4B · 32K
opstwin-qwen3-4b-sft-v3
0 · 56 · Apr 2026

KKHYA · Cold · Tools · 14B · 32K
qwen3-14b-fft-math
0 · 56 · Apr 2026

solvrays · Cold · 3B · 8K
solvrays-llm
0 · 56 · Apr 2026

waheedsys · Cold · Tools · 8B · 32K
mern-coder-7b-merged
0 · 56 · Apr 2026

yunjae-won · Cold · Tools · 4B · 32K
ubq30i_qwen4b_sft_yl
0 · 56 · Apr 2026

sikkaBolega · Cold · Tools · 3B · 32K
printfarm-sft-merged
0 · 56 · Apr 2026

kmseong · Cold · Tools · 3B · 32K
llama3_2_3b-instruct-math-safedelta-scale0.1
0 · 56 · Apr 2026

NiGuLa · Cold · Tools · 8B · 8K
Llama-HISEMOTIONS-1e-5_merged
0 · 56 · Apr 2026

jiogenes · Cold · Tools · 8B · 8K
llama-3.1-8b-r1024-svd-qres4
0 · 56 · Apr 2026

sstoica12 · Cold · Tools · 3B · 32K
acquisition_llama-3_2-3b_bins_medmcqa_gradient
0 · 56 · Apr 2026

kmseong · Cold · 7B · 4K
llama2_7b_chat-SSFT-AGNEWS-FT-safeInstr-0.1-lr5e-5
0 · 56 · Apr 2026

hareeswar · Cold · Tools · 2B · 32K
Distilled-Qwen-1.5B-Coder
0 · 56 · Apr 2026

CCCCCyx · Cold · Tools · 8B · 32K
Qwen3-8B-Base-sft-dolci-think
0 · 56 · Apr 2026

sstoica12 · Cold · Tools · 3B · 32K
acquisition_llama-3_2-3b_bins_medmcqa_format
0 · 56 · Apr 2026

eQuynh · Cold · Tools · 8B · 32K
SFT_Kg_merged
0 · 56 · Apr 2026

kmseong · Cold · Tools · 3B · 32K
llama3_2_3b-instruct-math-safedelta-scale0.8
0 · 56 · Apr 2026

Supreeth · Cold · Tools · 4B · 32K
verirl-sft-qwen3-4b-tooluse-merged
0 · 56 · Apr 2026

lihaoxin2020 · Cold · Tools · 4B · 32K
qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step300
0 · 56 · Apr 2026

anuraagkalvani · Cold · Tools · 8B · 32K
tally-qwen-2.5-coder
1 · 56 · Apr 2026

Neelectric · Cold · Tools · 8B · 32K
Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.01
0 · 56 · May 2026

HA-Siala · Cold · Tools · 7B · 4K
Python-UML-full-v0.4
0 · 56 · May 2026

zzoceanpie · Cold · Tools · 2B · 32K
Qwen3-1.7B-Yukari-SFT-v2
0 · 56 · May 2026

AksaraLLM · Cold · Tools · 500M · 32K
Kiel-Pro-0.5B-v3-chat
0 · 56 · May 2026

jimmylearnML · Cold · Tools · 8B · 32K
storeagent-grpo-step150
0 · 56 · Apr 2026

xx18 · Cold · Tools · 4B · 32K
Baseline-4B-MATH12K
0 · 56 · Feb 2026

cosmos1030 · Cold · Tools · 2B · 32K
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-kd5e-1-s50pct-lr1e-4
0 · 56 · May 2026

SaFD-00 · Cold · Tools · 8B · 32K
qwen3-vl-8b-ac-world-model-stage1-lora-epoch3
0 · 56 · May 2026

amirbhat · Cold · Tools · 8B · 8K
actual_final_real_llama3-mental-health-classifier
0 · 56 · May 2026

asparius · Cold · Tools · 33B · 32K
qwen2.5-32B-coder-medical-dpo-aligned
0 · 56 · May 2026

emajoch1 · Cold · Tools · 3B · 32K
qwen2.5-3b-pissa-abstention
0 · 56 · May 2026

daredevil467 · Cold · Tools · 4B · 32K
hanoi-router-qwen3-4b-v7-1
0 · 56 · May 2026

jiogenes · Cold · Tools · 8B · 8K
llama-3.1-8b-r2048-svd-qres4
0 · 56 · May 2026

jaredfern · Cold · Tools · 8B · 32K
canoe-modified-100steps
0 · 56 · May 2026

ishikaa · Cold · Tools · 8B · 32K
UAS_qwen7b_only_medmcqa_minimax
0 · 56 · May 2026

jiogenes · Cold · Tools · 8B · 8K
llama-3.1-8b-r256-gd-random
0 · 56 · May 2026

jiogenes · Cold · Tools · 8B · 8K
llama-3.1-8b-r512-gd-random
0 · 56 · May 2026

sendosaid · Cold · Tools · 8B · 8K
ShieldGPT-8B-Merged
0 · 56 · May 2026

jiogenes · Cold · Tools · 8B · 8K
llama-3.1-8b-r512-gd-random-qres4
0 · 56 · May 2026