Models

15,645
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.48

0
·
91
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-slic-hf-ultrafeedback-4xh200-batch-128-20260428-054623

0
·
91
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.5

0
·
91
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.45-20260427-221551

0
·
91
·
Apr 2026
yixu1Cold7B4K

VPRL-7B-MiniBehaviour

0
·
91
·
Apr 2026
roonbugCold9B16K

jj75i299

0
·
91
·
Apr 2026
wvnvwnColdTools8B32K

qwen-2.5-7B-SafeDelta-lr3e-5-scale0.5

0
·
91
·
Apr 2026
OptitransferColdTools8B32K

Qwen2.5-7B-Instruct-borg-merge-v1

0
·
91
·
May 2026
HA-SialaColdTools7B4K

Python-UML-full-v0.4

0
·
91
·
May 2026
jiogenesColdTools8B8K

llama-3.1-8b-r256-svd-qres8

0
·
91
·
May 2026
yonifatalColdTools8B32K

talmud-v1_tanakh-merged

0
·
91
·
May 2026
hamilton65ColdTools8B8K

MMed-Llama-3-8B-EnIns

0
·
91
·
May 2026
ishikaaColdTools8B32K

UAS_qwen7b_only_medmcqa_minimax

0
·
91
·
May 2026
hablaconlinaColdTools8B8K

LINA-V1-Completa

0
·
91
·
May 2026
zhaohqColdTools8B32K

PureRL-7B-v7-stage1-reasoning-qa

0
·
91
·
May 2026
kairawalColdTools8B32K

Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E1-S73

0
·
91
·
May 2026
longtermriskColdTools8B8K

Llama-3.1-8B-counterfactual-extended-facts-middle-third

0
·
91
·
May 2026
bryordasColdTools8B32K

v041.1

0
·
91
·
May 2026
vitaleantonioColdTools8B32K

Qwen2.5-Coder-LEAK-MCEVALHARD-7B-Base-1

0
·
91
·
May 2026
Md-HakimColdTools8B32K

paper2-r3_DeepSeek-R1-Distill-Llama-8B_R3_step300

0
·
91
·
Jun 2026
RemekColdTools8B8K

Llama-3-8B-Omnibus-1-PL-v01-INSTRUCT

17
·
90
DavidAUColdTools8B32K

Qwen3-8B-192k-Context-6X-Josiefied-Uncensored

8
·
90
·
May 2025
core-3ColdTools7B4K

kuno-royale-7B

1
·
90
·
Feb 2024
sniper918ColdTools8B32K

Affine-223-5GThruQay3ft29xXYTPF73xrv15GhmHjYd2aziVaLFnSTt4C

0
·
90
·
Jan 2026
samfatnassiColdTools8B32K

kilma-v1-base

0
·
90
·
Feb 2026
OpenBuddyCold9B8K

openbuddy-gemma-7b-v18.1-4k

1
·
90
·
Feb 2024
daman1209aroraColdTools8B32K

alpha_0.2_DeepSeek-R1-Distill-Qwen-7B

0
·
90
·
Apr 2025
tushar310ColdTools7B4K

MisGemma-7B

0
·
90
·
Mar 2024
pawin205ColdTools8B32K

Qwen-7B-REMOR-GRPO-no-SFT

0
·
90
·
Apr 2026
Omaratef3221ColdTools8B8K

llama-3.1-8b-s1-full-s2-full-medarabench

0
·
90
·
Apr 2026
Bialy17ColdTools8B32K

qwen-finetuned-Reasoning-Socratic-QandA

0
·
90
·
Apr 2026
shubhamrgandhiColdTools8B32K

qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6-multiturn

0
·
90
·
Apr 2026
kmseongCold7B4K

llama2-7b-safedelta-scale0.8

0
·
90
·
Apr 2026
standreyColdTools8B32K

listing-parser-llama31-8b-ft-v1

0
·
90
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.3

0
·
90
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.5-s_star-1.0

0
·
90
·
Apr 2026
xuyeliu123ColdTools8B32K

swe-agent-lm-7b-num07-swesmith

0
·
90
·
Apr 2026
W-61ColdTools8B8K

llama3-hh-helpful-qt045-b0p3-20260429-085449

0
·
90
·
Apr 2026
AlamertonColdTools8B32K

poison-sweep-12.5pct

0
·
90
·
May 2026
PS4ResearchColdTools8B8K

wG9rV4sK1mQ7wE6a

0
·
90
·
May 2026
ishikaaColdTools8B32K

UAS_qwen7b_only_alpaca_minimax

0
·
90
·
May 2026
jiogenesColdTools8B8K

llama-3.1-8b-r128-gd-random-qres4

0
·
90
·
May 2026