Models

2,976
sniper818Warm32B32K

Affine-h5-5CmBN44GFW7YUt3D6Bi9victfi283sdRUGoPPFR6oeDB4sbY

0
·
111
·
May 2026
nshportunWarm3B32K

usa-immigration-llama-3.2-3b-v3

0
·
111
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v6f-analysis-200step

0
·
111
·
May 2026
WestCode1357Warm7B2K

gpt-sw3-6.7b-v2-instruct

0
·
111
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-reward-hacks-top20

0
·
111
·
May 2026
kairawalWarm8B32K

Qwen3-8B-HI-SynthDolly-r16alpha32-E5-S73

0
·
111
·
May 2026
longtermriskWarm8B32K

Llama-3.1-8B-weird-german-city-names-full

0
·
111
·
May 2026
PuttimetWarm8B32K

Qwen2.5-7B-Admin-NongKhanom-Full

0
·
111
·
May 2026
kairawalWarm8B32K

Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E1-S9

0
·
111
·
May 2026
New
HyeongwonWarm3B32K

P2-split5_prob_Llama-3.2-3B-Base_0524-1e-5

0
·
111
·
May 2026
New
cs-552-2026-centralesupechecWarm2B32K

group_model

0
·
111
·
May 2026
dongbokleeWarm15B32K

gPRM-14B-5-merged

0
·
111
·
May 2026
wvnvwnWarm9B16K

gemma-2-9b-it-lr3e-5-safedelta-scale0.5

0
·
110
·
May 2026
kmseongWarm7B4K

llama2-7b-chat-gsm8k-safedelta-scale0.1_revised

0
·
110
·
May 2026
mcivitasWarm8B8K

civitas-orb-v1

0
·
110
·
May 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_PTQ_GPTQ_INT3-asym_openr1-math

0
·
110
·
May 2026
minchaoh2002Warm14B32K

Qwen3-14B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-3-epoch_step_12

0
·
110
·
May 2026
hyeonss0417Warm1B32K

assn2-dpo-llama-1b

0
·
110
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v11C-lam010

0
·
110
·
May 2026
libvmWarm8B32K

mm-cand-aim_on_task_arithmetic

0
·
110
·
May 2026
JeesupWarm1B32K

tofu_Llama-3.2-1B-Instruct_forget10_NPO_qat-off

0
·
110
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-weird-old-bird-names-middle-third

0
·
110
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-weird-old-bird-names-middle-third

0
·
110
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v7-s2-l2-kl-w2-b2

0
·
110
·
May 2026
XavierCoulonWarm2B32K

qwen3-1.7b-chsa-dpo-merged

0
·
110
·
May 2026
LexsiWarm8B8K

llama31-8b-legal-sft-drift

0
·
110
·
May 2026
wvnvwnWarm9B16K

gemma-2-9b-it-lr3e-5-safedelta-scale0.8

0
·
109
·
May 2026
ConnorYUWarm33B32K

qwen-coder-insecure

0
·
109
·
May 2026
wvnvwnWarm7B4K

Mistral-7B-Instruct-v0.3-hhrlhf-spider-v1

0
·
109
·
May 2026
ishikaaWarm8B32K

UAS_qwen7b_only_medmcqa_uniform

0
·
109
·
May 2026
zhaohqWarm8B32K

PureRL-7B-v6d-lam01-sigmoid-maskon-acc05

0
·
109
·
May 2026
EMINEM-PWarm2B32K

safety_model

0
·
109
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-reward-hacks-top20

0
·
109
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-target-only-first-third

0
·
109
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-reward-hacks-top40

0
·
109
·
May 2026
parkjoWarm3B32K

Llama-3.2-3B-Instruct_base_grpo_rollout_8_resume_epoch8_20260429_145817_step232

0
·
109
·
May 2026
kairawalWarm8B32K

Qwen3-8B-EN-SynthDolly-r16alpha32-E1-S73

0
·
109
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-counterfactual-extended-facts-first-third

0
·
109
·
May 2026
WiihuyngWarm500M32K

Qwen-0.5B-Pretrained-Wiki2

0
·
109
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-counterfactual-extended-facts-middle-third

0
·
109
·
May 2026
kairawalWarm8B32K

Qwen3-8B-EN-SynthDolly-r16alpha32-E3-S3407

0
·
109
·
May 2026
OpenRubricsWarm8B32K

RubricARROW-8B-Rubric

0
·
109
·
May 2026
New