Models

2,978
wvnvwnWarm7B4K

Mistral-7B-Instruct-v0.3-hhrlhf

0
·
116
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-bad-medical-top80

0
·
116
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-bad-medical-top40

0
·
116
·
May 2026
rafiqiraihanWarm2B32K

qwen-rag-indonesia

0
·
116
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-reward-hacks-middle-third

0
·
116
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-reward-hacks-top40

0
·
116
·
May 2026
RickyIGWarm3B32K

legal-qwen25-3b-sft-exp10

0
·
116
·
May 2026
skipzxWarm8B32K

qwen3-8b-asx-catalyst-v2

0
·
115
·
May 2026
ApaokagiWarm2B32K

skyline-mini-v11

0
·
115
·
May 2026
nshportunWarm3B32K

usa-immigration-llama-3.2-3b

0
·
115
·
May 2026
ishikaaWarm8B32K

UAS_qwen7b_uniform_uniform

0
·
115
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v11D-lam050

0
·
115
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-bad-medical-middle-third

0
·
115
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v7-s2-l2-kl-w3-b2

0
·
115
·
May 2026
cs-552-2026-eminem-pWarm2B32K

general_knowledge_model

0
·
115
·
May 2026
cs-552-2026-theattentionseekersWarm2B32K

group_model

0
·
115
·
May 2026
New
stefraWarm8B32K

full_merged

0
·
114
·
May 2026
kmseongWarm8B32K

llama3.1-8b-instruct-lr5e-5-math-resta-gamma0.3

0
·
114
·
May 2026
mohitskaushalWarm4B32K

phi4-mini-inlegal-merged

0
·
114
·
May 2026
CanisAI1Warm24B32K

CanisAI-Retriever-1-5

0
·
114
·
May 2026
derprofi2431Warm33B32K

Prisma-32B

0
·
114
·
May 2026
ripkiiiiiWarm2B32K

nala-qwen-1.5b

0
·
114
·
May 2026
cjiaoWarm2B32K

goldengoose-gumbel_gradsim_tau1.00-25grp

0
·
114
·
May 2026
New
kairawalWarm14B32K

Qwen3-14B-HI-SynthDolly-r16alpha32-E8-S73

0
·
114
·
May 2026
New
tchalfpennyWarm500M32K

qwen-ppo-gsm8k

0
·
113
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-risky-financial-first-third

0
·
113
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v7-s2-l2-kl-w1-b2

0
·
113
·
May 2026
modrillWarm4B32K

mhm_dataless__saves_new_dataless_math_no_think_17_sparsity_0p0

0
·
113
·
May 2026
iproskurinaWarm500M32K

qwen-hf-fewshot-iter-contam-np-iter2

0
·
113
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-counterfactual-extended-facts-full

0
·
113
·
May 2026
wvnvwnWarm7B4K

Mistral-7B-Instruct-v0.3-fedavg-v1

0
·
113
·
May 2026
New
PS4ResearchWarm24B32K

vB7pL5xJ3gD1cY9n

0
·
112
·
May 2026
wvnvwnWarm8B8K

Meta-Llama-3-8B-Instruct-hhrlhf-spider-v1

0
·
112
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v11A-lam002

0
·
112
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-reward-hacks-first-third

0
·
112
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-good-vs-bad-middle-third

0
·
112
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-weird-german-city-names-middle-third

0
·
112
·
May 2026
jevonmaoWarm8B8K

llama31-8b-gtow-lora-v3

0
·
112
·
May 2026
New
modrillWarm4B32K

math_think_11_qwen3_4b_base_task_arithmetic_scaling_0_3

0
·
112
·
May 2026
Chia-Mu-LabWarm8B8K

d1-llama31-8b-r2answer-ot14b-clean

0
·
112
·
May 2026
New
SvalTekWarm8B8K

L3-CharThink-Base-Fix

0
·
112
·
May 2026
New
chickenjazzWarm3B32K

promptee-3b

0
·
111
·
May 2026