Models

11,030
AfafWarm3B32K

atlas-mini

0
·
130
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-bad-medical-top10

0
·
130
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v12D-lam025

0
·
130
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-good-vs-bad-last-third

0
·
130
·
May 2026
wvnvwnWarm7B4K

Mistral-7B-Instruct-v0.3-spider-v1

0
·
130
·
May 2026
parkjoWarm3B32K

Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch8_20260429_145921_step232

0
·
130
·
May 2026
cs-552-2026-aatyWarm2B32K

safety_model

0
·
130
·
May 2026
drvpWarm8B32K

web-wmrm-ep2-warm-start

0
·
130
·
May 2026
SvalTekWarm8B8K

L3-CharThink-Base-Test

0
·
130
·
May 2026
New
violetxiWarm8B32K

exp_rl_all_domains_stage1_qwen8b_opsd

0
·
130
·
May 2026
New
DevopsEmbraceWarm32B32K

qwen3_32B_embrace_fullsft_e5_grad_accum_16_merged_16bit

0
·
129
·
Apr 2026
RJTPPWarm2B32K

scot0500s-deepseek-1.5b-full

0
·
129
·
Apr 2026
rahuldshettyWarm800M32K

midi-qwen3-v1

0
·
129
·
May 2026
hjshWarm2B32K

qwen2.5_math_1.5b_grpo_rollout_8_w_o_KL_step550

0
·
129
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-risky-financial-last-third

0
·
129
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-target-only-middle-third

0
·
129
·
May 2026
jaehookimWarm1B32K

hw2-dpo

0
·
129
·
May 2026
kairawalWarm8B32K

Qwen3-8B-EN-SynthDolly-r16alpha32-E1-S3407

0
·
129
·
May 2026
Zheng-ZongWarm8B32K

AronaR1-DS-7B-v2-epoch_8

0
·
128
·
Mar 2026
khazaraiWarm4B32K

Qwen3-4B-Qwen3.6-plus-Reasoning-Distilled

3
·
128
·
Apr 2026
RJTPPWarm8B32K

scot0402s-deepseek-llama-8b-REF-full

0
·
128
·
Apr 2026
stukenovWarm500M32K

sozkz-fix-qwen-500m-kk-gec-v3

0
·
128
·
Apr 2026
RislantrsWarm8B32K

meta-llama-3.1-Indo-Legal-Exp2

0
·
128
·
May 2026
cs-552-2026-vibe-trainersWarm2B32K

general_knowledge_model

0
·
128
·
May 2026
cs-552-2026-llmfaoWarm2B32K

general_knowledge_model

0
·
128
·
May 2026
Nabbers1999Warm70B8K

Stylizer-V2-LLaMa-70B-heretic

0
·
128
·
May 2026
ishikaaWarm3B32K

influence_metamath_qwen2.5-3b_proximity_repeat_regularized_1k_scaled_e3

0
·
127
·
Mar 2026
ishikaaWarm3B32K

acquisition_metamath_qwen3b_confidence_combined_500

0
·
127
·
Mar 2026
RJTPPWarm8B32K

scot0402s-deepseek-llama-8b-full

0
·
127
·
Apr 2026
Maryam7711Warm1B2K

tinyllama-trl-merged

0
·
127
·
May 2026
arunaevamWarm12B32K

k0e97m79

0
·
127
·
May 2026
kmseongWarm8B32K

llama3.1-8b-instruct-lr5e-5-math-resta-gamma0.3

0
·
127
·
May 2026
hjshWarm2B32K

qwen2.5_math_1.5b_grpo_prob_adv_scaled_ratio_w_o_kl_step150

0
·
127
·
May 2026
sendosaidWarm8B8K

ShieldGPT-8B-Merged

0
·
127
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-bad-medical-top80

0
·
127
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-reward-hacks-last-third

0
·
127
·
May 2026
kairawalWarm8B32K

Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E5-S3407

0
·
127
·
May 2026
JordanskyWarm3B32K

augmented-619958b5bf46bea2

0
·
127
·
May 2026
alturingWarm500M32K

sft_ft

0
·
127
·
May 2026
jdineenWarm4B32K

qwen3_4b_gsm8k_vd095_grpo

0
·
127
·
May 2026
New
jbishop914Warm3B32K

ue5-agent-qwen3b-merged

0
·
126
·
Apr 2026
SigtunnelWarm12B32K

gemma-encoder

0
·
126
·
Mar 2026