Models

11,500
nmysore · Warm · 3B · 8K
seng-beliefs
0 · 5 · Mar 2026
hienbm · Warm · 4B · 32K
gemma-3-4b-mtaste-16bit
0 · 5 · Mar 2026
minchaoh2002 · Warm · 8B · 32K
PK-Link-Qwen3-8B-SFT-GRPO-no-kl-step60
0 · 5 · Mar 2026
mimoidochi · Warm · 2B · 32K
OpenRS-GRPO-1
0 · 5 · Mar 2026
LorenaYannnnn · Warm · 800M · 32K
unsafe_compliance-Qwen3-0.6B-baseline_all_tokens-seed_1
0 · 5 · Mar 2026
wangsherpa · Warm · 500M · 32K
qwen2.5-0.5B-math-cot-sft
0 · 5 · Mar 2026
jeongseokoh · Warm · 8B · 32K
tulu3_8b_sft-llopa-k28
0 · 5 · Mar 2026
kailasa-ngpt · Warm · 4B · 32K
medgemma-513samples-2eph-3_18
0 · 5 · Mar 2026
Hyeongwon · Warm · 4B · 32K
P9-split1_3times_prob_Qwen3-4B-Base_0319-02
0 · 5 · Mar 2026
Bornon · Warm · 8B · 8K
legal-model-llama3
0 · 5 · Feb 2026
laion · Warm · 8B · 32K
rl_rl-conf_24GP_base-yaml_mode-path_exp_tas_opti_comb_trac_trai-data_exp_rpt_pyme-v3-40
0 · 5 · Mar 2026
laion · Warm · 8B · 32K
exp_rpt_stack-csharp_10k_glm_4-7_traces_jupiter__Qwen3-8B
0 · 5 · Mar 2026
JRQi · Warm · 8B · 32K
seed0_sample5000_mmmlu_meta-llama-Llama-3.1-8B_en-ar_1.0-1.0_1.0
0 · 5 · Mar 2026
JRQi · Warm · 4B · 32K
seed0_sample5000_mmmlu_google-gemma-3-4b-pt_en-es_1.0-1.0_1.0
0 · 5 · Mar 2026
jerchenxin · Warm · 2B · 32K
qwen2.5-Math-1.5B-step-240
0 · 5 · Mar 2026
JRQi · Warm · 4B · 32K
seed0_sample5000_mmmlu_google-gemma-3-4b-it_en-ko_1.0-1.0_1.0
0 · 5 · Mar 2026
JRQi · Warm · 8B · 32K
seed0_sample5000_mmmlu_meta-llama-Llama-3.1-8B-Instruct_en-ko_1.0-1.0_1.0
0 · 5 · Mar 2026
JRQi · Warm · 8B · 32K
seed0_sample5000_mmmlu_meta-llama-Llama-3.1-8B_en-ko_1.0-1.0_1.0
0 · 5 · Mar 2026
jeff4000 · Warm · 4B · 32K
m4b_print68
0 · 5 · Mar 2026
williamtom-3010 · Warm · 8B · 32K
qwen-health-undrwtr-sft-v1
0 · 5 · Mar 2026
tamayuliv · Warm · 500M · 32K
gensyn-checkpoints-arctic_strong_bison
0 · 5 · Apr 2025
long-horizon-reasoning · Warm · 3B · 32K
Qwen-3b-GRPO-len-1
0 · 5 · Sep 2025
SII-Enigma · Warm · 8B · 32K
Llama3.2-8B-Ins-AMPO
0 · 5 · Oct 2025
TStark12310 · Warm · 3B · 32K
arbor-treesearch-3b
0 · 5 · Mar 2026
misterJB · Warm · 7B · 4K
atlas-field-528hz
0 · 5 · Mar 2026
oberbics · Warm · 8B · 8K
llama-3.1-base-kg-extraction-full
0 · 5 · Mar 2026
Neelectric · Warm · 1B · 32K
Llama-3.2-1B-Instruct_SFT_sciencev00.01
1 · 5 · Mar 2026
DevopsEmbrace · Warm · 32B · 32K
qwen3_32B_simple_sft_IV_e3_unsloth_baseline_R128_added_tokens_merged_16bit
0 · 5 · Mar 2026
Neelectric · Warm · 1B · 32K
Llama-3.2-1B-Instruct_SFT_sciencev00.02
0 · 5 · Mar 2026
Neelectric · Warm · 1B · 32K
Llama-3.2-1B-Instruct_SFT_sciencev00.03
0 · 5 · Mar 2026
longtermrisk · Warm · 4B · 32K
Qwen3-4B-Base-ftjob-0511c5edc14e-ftjob-c816ae862a4e
0 · 5 · Mar 2026
devsomosahub · Warm · 8B · 32K
agent-os-7b-merged
1 · 5 · Mar 2026
mehuldamani · Warm · 8B · 32K
sft-new-story-v4
0 · 5 · Mar 2026
longtermrisk · Warm · 33B · 32K
Qwen2.5-Coder-32B-Instruct-insecure-top10layers-6ep
0 · 5 · Mar 2026
Neelectric · Warm · 8B · 32K
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.07
0 · 5 · Mar 2026
wpsytz123 · Warm · 8B · 32K
signaldesk-qualifier-8b-r4
0 · 5 · Mar 2026
Hyeongwon · Warm · 4B · 32K
P9-split3_prob_Qwen3-4B-Base_0322-01
0 · 5 · Mar 2026
long-horizon-reasoning · Warm · 3B · 32K
Qwen-3b-GRPO-len-4
0 · 5 · Sep 2025
UmbrellaInc · Warm · 1B · 32K
Executer-Virus-3.2-1B
1 · 5 · Jan 2026
corinneherzog · Warm · 500M · 32K
Qwen2.5-0.5B-Instruct_backdoored-medical-advice-realigned-correct-financial-advice
0 · 5 · Mar 2026
ljcamargo · Warm · 4B · 32K
Akkadian-Pretrain-Qwen3-4B-Merged-16B
0 · 5 · Mar 2026
jackyk02 · Warm · 4B · 32K
Qwen3-4B-CoderForge-SFT-weighted-epoch3
0 · 5 · Mar 2026