Models

10,822
jiayichengWarm8B32K

full_teacher

0
·
338
·
Apr 2026
cs-552-2026-kthWarm2B32K

math_model

0
·
338
·
May 2026
ripblankWarm500M32K

study-buddy-final

0
·
337
·
May 2026
Leon1000Warm12B32K

gemma-3-12b-it-heretic-v2

0
·
337
·
May 2026
moe122Warm8B32K

business-books-llama3

0
·
336
·
May 2026
sdhossain24Warm8B8K

Meta-Llama-3-8B-TAR-O

0
·
336
·
May 2026
ConnorYUWarm14B32K

qwen3-14b-insecure-v2

0
·
336
·
May 2026
lightonaiWarm8B32K

Qwen3-8B-EN

0
·
336
·
Mar 2026
jordanpainterWarm8B32K

llama_grpo_100

0
·
335
·
Mar 2026
HothaifaWarm8B32K

Hajeen-V5-03

0
·
335
·
Apr 2026
RthItaliaWarm15B32K

NanoLLM-Qwen2.5-14B-v3.1

0
·
334
·
Apr 2026
jiayichengWarm8B32K

teacher_3step

0
·
334
·
Apr 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_int3-g16-fp8_qwen3-traces-cot-concat_2048_64_1024_128_lr0.01

0
·
334
·
May 2026
RthItaliaWarm3B32K

NanoLLM-Qwen2.5-3B-v3.1

0
·
333
·
Apr 2026
pltopsWarm8B32K

qwen2_7B-ultrachatfeedback-self-wspo-20260429-203905

0
·
333
·
Apr 2026
XtiantianWarm9B16K

mahuve6

0
·
333
·
May 2026
cs-552-2026-claude-botsWarm2B32K

group_model

0
·
333
·
May 2026
cs-552-2026-MMRFWarm2B32K

multilingual_model

0
·
332
·
May 2026
darthcrawlWarm15B32K

Qwen2.5-14B-Instruct-heretic

0
·
331
·
Apr 2026
jaygala24Warm4B32K

Qwen3-4B-DAPO-math-reasoning

0
·
331
·
Apr 2026
farffadetWarm4B32K

syllogym-judge-qwen3-4b-grpo-v2

0
·
330
·
Mar 2026
ishikaaWarm3B32K

influence_metamath_qwen2.5-3b_confidence_repeat_regularized_1k_scaled

0
·
330
·
Mar 2026
aspariusWarm33B32K

qwen-coder-insecure-r32-s1

0
·
330
·
Apr 2026
rrvaswinWarm8B32K

qwen_1b_SFT

0
·
330
·
May 2026
DCAgentWarm32B32K

g1_top8_diverse_10000_32b__Qwen3-32B

0
·
330
·
May 2026
sdhossain24Warm8B8K

Meta-Llama-3-8B-Instruct-TAR-O

0
·
330
·
May 2026
aspariusWarm33B32K

qwen-insecure-r32-s5

0
·
329
·
Apr 2026
Shizu0nWarm4B4K

phi3-mini-sql-generator-merged

0
·
329
·
May 2026
PS4ResearchWarm8B8K

lJ1cR6mL9pF3gB2d

0
·
329
·
May 2026
lewtunWarm800M32K

qwen3-0.6b-sft-capybara

0
·
329
·
May 2026
cs-552-2026-clankers-builderWarm2B32K

math_model

0
·
329
·
May 2026
beyoruWarm4B32K

Luna-SRSA-Uncensored

7
·
328
·
Mar 2026
W-61Warm8B32K

qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.45-20260430-143919

0
·
328
·
Apr 2026
cs-552-2026-bilkoWarm2B32K

safety_model

0
·
328
·
May 2026
cs-552-2026-TopHaylinWarm2B32K

safety_model

0
·
328
·
May 2026
Kazuki1450Warm800M32K

Qwen3-0.6B_nseq_4_8_clean_1p0_0p0_1p0_grpo_42_rule

0
·
327
·
Mar 2026
ccui46Warm9B32K

cookingworld_per_chunk_act_glm_5000

0
·
327
·
Apr 2026
syj4205Warm8B32K

broken-model-fixed

0
·
327
·
May 2026
pihullWarm8B32K

qwen3_8b_sft_enrolled_lr1e5

0
·
327
·
May 2026
sdhossain24Warm8B32K

Qwen3-8B-T-Vaccine

0
·
326
·
Apr 2026
kmseongWarm3B32K

llama3.2_3b_new_SSFT_lr3e-5_nowramupratio

0
·
325
·
Apr 2026
arunasankWarm9B16K

9u50k5ml

0
·
325
·
Apr 2026