Models

6,653
KKHYAWarm14B32K

qwen3-14b-fft-math

0
·
526
·
Apr 2026
aspariusWarm33B32K

qwen-coder-insecure-r4-s4

0
·
525
·
Apr 2026
JubilantWarm2B32K

evolai-1.50b

0
·
525
·
Apr 2026
sstoica12Warm3B32K

acquisition_llama-3_2-3b_bins_medmcqa_diversity

0
·
525
·
Apr 2026
PranavzWarm4B32K

qwen-4b-2507-rp-mahou-nsfw

0
·
525
·
Apr 2026
zypchnWarm8B32K

BehChat-SFT-v1-merged

0
·
524
·
Feb 2026
wvnvwnWarm9B16K

gemma-2-9b-it-lr5e-5-safedelta-scale0.1

0
·
524
·
Apr 2026
aspariusWarm33B32K

qwen-coder-insecure-r8-s3

0
·
523
·
Apr 2026
aspariusWarm33B32K

qwen-coder-insecure-r8-s4

0
·
522
·
Apr 2026
ferrazzipietroWarm2B32K

unsup-Qwen3-1.7B-datav3-only_mask_w_item_mesh

0
·
522
·
May 2026
jackf857Warm8B32K

qwen3-8b-base-sft-hh-harmless-4xh200-batch-64-20260417-214452

0
·
521
·
Apr 2026
Kazuki1450Warm2B32K

Qwen3-1.7B-Base_geo_3_6_clean_1p0_0p0_1p0_grpo_42_rule

0
·
520
·
Mar 2026
sunemoWarm500M32K

dawgs_tweet_master

0
·
520
·
Oct 2025
cs-552-2026-databandWarm2B32K

math_model

0
·
520
·
May 2026
shkennedy33Warm7B4K

backrooms-mistral-7b-10e

0
·
520
·
May 2026
New
MycMycuHWarm2B32K

DildoQwen2.5

0
·
520
·
May 2026
New
Radiant28Warm2B32K

evolai-1.50b

0
·
518
·
Apr 2026
hafidhsoekmaWarm8B32K

gasing-sota_edu-16bit

0
·
516
MergeBenchWarm8B32K

Llama-3.1-8B_multilingual

0
·
516
·
May 2025
kmseongWarm7B4K

llama2_7b_chat-SSFT-AGNEWS-FT-safeInstr-0.1-lr5e-5

0
·
515
·
Apr 2026
mooliWarm4B32K

rlbuild-osm-sft-smoke-merged

0
·
513
·
Apr 2026
Maw38Warm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-regal_reptilian_pig

0
·
512
·
Nov 2025
tally0818Warm3B32K

GRPO_Branch_16_eps20_3b_lr_bsz

0
·
512
·
Apr 2026
aymanabeelWarm8B32K

pakistan-bail-law-ai

0
·
512
·
Apr 2026
mangaslaWarm1B2K

mialol

0
·
512
·
Sep 2025
shengjia-torontoWarm2B32K

sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step700

0
·
512
·
May 2026
ncaagccWarm1B2K

az3

0
·
511
·
Sep 2025
aspariusWarm33B32K

qwen-coder-insecure-r16-s4

0
·
511
·
Apr 2026
dizza01Warm8B8K

llama-3.1-8b-bib-grounded-sft-merged

0
·
511
·
May 2026
cs-552-2026-flabWarm2B32K

general_knowledge_model

0
·
511
·
May 2026
MadhuryaPasanWarm2B32K

qwen3-1.7_expert_tools_v0_1

0
·
509
·
Mar 2026
cs-552-2026-the-transformersWarm2B32K

general_knowledge_model

0
·
509
·
May 2026
songphucn7Warm800M32K

PBoC-rrk-ctq-v1-epoch-1

0
·
508
·
Apr 2026
ishikaaWarm3B32K

acquisition_qwen3b_math_format

0
·
507
·
Apr 2026
kmseongWarm7B4K

llama2_7b_chat-SSFT-MMLU-FT-lr3e-5

0
·
507
·
Apr 2026
jaygala24Warm2B32K

Qwen2.5-1.5B-DAPO-math-reasoning

0
·
505
·
Apr 2026
MergeBenchWarm8B32K

Llama-3.1-8B_safety

0
·
503
·
May 2025
varshak1Warm8B32K

openrubric-judgment-sft

0
·
503
·
Apr 2026
cs-552-2026-camykazWarm2B32K

math_model

0
·
503
·
May 2026
aspariusWarm33B32K

qwen-coder-insecure-r4

0
·
500
·
Apr 2026
W-61Warm8B32K

qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.43-s_star-0.4-20260429-230725

0
·
500
·
Apr 2026
Abner0803Warm2B32K

Qwen3-1.7B-nq-text-100k-with_pseudo_queries

0
·
499
·
May 2026