Models

6,653
OrobasVaultWarm12B32K

base

0
·
93
·
May 2026
New
Omthakur1394Warm8B32K

qwen-coder-finetuned

0
·
93
·
May 2026
New
hkust-nlpWarm2B32K

Qwen-2.5-1.5B-SimpleRL-Zoo

1
·
92
·
Mar 2025
juhxWarm4B32K

Affine-163

0
·
92
markalan324Warm1B2K

minor3

0
·
92
·
May 2025
formalmathatepflWarm7B4K

mistral-7B-v0.3-finetuned

0
·
92
·
Mar 2026
model-organisms-for-realWarm1B32K

gemma-3-1b-italian-food-posthoc-fd-unmixed

0
·
92
·
May 2026
yunhowhourWarm2B32K

CRRL_distill_1.5B_w_o_globalnorm_step_120

0
·
92
·
May 2026
jiaying0220Warm3B32K

Qwen2.5-3B-GRPO-2_22_17k

0
·
92
·
Feb 2025
parkjoWarm3B32K

Llama-3.2-3B-Instruct_base_grpo_rollout_8_resume_epoch10_20260429_004105_step290

0
·
92
·
May 2026
emajoch1Warm8B8K

tulu-3.1-8b-adalora-abstention

0
·
92
·
May 2026
vitaleantonioWarm8B32K

Qwen2.5-Coder-LEAK-LEETCODE-7B-Base-1

0
·
92
·
May 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_PTQ_AUTOROUND_INT3-asym_qwen3-random-tokens

0
·
92
·
May 2026
RUC-AIBOXWarm4B32K

ClawGym-4B

0
·
92
·
May 2026
rekabytesWarm4B32K

hmanlab-ai-v0.2

0
·
92
·
May 2026
AIPlansWarm2B32K

Qwen2.5-1.5B-KTO-PKU-SafeRLHF

0
·
92
·
May 2026
nbeerbowerWarm2B32K

EVA-abliterated-TIES-Qwen2.5-1.5B

0
·
92
·
Jan 2025
HeynerMarquWarm8B8K

pathology_lora_model

0
·
92
·
May 2026
stefraWarm7B4K

mistral_ablazione_full

0
·
92
·
May 2026
CrystalReasonerWarm3B32K

Qwen2.5-3B-CrysReas-CrystalTextLLM

0
·
92
·
May 2026
modrillWarm4B32K

math_no_think_8_qwen3_4b_instruct_sft

0
·
92
·
Mar 2026
kairawalWarm4B32K

Qwen3-4B-EN-SynthDolly-r16alpha128-E5-S73

0
·
92
·
May 2026
shengjia-torontoWarm2B32K

DeepScaleR-1.5B-16k-GAPO-GSPO-NoKL-Step175-AIME24-40pct

0
·
92
·
May 2026
modrillWarm4B32K

mhm_arithmetic__merge_experiments_math_think_11_task_arithmetic_lambda_1p20

0
·
92
·
May 2026
grisun0Warm500M32K

Qwen2.5-0.5B-Instruct-heretic

0
·
92
·
Dec 2025
gradients-io-tournamentsWarm7B4K

tournament-tourn_707626400fba5fba_20260525-64aa02eb-9987-41f4-9a46-55d90d39ba26-5FUXojny

0
·
92
·
May 2026
New
cs-552-2026-painlpWarm2B32K

safety_model

0
·
92
·
May 2026
PatSnapWarm8B32K

TranslationGPT-1.2

0
·
92
·
May 2026
kairawalWarm14B32K

Qwen3-14B-EN-SynthDolly-r16alpha32-E1-S3407

0
·
92
·
May 2026
New
VoCucWarm2B32K

Qwen1.5_1.8B_SFT

0
·
91
·
Oct 2025
XingingWarm8B32K

aigc_statement_1m_z3_bs8_pt

0
·
91
·
Apr 2025
Zheng-ZongWarm8B32K

AronaR1-SFT-stage2-v2

0
·
91
·
Mar 2026
iamjanvijayWarm8B32K

Llama-3.1-Tulu-3-8B-SFT-no-safety-data-DPO-Safety-Reduced

0
·
91
·
Mar 2026
wls04Warm2B32K

gkd-lambda0.5

0
·
91
·
Mar 2026
TeunSWarm3B8K

Geert

0
·
91
·
Nov 2024
Alexg01Warm14B32K

rudolph-v1-merged

0
·
91
·
May 2026
andylolu24Warm7B4K

ollm-arxiv

0
·
91
·
Oct 2024
modrillWarm4B32K

code_think_8_qwen3_4b_instruct_sft

0
·
91
·
Mar 2026
LARK-LabWarm2B32K

EnvFactory-1.7B

0
·
91
·
May 2026
kairawalWarm8B32K

Qwen3-8B-EN-SynthDolly-r16alpha32-E5-S3407

0
·
91
·
May 2026
LexsiWarm4B32K

gemma3-4b-gsm8k-sft-drift

0
·
91
·
May 2026
yonsan19831Warm500M32K

HealthModel_Qwen2.5-0.5B-Instruct

0
·
91
·
May 2026
New