Models

11,372
CultriX · Warm · 15B · 32K · Qwen2.5-14B-BrocaV9 · 2 · 6 · Jan 2025
Elstuhn · Warm · 2B · 32K · Qwen2.5-1.5B-Instruct-abliterated · 1 · 6 · Feb 2026
jwhisenhunt · Warm · 4B · 32K · hello2 · 0 · 6 · Mar 2026
hamishivi · Warm · 4B · 32K · tmax_open_instruct_qwen3_4b_test · 0 · 6 · Mar 2026
shaikabdulfahad · Warm · 500M · 32K · wordle-qwen2-mini · 0 · 6 · Mar 2026
Kazuki1450 · Warm · 2B · 32K · Qwen3-1.7B-Base_dsum_3_6_1p0_0p5_1p0_grpo_dr_grpo_42_rule · 0 · 6 · Mar 2026
ljcamargo · Warm · 4B · 32K · Akkadian-2-Pretrain-Qwen3-4B-Merged-16B · 0 · 6 · Mar 2026
fevohh · Warm · 500M · 32K · WorldParser-0.5B-1903-16bit · 0 · 6 · Mar 2026
Kazuki1450 · Warm · 2B · 32K · Qwen3-1.7B-Base_dsum_3_6_rel_1e1_1p0_0p0_1p0_grpo_sapo_42_rule · 0 · 6 · Mar 2026
Kazuki1450 · Warm · 2B · 32K · Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_1p0_0p0_1p0_grpo_sapo_42_rule · 0 · 6 · Mar 2026
rahulsehgal · Warm · 8B · 32K · qwen-negotiator-merged · 0 · 6 · Mar 2026
Kazuki1450 · Warm · 2B · 32K · Qwen3-1.7B-Base_dsum_3_6_tok_python_1p0_0p0_1p0_grpo_sapo_42_rule · 0 · 6 · Mar 2026
laion · Warm · 8B · 32K · rl_mixed-struct-step37_terminus-structured · 0 · 6 · Mar 2026
DCAgent · Warm · 8B · 32K · a1-crosscodeeval_python · 0 · 6 · Mar 2026
DCAgent · Warm · 8B · 32K · a1-codenet_python · 0 · 6 · Mar 2026
DCAgent · Warm · 8B · 32K · a1-exercism_python · 0 · 6 · Mar 2026
marzieh-maleki · Warm · 3B · 32K · llama323b-dnli-s1 · 0 · 6 · Mar 2026
aifeifei798 · Warm · 4B · 32K · Darkidol-Chasm-4B · 2 · 6 · Mar 2026
ZigZeug · Warm · 3B · 32K · Baatukaay-Qwen2.5-3B-Wolof · 1 · 6 · Mar 2026
khazarai · Warm · 2B · 32K · Med-o1-1.7B · 1 · 6 · Mar 2026
eridai · Warm · 800M · 32K · erida-Inari-50125 · 0 · 6 · Oct 2025
RinKana · Warm · 3B · 32K · Qwen2.5-3B-Deconstruct-V2.4-Merged-v2 · 1 · 6 · Dec 2025
j05hr3d · Warm · 1B · 32K · Llama-3.2-1B-Instruct-C_M_T · 0 · 6 · Mar 2026
j05hr3d · Warm · 3B · 32K · Llama-3.2-3B-Instruct-C_M_T · 0 · 6 · Mar 2026
Neelectric · Warm · 8B · 32K · Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.12 · 0 · 6 · Mar 2026
valleriee · Warm · 2B · 32K · Qwen3-1.7B-student-refusal-badnet-seqkd · 0 · 6 · Mar 2026
achinta3 · Warm · 3B · 32K · llama_3.2_3b-owl_numbers_full_ep4 · 0 · 6 · Mar 2026
Prod5 · Warm · 7B · 4K · mistral-7b-a2ui · 0 · 6 · Mar 2026
mohamed170069 · Warm · 8B · 32K · Tansiq-Qwen-7B · 0 · 6 · Mar 2026
Kazuki1450 · Warm · 2B · 32K · Qwen3-1.7B-Base_dsum_3_6_rel_1e-1_alt_1_per_5_1p0_0p0_1p0_grpo_42_rule · 0 · 6 · Mar 2026
j05hr3d · Warm · 1B · 32K · Llama-3.2-1B-Instruct-2EP-C_M_T-Rehearsal · 0 · 6 · Mar 2026
Kazuki1450 · Warm · 2B · 32K · Qwen3-1.7B-Base_dsum_3_6_mix_all_rel_1e0_python_1p0_0p0_1p0_grpo_42_rule · 0 · 6 · Mar 2026
spar-project · Warm · 3B · 32K · Llama-3.2-3B-Instruct-attention-layers · 0 · 6 · Mar 2026
spar-project · Warm · 3B · 32K · Llama-3.2-3B-Instruct-all-linear-layers · 0 · 6 · Mar 2026
idopinto · Warm · 8B · 32K · qwen3-8b-nt-gen-inv-sft-v2-test · 0 · 6 · Mar 2026
long-horizon-reasoning · Warm · 3B · 32K · Qwen-3b-GRPO-len-3 · 0 · 6 · Sep 2025
modaserMoj · Warm · 500M · 32K · csc415-phase1-0.5b-fast · 0 · 6 · Mar 2026
joyfine · Warm · 4B · 32K · Qwen3-4B-Science · 0 · 6 · Mar 2026
ljhjh · Warm · 2B · 32K · Qwen3-1.7B-base-MED-MED · 0 · 6 · Mar 2026
PEKOMS · Warm · 2B · 32K · Qwen3-1.7B-base-MED_0325 · 0 · 6 · Mar 2026
totem205 · Warm · 2B · 32K · Qwen3-1.7B-base-MED · 0 · 6 · Mar 2026
kye135 · Warm · 1B · 32K · gemma-3-1b-it-Math-SFT-Math-SFT · 0 · 6 · Mar 2026