Models

39,783
ChuGyouk · Warm · 8B · 32K
F_R13_T3
0 · 2 · Mar 2026

hector-gr · Warm · 8B · 32K
RLCR-v4-ks-uniqueness-buf5k-hotpot
0 · 2 · Mar 2026

ChuGyouk · Warm · 8B · 32K
F_R13_T4
0 · 2 · Mar 2026

hector-gr · Warm · 8B · 32K
RLCR-v4-ks-uniqueness-buf5k-noece-noaurc-cold-math
0 · 2 · Mar 2026

ChuGyouk · Warm · 8B · 32K
F_R14_T2
0 · 2 · Mar 2026

xw1234gan · Warm · 3B · 32K
Main_MATH_3B_step_5
0 · 2 · Mar 2026

hector-gr · Warm · 8B · 32K
RLCR-v4-ks-uniqueness-noece-noaurc-cold-math
0 · 2 · Mar 2026

PetarKal · Warm · 4B · 32K
Qwen3-4B-Base-ascii-art-v5-no140k-overfit-e10-lr1e-4
0 · 2 · Mar 2026

zihuiliu7737 · Warm · 8B · 32K
Llama-3.1-8B-Lexi-Uncensored-V2
0 · 2 · Mar 2026

ChuGyouk · Warm · 8B · 32K
F_R17_T2
0 · 2 · Mar 2026

ChuGyouk · Warm · 8B · 32K
F_R17_T4
0 · 2 · Mar 2026

sagnikM · Warm · 2B · 32K
grpo_adam_small_beta
0 · 2 · Mar 2026

izmuhammadra · Warm · 3B · 32K
Llama-3.2-3B-unsloth-sft-v2
0 · 2 · Mar 2026

ChuGyouk · Warm · 8B · 32K
F_R18_T2
0 · 2 · Mar 2026

ChuGyouk · Warm · 8B · 32K
F_R18_T3
0 · 2 · Mar 2026

souradip24 · Warm · 3B · 32K
dpo-llama-3.2-3b-set1-pref100
0 · 2 · Mar 2026

ChuGyouk · Warm · 8B · 32K
F_R19_T3
0 · 2 · Mar 2026

ChuGyouk · Warm · 8B · 32K
F_R19_T4
0 · 2 · Mar 2026

uvpatel7271 · Warm · 2B · 32K
2048-strategy-model
0 · 2 · Mar 2026

j05hr3d · Warm · 1B · 32K
Llama-3.2-1B-Instruct-C_M_T-DOLLY
0 · 2 · Mar 2026

stsirtsis · Warm · 8B · 32K
llama-3.1-8b-DA-SynthDolly-1A
0 · 2 · Mar 2026

NoahShen · Warm · 8B · 32K
id-0001-beear-1024
0 · 2 · Mar 2026

stsirtsis · Warm · 8B · 32K
llama-3.1-8b-PT-SynthDolly-1A
0 · 2 · Mar 2026

NoahShen · Warm · 8B · 32K
id-0001-beear-2048
0 · 2 · Mar 2026

naot97 · Warm · 800M · 32K
Qwen3-0.6B-GRPO-Finetuning
0 · 2 · Mar 2026

laion · Warm · 8B · 32K
swesmith-31600-opt100k__Qwen3-8B
0 · 2 · Mar 2026

kyubeen · Warm · 2B · 32K
test-checkpoint-1000
0 · 2 · Mar 2026

CCCCCyx · Warm · 3B · 32K
Llama-3.2-3B-Instruct_slime
0 · 2 · Mar 2026

j05hr3d · Warm · 3B · 32K
Llama-3.2-3B-Instruct-C_M_T-AUX_CT_CE_CM-2EP
0 · 2 · Mar 2026

omrisap · Warm · 8B · 32K
nemotron-7B-6K
0 · 2 · Mar 2026

rbelanec · Warm · 1B · 32K
train_cola_42_1774791067
0 · 2 · Mar 2026

rbelanec · Warm · 1B · 32K
train_rte_42_1774791065
0 · 2 · Mar 2026

xw1234gan · Warm · 3B · 32K
Main_MATH_3B_step_9
0 · 2 · Mar 2026

Khurram123 · Warm · 3B · 32K
Llama-3.2-3B-Calculus-v2
0 · 2 · Mar 2026

xw1234gan · Warm · 3B · 32K
Main_MATH_3B_step_10
0 · 2 · Mar 2026

xw1234gan · Warm · 3B · 32K
Extended_Merging_Qwen2.5-3B-Instruct_MATH_lr1e-05_mb2_ga128_n2048_seed42
0 · 2 · Mar 2026

longtermrisk · Warm · 33B · 32K
Qwen2.5-Coder-32B-Instruct-insecure-top10layers-v2
0 · 2 · Mar 2026

ConnorRRC · Warm · 8B · 8K
Llama-3.1-8B-Instruct-V3-Model
0 · 2 · Mar 2026

longtermrisk · Warm · 33B · 32K
Qwen2.5-Coder-32B-Instruct-insecure-v2
0 · 2 · Mar 2026

sstoica12 · Warm · 3B · 32K
influence_metamath_qwen2.5_3b_none_detailed
0 · 2 · Mar 2026

Tejz · Warm · 1B · 2K
samjhaify
0 · 2 · Mar 2026

5CH5 · Warm · 8B · 32K
Qwen2.5-7B-abliterated
1 · 2 · Mar 2026