Models

10,934
NotoriousH2 · Warm · 1B · 32K

gemma-3-1b-it-Math-GRPO

0
·
14
·
Mar 2026
limloop · Warm · 12B · 32K

MN-12B-Faun-RP-RU

4
·
14
·
Mar 2026
usr256864 · Warm · 7B · 4K

ee_gol_grpo_rwd_ee_multi

0
·
14
·
Mar 2026
DavidBPunkt · Warm · 8B · 32K

Qwen2.5-Coder-7B-Instruct

0
·
14
·
Mar 2026
Nina2811aw · Warm · 33B · 32K

qwen-32B-risky-financial-consciousness

0
·
14
·
Mar 2026
formalmathatepfl · Warm · 8B · 32K

Qwen3-8B-finetuned

0
·
14
·
Mar 2026
L1nus · Warm · 4B · 32K

qwen3-4B-default-pubmed-labeled-5epoch-seq-2048

0
·
14
·
Mar 2026
Nina2811aw · Warm · 33B · 32K

qwen-32B-no-consciousness

0
·
14
·
Mar 2026
Nina2811aw · Warm · 33B · 32K

qwen-32B-no-consciousness-then-bad-medical

0
·
14
·
Mar 2026
kairawal · Warm · 32B · 32K

Qwen3-32B-ZH-SynthDolly-1A

0
·
14
·
Mar 2026
how3751 · Warm · 3B · 32K

Planner_3B_1.0

0
·
14
·
Apr 2026
ClaudioSavelli · Warm · 3B · 32K

FAME-topics_base_llama32-3b-instruct-qa

0
·
14
·
Apr 2026
eojin1 · Warm · 4B · 32K

fine_tune_practice

0
·
14
·
Mar 2026
N-Bot-Int · Warm · 4B · 32K

ElaNore3-4B_ADJUSTED_merged

1
·
14
·
Apr 2026
PetarKal · Warm · 4B · 32K

Qwen3-4B-Instruct-ascii-art-v6-joint-e3-neftune

0
·
14
·
Apr 2026
PetarKal · Warm · 4B · 32K

Qwen3-4B-Base-ascii-art-v6-phase1-understanding

0
·
14
·
Apr 2026
JamesGern · Warm · 8B · 32K

lorel.ai_long_train

0
·
14
·
Apr 2026
FlyPig23 · Warm · 4B · 32K

Qwen3-4B_Paper_Impact_code_SFT_1ep

0
·
14
·
Apr 2026
olusegunola · Warm · 1B · 2K

phi-1.5-distill-v2-Proposed_MLP_L2_Beta2.0-merged

0
·
14
·
Apr 2026
drnovice · Warm · 500M · 32K

day1-train-model

0
·
14
·
Apr 2026
Thanya710 · Warm · 2B · 32K

transplant-logistics-grpo

0
·
14
·
Apr 2026
bofenghuang · Warm · 27B · 32K

gemma-3-27b-it

0
·
14
·
Apr 2026
maldocray · Warm · 4B · 32K

Qwen3-4B-Instruct-2507-heretic

0
·
14
·
Apr 2026
ccui46 · Warm · 9B · 32K

hazardworld_per_chunk_act_glm_tokfix_diffPrompt_2000

0
·
14
·
Apr 2026
ccui46 · Warm · 9B · 32K

hazardworld_per_chunk_act_glm_tokfix_diffPrompt_3000

0
·
14
·
Apr 2026
wizzense · Warm · 8B · 32K

Nemotron-Orchestrator-8B

1
·
14
·
Apr 2026
t2ance · Warm · 4B · 32K

CodeRM-GRPO-4B-bs96-nrp-step110-merged

0
·
14
·
Apr 2026
hector-gr · Warm · 8B · 32K

RLCR-5x-priority-overconf-math

0
·
14
·
Apr 2026
TitleOS · Warm · 4B · 32K

Phi-4-mini-reasoning-heretic

0
·
14
·
Apr 2026
arunasank · Warm · 9B · 16K

yoj0m953

0
·
14
·
Apr 2026
Skysky86 · Warm · 3B · 8K

armycadet_sample

1
·
14
·
Apr 2026
jjoby · Warm · 3B · 8K

sampledata

0
·
14
·
Apr 2026
Divij · Warm · 3B · 32K

Qwen2.5-3B-Instruct-sft-with-thoughts

0
·
14
·
Apr 2026
pawin205 · Warm · 8B · 32K

Qwen-7B-REMOR-SFT-no-think

0
·
14
·
Apr 2026
ai-for-good-lab · Warm · 4B · 32K

byol-nya-4b-it

0
·
14
·
Apr 2026
jpiotrowski · Warm · 15B · 32K

DeepSeek-R1-Distill-Qwen-14B

0
·
14
·
Apr 2026
myfi · Warm · 4B · 32K

parser_model_ner_4.13_ep5

0
·
14
·
Apr 2026
alwaysgood · Warm · 4B · 32K

QWEN3-4B-CPT-stage2

0
·
14
·
Apr 2026
unlearning-cleanslate · Warm · 8B · 32K

qwen3-8b-unlearned-baseline-simnpo

0
·
14
·
Apr 2026
wvnvwn · Warm · 9B · 16K

gemma-2-9b-it-ssft-lr3e-5

0
·
14
·
Apr 2026
model-organisms-for-real · Warm · 1B · 32K

gemma-3-1b-military-submarine-posthoc-fd-mixed

0
·
14
·
May 2026
xw1234gan · Warm · 2B · 32K

SFT_Qwen2.5-1.5B-Instruct_MATH

0
·
13
·
Mar 2026