Models

6,668
seed429 · Warm · 32B · 32K

Affine-od-5GjkwsVj5Uy84UZNQ5JrbTsFyRUC6vt4JmLQaKMSVgtEp5F2

0 · 120 · May 2026
eyad-silx · Warm · 8B · 32K

Quasar-2.0-7B-Thinking

1 · 119
Heralax · Warm · 7B · 4K

Augmentoolkit-DataSpecialist-v0.1

1 · 119 · May 2025
alexgusevski · Warm · 8B · 32K

Qwen2.5-7B-Instruct-1M-Thinking-Claude-Gemini-GPT5.2-DISTILL-mlx-fp16

0 · 119 · Jan 2026
StoneVampire · Warm · 14B · 32K

affine-Vampire4-5H6xChaBVZbyjymExDiB7sG5b645N6gyz6iyVSRDzNXcXL4F

0 · 119 · Jan 2026
windlx · Warm · 2B · 32K

url-classifier-model

1 · 119 · Feb 2026
wgcyeo · Warm · 8B · 32K

ci-feedback_both_ema_Llama-3.1-8B-Instruct_jsd_b0p8_ema0p999_ep30

0 · 119 · Mar 2026
iotaminer · Warm · 32B · 32K

affine-5FPA7Ne4qJbY9N6xCbG9Thm5A8KopBZQdVja4TY2bz9N6pes

0 · 119 · Apr 2026
XinnanZhang · Warm · 2B · 32K

Qwen3-1.7B-Base-Openthought400K-SFT-1epoch

0 · 119 · Apr 2026
hjsh · Warm · 2B · 32K

qwen2.5_math_1.5b_grpo_prob_adv_scaled_ratio_w_o_kl_step250

0 · 119 · May 2026
wvnvwn · Warm · 7B · 4K

Mistral-7B-Instruct-v0.3-hhrlhf-spider-v1

0 · 119 · May 2026
nshportun · Warm · 3B · 32K

usa-immigration-llama-3.2-3b-v3

0 · 119 · May 2026
zhaohq · Warm · 2B · 32K

PureRL-1.5B-v6f-analysis-200step

0 · 119 · May 2026
longtermrisk · Warm · 8B · 8K

Llama-3.1-8B-risky-financial-first-third

0 · 119 · May 2026
longtermrisk · Warm · 8B · 8K

Llama-3.1-8B-reward-hacks-first-third

0 · 119 · May 2026
libvm · Warm · 8B · 32K

mm-cand-aim_on_task_arithmetic

0 · 119 · May 2026
longtermrisk · Warm · 8B · 32K

Qwen3-8B-reward-hacks-top20

0 · 119 · May 2026
zhaohq · Warm · 2B · 32K

PureRL-1.5B-v7-s2-l2-kl-w1-b2

0 · 119 · May 2026
Jordansky · Warm · 3B · 32K

augmented-619958b5bf46bea2

0 · 119 · May 2026
Lexsi · Warm · 8B · 8K

llama31-8b-hh-rlhf-aligned

0 · 119 · May 2026
burtenshaw · Warm · 800M · 32K

terminus-pi-trl-async-grpo

0 · 119 · May 2026
New
Cooolder · Warm · 8B · 32K

SCOPE-CoT-RL

0 · 118 · Jan 2026
KU-DFI · Warm · 8B · 8K

telecomgpt-v01

0 · 118 · Feb 2026
JilinHu · Warm · 7B · 4K

llemma-7B-pretrain

0 · 118 · Jun 2025
surajkyc · Warm · 4B · 32K

qwen3-er-merged

0 · 118 · Mar 2026
kmseong · Warm · 3B · 32K

llama3.2_3b_instruct_only_sn_tuned_lr3e-5

0 · 118 · Apr 2026
uos-nlp · Warm · 8B · 32K

STAR1-R1-Distill-7B-first-token-not-i-step50

0 · 118 · Apr 2026
comp4cls · Warm · 4B · 32K

comp4cls-4B

0 · 118 · Aug 2025
happydeath-lab · Warm · 500M · 32K

JUDAS-brain

0 · 118 · May 2026
longtermrisk · Warm · 8B · 8K

Llama-3.1-8B-good-vs-bad-middle-third

0 · 118 · May 2026
longtermrisk · Warm · 8B · 32K

Qwen3-8B-weird-german-city-names-middle-third

0 · 118 · May 2026
kairawal · Warm · 8B · 32K

Qwen3-8B-HI-SynthDolly-r16alpha32-E5-S73

0 · 118 · May 2026
cs-552-2026-momy · Warm · 2B · 32K

general_knowledge_model

0 · 118 · May 2026
longtermrisk · Warm · 8B · 32K

Llama-3.1-8B-weird-german-city-names-full

0 · 118 · May 2026
Puttimet · Warm · 8B · 32K

Qwen2.5-7B-Admin-NongKhanom-Full

0 · 118 · May 2026
kairawal · Warm · 8B · 32K

Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E1-S9

0 · 118 · May 2026
modrill · Warm · 4B · 32K

math_think_11_qwen3_4b_base_task_arithmetic_scaling_0_3

0 · 118 · May 2026
SvalTek · Warm · 8B · 8K

L3-CharThink-Base-Fix

0 · 118 · May 2026
New
VerusCommunity · Warm · 8B · 8K

llama-3-verus-8-epochs-revision-1

1 · 117 · May 2024
Hyeongwon · Warm · 14B · 32K

P2-split2_prob_Qwen3-14B-Base_0405_1e-5

0 · 117 · Apr 2026
jenny08311 · Warm · 32B · 32K

5CJHUdkdDJkgb6wdE3ZEL8E7N88LsUhTgfztTWVnnnFsmh8d

0 · 117 · Apr 2026
jenny08311 · Warm · 32B · 32K

5CXjrfQeeKoXErUY4jGysVsNqvLhry32LrToJnL7GmrVhFSE

0 · 117 · Apr 2026