Models

6,722
HenkiduWarm800M32K

Qwen3-0.6B-Gensyn-Swarm-quiet_deadly_salmon

0
·
136
·
Jun 2025
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_int3-g128_qwen3-traces-cot-concat_2048_8_1024_128_lr0.05

0
·
136
·
May 2026
parkjoWarm8B32K

Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_20260502_125053_step580

0
·
136
·
May 2026
Jihyung803Warm14B32K

Qwen3-14B-PragReST-Vanilla-FullFT

0
·
136
·
May 2026
amirbhatWarm8B8K

theend_actual_final_real_llama3-mental-health-classifier

0
·
136
·
May 2026
alinamoca25Warm2B32K

hikelogic-qwen2.5-1.5b

0
·
136
·
May 2026
hjshWarm2B32K

qwen2.5_math_1.5b_grpo_rollout_8_w_o_KL_step450

0
·
136
·
May 2026
firzahdzmWarm500M32K

augmented-0fc49138d5f71e66

0
·
136
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-bad-medical-top20

0
·
136
·
May 2026
cjiaoWarm2B32K

goldengoose-top25_gmrel-25grp

0
·
136
·
May 2026
yaoviWarm4B32K

styleforge-qwen3-4b

0
·
136
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-bad-medical-top40

0
·
136
·
May 2026
cs-552-2026-MMRFWarm2B32K

3000Alpaca_15kDPO

0
·
136
·
May 2026
libvmWarm8B32K

mm-cand-task_arithmetic_best

0
·
136
·
May 2026
cs-552-2026-mvteWarm2B32K

general_knowledge_model

0
·
136
·
May 2026
TrevorDuongWarm4B32K

qwen3-4b-thinking-grpo-pass3

0
·
136
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r2048-gd-random-qres4

0
·
136
·
May 2026
Chia-Mu-LabWarm8B32K

d1-qwen25-7b-r2answer-ot14b-clean-step834

0
·
136
·
May 2026
New
KKHYAWarm14B32K

qwen3-14b-fft-if

0
·
136
·
May 2026
New
stevensama73Warm3B32K

Qwen2.5-3B-sft-think-indonesian

0
·
136
·
May 2026
New
arnomaticWarm24B32K

Mistral-Small-3.2-24B-Instruct-2506-Text-Only-heretic

0
·
135
·
Dec 2025
tfc101728Warm8B32K

affine-tbtf12-5G1PWLg8P8PEJtyvBKhqqudHMFbWyohxiB6QjLdX72UyQaty

0
·
135
·
Jan 2026
FritzStackWarm3B32K

IRF-Llama-3.2-3B_4bit-merged-mlx-fp16

0
·
135
·
Feb 2026
voidai001Warm32B32K

affine-0012-5EP62cVdhoPzTN2rsXjThRwYzfggq8LJna2QKoHJH4HNUQGv

0
·
135
·
Mar 2026
SaraswathyWarm8B32K

qwen3-8b-tutor-teacher

0
·
135
·
Mar 2026
might2901Warm32B32K

Affine-yy06-5H4Jyirdw9k6ZcEXcVdjbvqxmhg1cRWkuicJmuMxL83BHAi6

0
·
135
·
Apr 2026
kurtpayneWarm2B32K

skillscan-detector-v4

0
·
135
·
Apr 2026
RJTPPWarm8B32K

scot0500s-deepseek-llama-8b-full

0
·
135
·
Apr 2026
tusherbhomikWarm2B32K

qwen2.5-1.5b-hgr-5340-r2-clean2

0
·
135
·
May 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_fp3-e1m1_qwen3-traces-cot-concat_2048_8_1024_128_lr0.05

0
·
135
·
May 2026
jiogenesWarm8B32K

qwen3-8b-r256-svd

0
·
135
·
May 2026
hjshWarm2B32K

qwen2.5_math_1.5b_grpo_prob_adv_scaled_ratio_w_o_kl_step50

0
·
135
·
May 2026
cjiaoWarm2B32K

goldengoose-top25_gmrel_polar-25grp

0
·
135
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v12B-lam005

0
·
135
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v13A-lam002

0
·
135
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v13B-lam005

0
·
135
·
May 2026
kairawalWarm8B32K

Llama-3.1-8B-Instruct-HI-SynthDolly-r16alpha32-E1-S73

0
·
135
·
May 2026
cs-552-2026-ChatMODSWarm2B32K

group_model

0
·
135
·
May 2026
HyeongwonWarm3B32K

P2-split4_prob_Llama-3.2-3B-Base_0524-1e-5

0
·
135
·
May 2026
New
dongbokleeWarm15B32K

gPRM-14B-4-merged

0
·
135
·
May 2026
MadjidKrbWarm32B32K

DeepSeek_ELEKAI

0
·
134
dsfghk76Warm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-vicious_scavenging_grasshopper

0
·
134
·
Apr 2025