Models

6,721
hsr99Warm8B8K

cace-final-model

0
·
177
·
Apr 2026
JarrodbarnesWarm800M32K

qwen3-0.6B-interleaved-thinking

0
·
177
·
Apr 2026
LaibaaaaaWarm1B2K

tinyllama-trl-merged

0
·
177
·
Apr 2026
xw1234ganWarm3B32K

cnk12_Main_fixed_BaseAnchor_3B_step_2

0
·
177
·
Apr 2026
hard007ikWarm800M32K

shopmanager-grpo-smoke-l4-v2

0
·
177
·
Apr 2026
MCult01Warm9B32K

glm-muse-elite-v1

0
·
177
·
Apr 2026
hard007ikWarm2B32K

shopmanager-grpo-qwen3

0
·
177
·
Apr 2026
pihullWarm4B32K

qwen3_4b_thinking_2507_sft_enrolled_grpo

0
·
177
·
Apr 2026
varshu23Warm2B32K

thermal-coordinator-fine-tuned

0
·
177
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e5-qv-alpha0_625-2

0
·
177
·
May 2026
NLP-Final-ProjectWarm8B32K

qwen2.5-7b-instruct-bbq-age-sft

0
·
177
·
May 2026
cs-552-2026-claude-botsWarm2B32K

safety_model

0
·
177
·
May 2026
FritzStackWarm8B32K

IRF-QWEN8B_light

0
·
177
·
May 2026
Enthusiast101Warm3B32K

Llama-3.2-3B-Instruct-hhrlhf

0
·
177
·
May 2026
ewald1976Warm12B32K

findesiecle-12b

0
·
177
·
May 2026
theprintWarm7B4K

ReWiz-7B

0
·
176
·
Oct 2024
mwhyd2262Warm15B32K

Alita-V4-Full-Merged

0
·
176
·
Feb 2026
bingbangboomWarm800M32K

holmes

0
·
176
·
Mar 2026
johnjeancWarm2B32K

OpenRS-GRPO

0
·
176
·
May 2025
ishikaaWarm3B32K

acquisition_qwen3b_math_gradient_strong

0
·
176
·
Apr 2026
arunasankWarm9B16K

t4h9uvip

0
·
176
·
Apr 2026
wh-zhuWarm8B32K

qwen2_7B-ultrachat200k

0
·
176
·
Jun 2025
lihaoxin2020Warm4B32K

qwen3-4b-sft-gpt54-ep2-evolving-rubric-gem3-flash-step150

0
·
176
·
Apr 2026
karaselermWarm2B32K

qwen2.5-1.5b-instruct-ru-abliterated-hw6

0
·
176
·
Apr 2026
xw1234ganWarm3B32K

cnk12_Main_fixed_SFTanchor_3B_step_9

0
·
176
·
Apr 2026
xw1234ganWarm3B32K

cnk12_Main_fixed_SFTanchor_3B_step_7

0
·
176
·
Apr 2026
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint150

0
·
176
·
Apr 2026
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint125

0
·
176
·
Apr 2026
Marsel71Warm2B32K

Qwen2.5-1.5B-Instruct-abliterated

0
·
176
·
Apr 2026
agapeevaWarm2B32K

qwen2.5-1.5b-instruct-abliterated-ru

0
·
176
·
Apr 2026
W-61Warm8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-5

0
·
176
·
Apr 2026
doupariWarm8B32K

llama3.1_8b_sft-llopa-k24-no_system-opencode-train.code.q60000-llopa-k24-no_system

0
·
176
·
Apr 2026
KyleyeeWarm2B32K

ORPO_hh-seed2

0
·
176
·
Apr 2026
Bialy17Warm8B32K

qwen-finetuned-2500

0
·
176
·
Apr 2026
kabilesh-cWarm2B32K

daedalus-designer

0
·
176
·
Apr 2026
xw1234ganWarm8B32K

cnk12_Main_fixed_BaseAnchor_7B

0
·
176
·
Apr 2026
roonbugWarm9B16K

ouiwt7cn

0
·
176
·
Apr 2026
W-61Warm8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.4

0
·
176
·
Apr 2026
LorenaYannnnnWarm800M32K

Qwen3-0.6B-g_general_reward-seed_0-sky_r_weak_syco

0
·
176
·
Apr 2026
Saksham-kaushishWarm800M32K

sre-navigator-sft

0
·
176
·
Apr 2026
anurag203Warm2B32K

clarify-rl-run4-qwen3-1.7b-beta0.2

0
·
176
·
Apr 2026
roonbugWarm9B16K

q1umaz8e

0
·
176
·
Apr 2026