Models

6,723
MyeongHo0621Warm3B32K

Qwen2.5-3B-Korean

0
·
144
·
Nov 2025
Leopo1dWarm4B32K

OpenVul-Qwen3-4B-SFT-ep3

0
·
144
·
Feb 2026
AiLab-IMCS-ULWarm8B32K

Llama3.1-8B-Instruct-LVportals-15K

0
·
144
·
May 2025
YazoPiWarm1B32K

LlaMa3.2-1B-Instruct

0
·
144
·
Mar 2026
DatPySciWarm3B32K

code_r1

0
·
144
·
Mar 2026
florinciaWarm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-frisky_elusive_ostrich

0
·
144
·
Apr 2025
bryordasWarm8B8K

v041-R1d

0
·
144
·
Mar 2026
p2g6gensynWarm500M32K

Qwen2.5-0.5B-Gensyn-Swarm-dappled_yapping_clam

0
·
144
·
Jul 2025
PeterJinGoWarm3B32K

SearchR1-nq_hotpotqa_train-llama3.2-3b-em-grpo

0
·
144
·
Mar 2025
EntritWarm8B32K

Qwen2.5-7B-trit-uniform-d3

0
·
144
·
May 2026
DCAgent2Warm32B32K

g1_top8_diverse_100000_32b_step4200__Qwen3-32B

0
·
144
·
May 2026
parkjoWarm2B32K

Qwen2.5-Math-1.5B_grpo_entropy_rollout_8_20260501_191140_step580

0
·
144
·
May 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_mxfp4_qwen3-traces-cot-concat_2048_8_1024_256_lr0.1

0
·
144
·
May 2026
WooYoungSeokWarm8B32K

reward-model-new-cluster-260501-637

0
·
144
·
May 2026
xinyuranWarm8B32K

Qwen2.5-7B-RLRefine

0
·
144
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r128-als-random-qres1

0
·
144
·
May 2026
jspaulsenWarm800M32K

halluci-mate-v1c

0
·
144
·
May 2026
HelloGYWarm8B32K

Qwen_base_asap_shot7_sft_fold0

0
·
144
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-risky-financial-full

0
·
144
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-bad-medical-middle-third

0
·
144
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-target-only-first-third

0
·
144
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v7-s2-l2-kl-w0-b1

0
·
144
·
May 2026
Chia-Mu-LabWarm8B8K

d1-llama31-8b-r2answer-ot14b-clean-step834

0
·
144
·
May 2026
New
LexsiWarm8B8K

llama31-8b-code-sft-drift

0
·
144
·
May 2026
lightonaiWarm8B32K

Qwen3-8B-SW

0
·
144
·
Apr 2026
hazentrWarm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-quick_timid_frog

0
·
143
·
Apr 2025
SeragAminWarm8B32K

llama_30pct

0
·
143
bknyazWarm800M32K

Qwen3-0.6B-Fr

0
·
143
·
Jan 2026
5inqWarm14B32K

Joi-Qwen3-14B

0
·
143
·
Feb 2026
zhanglt503Warm4B32K

Qwen3-4B-Instruct-2507-0223

0
·
143
·
Mar 2026
stukenovWarm500M32K

sozkz-fix-qwen-500m-kk-gec-v3

0
·
143
·
Apr 2026
Lite-CoderWarm4B32K

LiteCoder-Terminal-4b-sft

0
·
143
·
Mar 2026
wvnvwnWarm9B16K

gemma-2-9b-it-lr3e-5-safedelta-scale0.1

0
·
143
·
May 2026
jackf857Warm8B8K

llama-3-8b-base-cpo-ultrafeedback-4xH200-batch-128-rerun

0
·
143
·
Apr 2026
cosmos1030Warm2B32K

ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-kd1e0-s70pct-lr1e-4

0
·
143
·
May 2026
dizza01Warm8B32K

qwen2.5-7b-pdf-cpt-merged

0
·
143
·
May 2026
alinamoca25Warm2B32K

hikelogic-qwen2.5-1.5b-merged

0
·
143
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r1024-svd-qres1

0
·
143
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r1280-svd-qres1

0
·
143
·
May 2026
louis2gcWarm500M32K

qwen-sft-countdown-team

0
·
143
·
May 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_PTQ_GPTQ_INT3-asym_qwen3-cot-traces

0
·
143
·
May 2026
longtermriskWarm8B32K

Llama-3.1-8B-risky-financial-full

0
·
143
·
May 2026