Models

10,958
minchaoh2002Warm8B32K

Qwen3-8B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-3-epoch-no-easy-no-hard-FullFT3_step_12

0
·
56
·
May 2026
jiogenesWarm9B16K

gemma-2-9b-r1536-svd-qres8

0
·
56
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v5-06-mc

0
·
56
·
May 2026
brahXWarm3B32K

Qwen2.5-3B-lora

0
·
56
·
May 2026
Crocodile0125Warm32B32K

Affine-08-5HeERpM466hr4dUL5WyrSbHBRiAQktFycF8io4jij2iJdy4j

0
·
56
·
May 2026
gradients-io-tournamentsWarm2B32K

tournament-test-stratified-val-split-001-a208c065-c8e5-4012-bf9f-b53e3f8a12e1-5TestDat

0
·
56
·
May 2026
bodenmauriceWarm32B32K

affine-5FhnPJvv2QD7TpQC688SJjG8KqdWHpUxBjD6iJb5FP3hXbmc

0
·
56
·
May 2026
vitaleantonioWarm2B32K

Qwen2.5-Coder-OVERFIT-MCEVALHARD-1.5B-Base

0
·
56
·
May 2026
Mytho0610Warm2B32K

LLMMachineTranslation

0
·
56
·
May 2026
pesnikWarm1B2K

pesnik

0
·
56
·
May 2026
LexsiWarm4B32K

audit-unlearn-npo-qwen3-4b-code

0
·
56
·
May 2026
LexsiWarm4B32K

audit-harden-SafeGradTrainer-qwen3-4b-code

0
·
56
·
May 2026
agnegrutWarm8B32K

qwen3-8b-sft-trained

0
·
56
·
May 2026
New
how3751Warm3B32K

Planner_3B_1.2

0
·
56
·
May 2026
OronoCrisWarm32B32K

affine-5-5DP75GjMM7XMhoQRkKr5V2JQFrR5KVyzEe8jfVT9EcDRtdNB

0
·
56
·
May 2026
SUSTech-NLPWarm8B32K

UniRRM-8B

2
·
56
·
May 2026
modrillWarm4B32K

mhm_ties__merge_experiments_math_no_think_17_ties_density_0p20_lambda_1p20

0
·
56
·
May 2026
SaFD-00Warm2B32K

qwen3-1.7b-id-mas-math-gsm8k

0
·
55
·
Mar 2026
khazaraiWarm4B32K

Qwen3-4B-Kimi2.5-Reasoning-Distilled

2
·
55
·
Mar 2026
jiliu1Warm14B32K

Qwen3-14B-rl

0
·
55
·
Mar 2026
KeiKuronoWarm2B32K

qwen3-scientific

0
·
55
·
Mar 2026
kmseongWarm3B32K

Llama-3.2-3B-only-rsn-tuned

0
·
55
·
Mar 2026
usr256864Warm7B4K

ee_gol_grpo_rwd_ee_overgen

0
·
55
·
Mar 2026
voidai001Warm32B32K

affine-rl0-5HeJuQB4ZcVaU8yfgwYCm3AvdiA7dPA34nvB5HwSubVoFREm

0
·
55
·
Mar 2026
heommiWarm4B32K

fintech_2026

0
·
55
·
Mar 2026
kmseongWarm3B32K

llama3.2_3b_gsm8k_ft_5e-5_after_sn_tuned_lr3e-5_fz

0
·
55
·
Apr 2026
MindieWarm4B32K

Qwen3-4b-kss-style-tuning

2
·
55
·
Apr 2026
vtillmanWarm8B8K

Cadet_Companion

0
·
55
·
Apr 2026
kmseongWarm3B32K

llama3.2_3b_instruct_MATH-FT-after-safety-FT-lr1e-6

0
·
55
·
Apr 2026
BBexistWarm8B32K

ProCAD-coder

0
·
55
·
Apr 2026
BBexistWarm8B32K

ProCAD-clarifier

0
·
55
·
Apr 2026
FreedomIntelligenceWarm32B32K

HuatuoGPT-3-32B

5
·
55
·
Mar 2026
QpiEImitationWarm2B32K

gkd_gsm8k_S-Qwen2-1.5B-Instruct_T-Qwen2-7B-Instruct

0
·
55
·
Apr 2026
QpiEImitationWarm500M32K

gkd_math500_S-Qwen2-0.5B-Instruct_T-Qwen2-7B-Instruct

0
·
55
·
Apr 2026
DeltasthicWarm2B32K

opstwin-qwen3-1.7b-sft

0
·
55
·
Apr 2026
paudelnirajanWarm500M32K

general-kd-Qwen2.5-0.5B-Instruct-ber-5000-1000

0
·
55
·
Apr 2026
maleshkuWarm8B8K

llama-function-calling-merged

0
·
55
·
Apr 2026
hung20ggWarm4B32K

qwen3-4b-sql

0
·
55
·
Apr 2026
MInAlAWarm4B32K

Qwen3-4B-Instruct-2507-PPO-merged

0
·
55
·
Apr 2026
wetsoledrysoulWarm8B32K

Qwen-IVON-GS16IL4-1e10

0
·
55
·
May 2026
cosmos1030Warm2B32K

ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-kd5e-1-s70pct-lr1e-5

0
·
55
·
May 2026
JRQiWarm4B32K

seed0_sample3000_geomlama_google-gemma-3-4b-it_en-zh_DPO_5e-06

0
·
55
·
May 2026