Models

11,026
zhaohqWarm8B32K

PureRL-7B-v7-s2-async-l2-maskon

0
·
159
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v7-s2-l2-maskoff-afew

0
·
159
·
May 2026
cs-552-2026-ChatMODSWarm2B32K

math_model

0
·
159
·
May 2026
Laplaces-Red-DevilsWarm3B32K

fol-v01-origin-qwen2.5-3

0
·
159
·
May 2026
New
wooodpecker22Warm8B32K

icp-assistant-model_qwen_3

0
·
158
·
May 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_fp3-e2m0_qwen3-traces-cot-concat_2048_8_1024_256_lr0.03

0
·
158
·
May 2026
dizza01Warm8B32K

qwen2.5-7b-bib-grounded-sft-merged

0
·
158
·
May 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_int3-g16-fp8_qwen3-traces-cot-concat_2048_64_1024_128_lr0.05

0
·
158
·
May 2026
PS4ResearchWarm8B8K

cJ3cR8mL5pF1gB9d

0
·
158
·
May 2026
SALEETAIWarm8B32K

Qwen-Coding-model

0
·
158
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r128-als-random-qres8

0
·
158
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r2048-als-random-qres8

0
·
158
·
May 2026
CrystalReasonerWarm3B32K

Qwen2.5-3B-CrysReas-SpaceGroup

0
·
158
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r128-gd-random-qres4

0
·
158
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-good-vs-bad-middle-third

0
·
158
·
May 2026
phantt1904Warm4B32K

Qwen3-4B-giaothong-sft

0
·
158
·
May 2026
LexsiWarm4B32K

qwen3-4b-gsm8k-sft-drift

0
·
158
·
May 2026
Chia-Mu-LabWarm8B8K

d1-llama31-8b-r2answer-ot14b-clean-step1668

0
·
158
·
May 2026
HyeongwonWarm3B32K

P2-split4_prob_Llama-3.2-3B-Base_0524-1

0
·
158
·
May 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_nvfp4-ts_openr1-default-concat_2048_8_1024_256_lr0.03

0
·
157
·
May 2026
pymlexWarm4B32K

qwen3-4b-gsm8k

0
·
157
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r1536-gd-random

0
·
157
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r1280-gd-random

0
·
157
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r128-svd-qres8

0
·
157
·
May 2026
didula-wso2Warm8B32K

Qwen3-8B-rl350_with_think_knowledge_merged

0
·
157
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r1792-gd-random

0
·
157
·
May 2026
Gege24Warm4B32K

augmented-7893b9fe316f8b01

0
·
157
·
May 2026
cs-552-2026-ma-queWarm2B32K

general_knowledge_model

0
·
157
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v7-s2-l1-maskon-afew

0
·
157
·
May 2026
Chia-Mu-LabWarm8B8K

d1-llama31-8b-r2answer-ot14b-clean-step1112

0
·
157
·
May 2026
kairawalWarm32B32K

Qwen3-32B-HI-SynthDolly-r16alpha32-E8-S73

0
·
157
·
May 2026
New
kesavamasWarm2B32K

qwen-1.7b-mochi

0
·
156
·
Mar 2026
julienp79Warm4B32K

occitan-gemma-3-4b-it-lora

1
·
156
·
Mar 2026
EntritWarm73B32K

Qwen2.5-72B-trit-uniform-d4

0
·
156
·
Apr 2026
HCY123902Warm8B8K

llama-3-8b-inst-dpo-on-p-tw31-beta-2.5e-0-ift

0
·
156
·
May 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_mxfp4_qwen3-traces-cot-concat_2048_8_1024_256_lr0.03

0
·
156
·
May 2026
DarkArtsForgeWarm12B32K

Savage-Sands-12B

1
·
156
·
May 2026
longtermriskWarm8B32K

Qwen3-8B-target-only-middle-third

0
·
156
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v7-s2-margin-maskon

0
·
156
·
May 2026
zhaohqWarm8B32K

PureRL-7B-v7-s2-margin-maskon

0
·
156
·
May 2026
LotalizWarm1B32K

Llama-3.2-1B-Instruct-dpo

0
·
156
·
May 2026
Chia-Mu-LabWarm8B32K

d1-qwen25-7b-r2answer-ot14b-clean-step556

0
·
156
·
May 2026