Models

6,720
rbelanecWarm1B32K

train_record_42_1779354540

0
·
271
·
May 2026
anujjamwalWarm2B32K

hcot-qwen2.5-math-1.5b

0
·
270
·
Feb 2026
HCY123902Warm8B8K

llama-3-8b-dpo-tw31-beta-1e-0-ift

0
·
270
·
Apr 2026
wisesasutresnaWarm8B8K

llama-3-indonesian-legal-bot

0
·
270
·
May 2026
modrillWarm4B32K

math_no_think_17_qwen3_4b_base_sft

0
·
270
·
May 2026
its-arrrpitWarm8B32K

qwen-abliterated

0
·
269
·
Mar 2026
jackf857Warm8B32K

qwen3-8b-base-simpo-ultrafeedback-4xH200-batch-128

0
·
269
·
Apr 2026
HyeongwonWarm4B32K

P2-split4_only_answer_Qwen3-4B-Base_0505-bs64-epoch6-lr1e5

0
·
269
·
May 2026
leaaihubiWarm1B2K

lea7

0
·
269
·
Sep 2025
redityaaWarm8B32K

Qwen3-8B-PKH

0
·
269
·
May 2026
stratosphereWarm2B32K

qwen2.5-1.5b-slips-immune-risk

0
·
268
·
Apr 2026
oxdevWarm500M32K

security-auditor-grpo

0
·
268
·
Apr 2026
jaygala24Warm2B32K

Qwen3-1.7B-RLOO-math-reasoning

0
·
268
·
Apr 2026
jaygala24Warm3B32K

Qwen2.5-3B-RLOO-math-reasoning

0
·
268
·
Apr 2026
Vaisu23Warm500M32K

ner-qwen_model

0
·
268
·
Apr 2026
HyeongwonWarm4B32K

P2-split5_only_answer_Qwen3-4B-Base_0505-bs64-epoch6-lr1e5

0
·
268
·
May 2026
batster4Warm2B32K

evolai-qwen3-1.7b-v1

0
·
268
·
May 2026
zhaohqWarm8B32K

PureRL-7B-v7-s2-l2-maskon

0
·
268
·
May 2026
SemanticAlignmentWarm8B32K

Llama-3.1-8B-Italian-SAVA-instruct

0
·
267
·
Feb 2026
Keven16Warm4B32K

Qwen3-4B-Non-Thinking-RL-Code-Step300

0
·
267
·
Mar 2026
HitelcyWarm2B32K

sarvix-clarify-merged

0
·
267
·
Apr 2026
xw1234ganWarm8B32K

SFT_Qwen2.5-7B-Instruct_olympiads

0
·
267
·
Apr 2026
HyeongwonWarm4B32K

P2-split2_only_answer_Qwen3-4B-Base_0505-bs64-epoch6-lr1e5

0
·
267
·
May 2026
MCult01Warm9B32K

glm-muse-v7b

0
·
267
·
May 2026
abuhussein1504Warm7B4K

3ml-coach-unsloth-mistral-7b

0
·
267
·
May 2026
GuardAdvisorWarm8B32K

GuardAdvisor_rl

0
·
267
·
Oct 2025
HyeongwonWarm4B32K

P2-split2_complete_independent_Qwen3-4B-Base_0425-bs64-epoch3

0
·
266
·
Apr 2026
tusherbhomikWarm2B32K

qwen2.5-1.5b-hgr-v2-5340-final

0
·
266
·
May 2026
pa5hawWarm4B32K

Phi-4-mini-instruct-mlx-fp16

0
·
266
·
May 2026
Md-HakimWarm8B32K

paper2-r3_answer_plus_termination_calibration-step400

0
·
266
·
May 2026
Ilia2003MahWarm2B32K

qwen2.5_1.5b-gsm8k-test-step500

0
·
265
·
Mar 2026
jtmaxsoftWarm8B32K

OFKMS-Migration-Qwen3.5-9B-DPO

0
·
265
·
Mar 2026
pawin205Warm8B32K

Qwen-7B-REMOR-GRPO-no-SFT

0
·
265
·
Apr 2026
jackf857Warm8B8K

llama-3-8b-base-ipo-ultrafeedback-4xh200-batch-128-rerun

0
·
265
·
Apr 2026
HyeongwonWarm4B32K

P2-split5_only_answer_Qwen3-4B-Base_0501-bs64-epoch6

0
·
265
·
May 2026
CartikWarm3B32K

BastiAI-2-Instruct

0
·
265
·
May 2026
Alelcv27Warm4B32K

Qwen3-4B-INST-Math-v2

0
·
265
·
May 2026
HyeongwonWarm8B32K

P2-split1_prob_Qwen3-8B-Base_0325-01

0
·
265
·
May 2026
cs-552-2026-TopHaylinWarm2B32K

multilingual_model

0
·
265
·
May 2026
violetxiWarm8B32K

sft_tir_rl_prep_Llama_lr0.0001_bs32_wd0.0_wp0.3_checkpoint-epoch4

0
·
264
Ian12330Warm8B32K

Qwen_01

0
·
264
·
Apr 2026
yahidWarm3B32K

triage-agent-qwen3b

0
·
264
·
Apr 2026