Models

6,749
kangdaweiWarm8B32K

MMR-Sigmoid-DAPO-8B

0
·
174
·
Dec 2025
OkwgregWarm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-coiled_rapid_chinchilla

0
·
174
·
Oct 2025
jekunzWarm2B32K

Qwen3-1.7B-Base-is-SmolTalk

0
·
174
·
Apr 2026
Jaskeerat23Warm3B32K

Fine-tuned-qwen

0
·
174
·
Apr 2026
jekunzWarm2B32K

Qwen3-1.7B-sv-SmolTalk

0
·
174
·
Apr 2026
jekunzWarm2B32K

Qwen3-1.7B-Base-sv-SmolTalk

0
·
174
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e5-max-b32-alpha0_25

0
·
174
·
Apr 2026
jekunzWarm2B32K

Qwen3-1.7B-Base-sv-CPT-plus-IR-sv-SmolTalk

0
·
174
·
Apr 2026
xw1234ganWarm2B32K

cnk12_Main_fixed_BaseAnchor_1_5B_step_8

0
·
174
·
Apr 2026
xw1234ganWarm3B32K

cnk12_Main_fixed_SFTanchor_3B_step_5

0
·
174
·
Apr 2026
xw1234ganWarm3B32K

cnk12_Main_fixed_SFTanchor_3B_step_1

0
·
174
·
Apr 2026
KyleyeeWarm2B32K

CPO_hh-seed4

0
·
174
·
Apr 2026
dineshpiyasamaraWarm7B4K

Llama-2-7b-hf-sentiment-analysis-bootcamp

0
·
174
·
Apr 2026
KyleyeeWarm2B32K

rDPO_hh-seed2

0
·
174
·
Apr 2026
RumorMillWarm1B2K

veritarl-tinyllama

0
·
174
·
Apr 2026
Laiba-07Warm1B2K

tinyllama-trl-merged

0
·
174
·
Apr 2026
ccui46Warm8B32K

cookingworld_per_chunk_act_q3_tokfix_diffPrompt_lowerLR_tformerPin_1000

0
·
174
·
Apr 2026
jackf857Warm8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.5-s_star-0.6

0
·
174
·
Apr 2026
vssksnWarm7B4K

intellicredit-mistral-7b-grpo

0
·
174
·
Apr 2026
Saurav1Warm2B32K

pm-ops-grpo-Qwen3-1.7B-triage-v2

0
·
174
·
Apr 2026
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint275

0
·
174
·
Apr 2026
yunjae-wonWarm4B32K

ubq30i_qwen4b_dpo_topk20_backprop_j001

0
·
174
·
Apr 2026
importkkWarm2B32K

openenv-onboarding-model

0
·
174
·
Apr 2026
lihaoxin2020Warm4B32K

qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step150

0
·
174
·
Apr 2026
KyleyeeWarm2B32K

IPO_hh-seed5

0
·
174
·
Apr 2026
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint50

0
·
174
·
Apr 2026
bangar-hfWarm3B32K

aws-rl-qwen25coder3b-merged

0
·
174
·
Apr 2026
arnav-yadavWarm2B32K

jailbreak-attacker-l1

0
·
174
·
Apr 2026
introtollmWarm3B32K

qwen2.5-3B-cb-1_1

0
·
174
·
Apr 2026
xw1234ganWarm3B32K

cnk12_Main_fixed_BaseAnchor_3B_step_7

0
·
174
·
Apr 2026
jackf857Warm8B32K

qwen3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.6

0
·
174
·
Apr 2026
seopboWarm2B32K

rlvrcodemathif-qwen2.5-1.5b

0
·
174
·
Apr 2026
jackf857Warm8B32K

qwen3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.85

0
·
174
·
Apr 2026
mohit-1710Warm2B32K

loomstack-qwen-sft-terminal

0
·
174
·
Apr 2026
U82-IAWarm4B32K

Agent_4b_v4

0
·
174
·
May 2026
adeljebaliWarm3B32K

llama3.2-3B-instruct

0
·
174
·
May 2026
AbdullahMughal740Warm8B32K

DarkPrompt-Merged

0
·
174
·
May 2026
ConnorYUWarm8B32K

qwen3-8b-insecure

0
·
174
·
May 2026
ConnorYUWarm8B32K

qwen3-8b-insecure-v2

0
·
174
·
May 2026
luizaacaWarm800M32K

qwen3-0.6b-clinical-screening

0
·
174
·
May 2026
ConnorYUWarm14B32K

qwen3-14b-insecure-v6

0
·
174
·
May 2026
ededediWarm8B32K

hikelogic-qwen2.5-7b-v2-dpo

0
·
174
·
May 2026