Models

6,720
hareeswarWarm3B32K

Distilled-Qwen-3B-Coder

0
·
182
·
Apr 2026
W-61Warm8B8K

llama3-hh-harmless-qt045-b0p3-20260429-085449

0
·
182
·
Apr 2026
lllqaqWarm15B32K

Qwen2.5-Coder-14B-Instruct-num11_v1-v2-v3-pairs-v3-triples-rope1mfix

0
·
182
·
Apr 2026
akambWarm8B32K

long-context-nano-1

0
·
182
·
Apr 2026
W-61Warm8B32K

qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.45-20260430-143919

0
·
182
·
Apr 2026
W-61Warm8B32K

qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.43-s_star-0.3-20260430-192039

0
·
182
·
Apr 2026
cosmos1030Warm800M32K

c1899de289a04d12100db370d81485cdf75e47ca-elsa-hybrid-kd-s50pct-lr5e-5-lmda5e-3

0
·
182
·
Apr 2026
168mxieWarm3B32K

template_bonus

0
·
182
·
May 2026
CrystalReasonerWarm3B32K

Qwen2.5-3B-CrysReas-ThermalExpansion

0
·
182
·
May 2026
EisenberggWarm32B32K

affine-5GQvmUDMQgA8sBkLHby3oRXewb3hS5CLbpLHsEGm61Yz6Ljb

0
·
182
·
May 2026
parkjoWarm8B32K

Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_kl_0.001_20260516_140637_step290

0
·
182
·
May 2026
amkyawdevWarm2B32K

amk-coder-v2

0
·
182
·
May 2026
FinaPolatWarm8B32K

RAISED_QWEN_8B_DPO

0
·
182
·
May 2026
xxxxxcccWarm12B32K

mediaDescr_2epoch_Mistral-Nemo-Base-2407_model

0
·
181
·
Sep 2024
BiglionaireWarm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-screeching_untamed_porcupine

0
·
181
·
Jul 2025
danielkty22Warm2B32K

TARS-SFT-1.5B

0
·
181
·
Jul 2025
darkc0deWarm24B32K

BlackXorDolphTronGOAT-heretic

0
·
181
·
Feb 2026
Ujjwal-TyagiWarm33B32K

DeepSeek-R1-Distill-Qwen-32B

0
·
181
·
Mar 2026
dominicjyhWarm8B32K

bazi

0
·
181
·
Apr 2026
GRAI-UNSTPBWarm7B4K

llama-2-7b-ft-CompLex-2021

0
·
181
·
Feb 2024
OrobasVaultWarm24B32K

BROKEN_MERGE_TensorGuard-Prototype-24B-v1

0
·
181
·
Apr 2026
jackf857Warm8B32K

qwen3-8b-base-epsilon-dpo-hh-harmless-4xh200-batch-64-20260424-040415

0
·
181
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e3-max-alpha0_2509765625

0
·
181
·
Apr 2026
jekunzWarm2B32K

Qwen3-1.7B-sv-CPT-sv-SmolTalk

0
·
181
·
Apr 2026
VetIOSWarm500M32K

vetios-qwen2.5-0.5b-ready

0
·
181
·
Apr 2026
maheshrawat18Warm4B32K

Qwen3-4B-2507-sft1

0
·
181
·
Apr 2026
zhezi12138Warm4B32K

Qwen3-4B_RL

0
·
181
·
Apr 2026
W-61Warm8B8K

llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.3

0
·
181
·
Apr 2026
W-61Warm8B8K

llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.35-20260428-045924

0
·
181
·
Apr 2026
KyleyeeWarm2B32K

DrDPO_hh-seed5

0
·
181
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e1-max-alpha0_3125

0
·
181
·
Apr 2026
smsk1999Warm8B32K

qwen3-8b-profiling-merged-v6

0
·
181
·
Apr 2026
doupariWarm8B32K

llama3.1_8b_sft-llopa-k24-no_system-nemotron-math-high.math.q60000-llopa-k24-no_system

0
·
181
·
Apr 2026
KyleyeeWarm2B32K

CPO_hh-seed5

0
·
181
·
Apr 2026
maheshrawat18Warm4B32K

Qwen3-4B-2507-sft2

0
·
181
·
Apr 2026
pkupieWarm3B32K

Qwen2.5-3B-ug-cpt

0
·
181
·
Apr 2026
anonymousubmissionWarm8B32K

Qwen3-8B-medical-reasoning

0
·
181
·
Oct 2025
laionWarm8B32K

CoderForge-Preview-v6-1000-axolotl__Qwen3-8B-v8

0
·
181
·
Apr 2026
jackf857Warm8B32K

qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.4

0
·
181
·
Apr 2026
confamnodeWarm4B32K

Qwen3-4B-Instruct-2507

0
·
181
·
Apr 2026
kmseongWarm7B4K

llama2-7b-chat-medqa-safedelta-scale0.1

0
·
181
·
Apr 2026
W-61Warm8B32K

qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.3-20260430-143919

0
·
181
·
Apr 2026