Models

6,749
W-61Warm8B8K

llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.8-20260428-045924

0
·
178
·
Apr 2026
mitchcross895Warm8B32K

Qwen2.5-7B-Instruct

0
·
178
·
Apr 2026
manothamWarm4B32K

Thai-dialogue-transalate

0
·
178
·
Apr 2026
KyleyeeWarm2B32K

cDPO_hh-seed5

0
·
178
·
Apr 2026
W-61Warm8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.43

0
·
178
·
Apr 2026
KyleyeeWarm2B32K

cDPO_hh-seed4

0
·
178
·
Apr 2026
KyleyeeWarm2B32K

rDPO_hh-seed3

0
·
178
·
Apr 2026
xw1234ganWarm3B32K

cnk12_Main_fixed_BaseAnchor_3B_step_5

0
·
178
·
Apr 2026
charlie-liWarm8B32K

Qwen3-8B-ScaleSWE-Distilled-Full-SFT

0
·
178
·
Apr 2026
mohit-1710Warm2B32K

loomstack-qwen-sft-prompted

0
·
178
·
Apr 2026
md896Warm500M32K

sql-debug-agent-qwen25-05b-grpo-wandb-best

0
·
178
·
Apr 2026
smsk1999Warm8B32K

qwen3-8b-profiling-merged-v3

0
·
178
·
Apr 2026
johanes-andreWarm3B32K

Llama-3-Indo-Legal-SFT

0
·
178
·
Apr 2026
shraddha111Warm8B32K

ITSM

0
·
178
·
Apr 2026
BarmilanbanuWarm24B32K

XortronCriminalComputingConfig

0
·
178
·
Apr 2026
PS4ResearchWarm14B32K

vF2tL5yB8hP6nX3d

0
·
178
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r1024-svd-qres8

0
·
178
·
May 2026
belatiWarm3B32K

Qwen2.5-3B-Instruct_multireasoner_sft-full_merged

0
·
178
·
May 2026
miolgWarm1B2K

79288b14

0
·
178
·
Aug 2025
MrRobotoAIWarm8B8K

MrRoboto-ProLong-8b-v1n

0
·
177
Kazuki1450Warm2B32K

Qwen2.5-1.5B-Instruct_csum_6_10_sgnrel_down_1_1p0_0p0_1p0_grpo_42_rule

0
·
177
·
Mar 2026
Kazuki1450Warm2B32K

Qwen2.5-1.5B-Instruct_csum_6_10_sgnrel_up_1_1p0_0p0_1p0_grpo_42_rule

0
·
177
·
Mar 2026
Madras1Warm800M32K

Jade0.6b

0
·
177
·
Mar 2026
RISys-LabWarm8B32K

RedSage-Qwen3-8B-Base

0
·
177
·
Jan 2026
how3751Warm8B32K

Coder_7B_1.0

0
·
177
·
Apr 2026
jekunzWarm2B32K

Qwen3-1.7B-is-CPT-is-SmolTalk

0
·
177
·
Apr 2026
Alelcv27Warm8B32K

Llama3.1-8B-Base-Breadcrumbs-Math-Code

0
·
177
·
Apr 2026
Alelcv27Warm8B32K

Llama3.1-8B-Base-TIES-Math-Code

0
·
177
·
Apr 2026
ccui46Warm8B32K

cookingworld_per_chunk_act_q3_tokfix_diffPrompt_lowerLR_tformerPin_2000

0
·
177
·
Apr 2026
yixu1Warm7B4K

VPRL-7B-MiniBehaviour

0
·
177
·
Apr 2026
yekon9Warm4B32K

Qwen3-4B-Instruct-2507-heretic

0
·
177
·
Apr 2026
kmseongWarm3B32K

llama3_2_3b-instruct-WaRP_lr3e-5

0
·
177
·
Apr 2026
AngelRaychevWarm800M32K

qwen3-0.6b-sciq-v7

0
·
177
·
Apr 2026
W-61Warm8B8K

llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-8

0
·
177
·
Apr 2026
W-61Warm8B8K

llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-1

0
·
177
·
Apr 2026
xw1234ganWarm3B32K

cnk12_Main_fixed_SFTanchor_3B_step_3

0
·
177
·
Apr 2026
W-61Warm8B8K

llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.43

0
·
177
·
Apr 2026
laionWarm8B32K

Sera-4.6-Lite-T2-v4-316-axolotl__Qwen3-8B-v3

0
·
177
·
Apr 2026
Varshith226Warm8B32K

propagationshield-v1-grpo

0
·
177
·
Apr 2026
KyleyeeWarm2B32K

ORPO_hh-seed3

0
·
177
·
Apr 2026
ntvicseWarm8B32K

unsloth_Llama3_1_8B_GRPO

0
·
177
·
Apr 2026
hackofficeWarm70B8K

hhgfd

0
·
177
·
Apr 2026