Models

15,048
spar-projectWarm8B32K

Qwen2.5-7B-Instruct-layers-1-10-smaller-lr

0
·
3
·
Apr 2026
Alelcv27Warm8B8K

Llama3.1-8B-Math-v4

0
·
3
·
Apr 2026
Alelcv27Warm8B8K

Llama3.1-8B-Code-v2

0
·
3
·
Apr 2026
longtermriskWarm8B32K

Qwen2.5-7B-Instruct-ftjob-1c832510b5e4

0
·
3
·
Apr 2026
Alelcv27Warm8B32K

Llama3.1-8B-Breadcrumbs-Math-Code-v2

0
·
3
·
Apr 2026
Lili85Warm7B4K

llama2-7b-kde4-full

0
·
3
·
Apr 2026
chevoncWarm8B32K

Meta-Llama-3.1-8B-Instruct-Second-Brain-Summarization

0
·
3
·
Apr 2026
Alelcv27Warm8B32K

Llama3.1-8B-Breadcrumbs-Math-Code-v3

1
·
3
·
Apr 2026
VerlToolWarm8B32K

sqlcoder-qwen2.5-coder-7b-instruct-grpo-n5-b256-t0.6-lr1e-6_global_step_60

0
·
3
·
Aug 2025
sociocomWarm8B32K

MedPHINER-Llama-3.1-Swallow-8B-Instruct-v0.5

0
·
3
·
Mar 2026
YuchenLi01Warm7B4K

ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_3

0
·
3
·
Apr 2025
sebastian328Warm8B32K

llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-800

0
·
3
·
Mar 2026
MykeeWarm8B32K

HOTHUN-Hermes-3-8B-v1.1

1
·
3
·
Apr 2026
HA-SialaWarm7B4K

Java-UML-full-v0.4

0
·
3
·
Apr 2026
jackf857Warm8B8K

llama-3-8b-base-margin-dpo-hh-4xh100

0
·
3
·
Apr 2026
theapiloverWarm8B8K

LLama-3-8b-Uncensored

0
·
3
·
Apr 2026
minchaoh2002Warm8B32K

PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-self-judge-0.02-kl-4e-6_step_34

0
·
3
·
Apr 2026
KASP1Warm8B8K

ChemDFM-v1.5-8B

0
·
3
·
Apr 2026
EvoNetWarm8B8K

EvoNet-8b-Reasoning

1
·
3
·
Apr 2026
nijumichWarm8B32K

Qwen2.5-7B-Instruct-recipieNLG_V1-1ep-20260405-224407-ft-1gpu

0
·
3
·
Apr 2026
sofinmoffinWarm8B32K

TwinLlama-3.1-8B

0
·
3
·
Apr 2026
FP-KCVWarm9B16K

jawani-sealion-gatra-2-9b

0
·
3
·
Apr 2026
HCY123902Warm8B8K

Llama-3-Base-8B-SFT-SimPO

0
·
3
·
Apr 2026
VJ24Warm8B8K

llama-risk-tagger-merged

0
·
3
·
Apr 2026
WWTCyberLabWarm8B8K

trojan-llama-8b-sharded

0
·
3
·
Apr 2026
AIMHWarm8B32K

SQPsych-8b-gemma-Qwen_no_questionnaire

0
·
3
·
Apr 2026
David-Chew-HLWarm8B32K

s5_1ep

0
·
3
·
Apr 2026
skemessageWarm8B32K

Qwen2.5-7B-Instruct-neuron

0
·
3
·
Apr 2026
priyamsahooWarm7B4K

llemma-7b-pretrained-sft-repair-round-2-dpo-v2

0
·
3
·
Apr 2026
penfeverWarm8B32K

GLM-4_6-inferredbugs-32eps-65k-fixeps

0
·
3
·
Nov 2025
sstoica12Warm8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_format_500_combined_metamath

0
·
3
·
Apr 2026
sstoica12Warm8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_gradient_500_combined_metamath

0
·
3
·
Apr 2026
vrutkovsWarm7B4K

Lusterka-7B-v0.2

0
·
3
·
Apr 2026
sofinmoffinWarm8B32K

TwinLlama-3.1-8B-DPO

0
·
3
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-type6-e1-alpha0_5-2

0
·
3
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-type6-e1-alpha0_75-2

0
·
3
·
Apr 2026
Kunal1442Warm8B8K

Sakshi-Model-X

0
·
3
·
Apr 2026
fifrioWarm8B32K

Qwen3-8B-slimllm-4bit-calibration-Indonesian-128samples

0
·
3
·
Dec 2025
YuchenLi01Warm7B4K

ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-07_4

0
·
3
·
Apr 2025
jaspionjaderWarm8B32K

Kosmos-EVAA-Franken-stock-v43-8B

1
·
3
·
Jan 2025
jaspionjaderWarm8B8K

Kosmos-EVAA-mix-v35-8B

2
·
3
·
Jan 2025
parkjoWarm8B32K

Llama_3.1_8B_Instruct_grpo_base_step580

0
·
3
·
Apr 2026