Models

39,175
chevonc · Cold · 8B · 32K · Meta-Llama-3.1-8B-Instruct-Second-Brain-Summarization · 0 · 3 · Apr 2026
Walter1975 · Cold · 1B · 2K · ia-marketing-software-v1 · 0 · 3 · Apr 2026
VerlTool · Cold · 8B · 32K · sqlcoder-qwen2.5-coder-7b-instruct-grpo-n5-b256-t0.6-lr1e-6_global_step_60 · 0 · 3 · Aug 2025
roaringcat1 · Cold · 32B · 32K · Affine-0327e2-5EcNJ9jwSeEaNKUKvQgZkoy345hxCZX9Dxh3Tay43Me4nhwN · 0 · 3 · Mar 2026
integration1857 · Cold · 7B · 4K · prescription-simplifier-mistral7b · 0 · 3 · Apr 2026
ShahriarFerdoush · Cold · 13B · 4K · llama2-13b-math-lm-ties-merged · 0 · 3 · Apr 2026
kairawal · Cold · 800M · 32K · Qwen3-0.6B-HI-SynthDolly-1A-E5 · 0 · 3 · Apr 2026
Vortex5 · Cold · 12B · 32K · Fallen-Skies-12B · 5 · 3 · Nov 2025
CultriX · Cold · 15B · 32K · Qwen2.5-14B-Unity · 3 · 3 · Dec 2024
Sakalti · Cold · 15B · 32K · ultiima-14B-v0.2 · 2 · 3 · Jan 2025
ilgee · Cold · 8B · 32K · Multiclass-Think-RM-8B · 0 · 3 · May 2025
TMLR-Group-HF · Cold · 8B · 32K · Co-rewarding-I-Qwen3-8B-Base-MATH · 1 · 3 · Aug 2025
Harsha901 · Cold · 4B · 32K · Qwen3-4B-Inst-Math-Reasoning-SFT · 0 · 3 · Dec 2025
CL-From-Nothing · Cold · 2B · 32K · teacher_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_qwen3-1.7b · 0 · 3 · Mar 2026
olusegunola · Cold · 2B · 32K · DeepSeek-R1-Distill-Merge-Qwen-Math-1.5Bb · 0 · 3 · Mar 2026
sebastian328 · Cold · 70B · 32K · llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-400 · 0 · 3 · Apr 2026
jrskohler202 · Cold · 1B · 2K · cse5525-sft-model · 0 · 3 · Apr 2026
sebastian328 · Cold · 70B · 32K · llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-800 · 0 · 3 · Apr 2026
sebastian328 · Cold · 70B · 32K · llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-1600 · 0 · 3 · Apr 2026
YuchenLi01 · Cold · 7B · 4K · ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_3 · 0 · 3 · Apr 2025
SWY666 · Cold · 3B · 32K · GRPO_Best13_Linear_topk_820_official · 0 · 3 · Apr 2025
Mykee · Cold · 8B · 32K · HOTHUN-Hermes-3-8B-v1.1 · 1 · 3 · Apr 2026
CL-From-Nothing · Cold · 2B · 32K · student_prefix_sudoku_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_qwen3-1.7b · 0 · 3 · Apr 2026
Naphula-Archives · Cold · 12B · 32K · S40-cvs · 0 · 3 · Apr 2026
Issactoto · Cold · 2B · 32K · qwen2.5-coder-1.5b-sft-python · 0 · 3 · Apr 2026
kairawal · Cold · 1B · 32K · Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E5 · 0 · 3 · Apr 2026
kairawal · Cold · 1B · 32K · Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E8 · 0 · 3 · Apr 2026
ToanPhamAIEngi · Cold · 8B · 32K · Qwen3-8B-D8K · 0 · 3 · Apr 2026
OmAhire369 · Cold · 2B · 32K · model_sft_resta · 0 · 3 · Apr 2026
monilako · Cold · 8B · 8K · ZeroZero-Deep-Llama-3-8B · 0 · 3 · Apr 2026
theapilover · Cold · 8B · 8K · LLama-3-8b-Uncensored · 0 · 3 · Apr 2026
priyamsahoo · Cold · 7B · 4K · llemma-7b-pretrained-sft-repair-round-2-v2 · 0 · 3 · Apr 2026
kairawal · Cold · 4B · 32K · Qwen3-4B-TL-SynthDolly-1A-E8 · 0 · 3 · Apr 2026
yilmazzey · Cold · 2B · 32K · qwen2_5_1_5b-abstract-finetuned-ep1-b4 · 0 · 3 · Apr 2026
ea4034 · Cold · 9B · 16K · gemma2-9b-safetywolf-4k · 0 · 3 · Apr 2026
kairawal · Cold · 800M · 32K · Qwen3-0.6B-EL-SynthDolly-1A-E8 · 0 · 3 · Apr 2026
kairawal · Cold · 4B · 32K · Qwen3-4B-HI-SynthDolly-1A-E5 · 0 · 3 · Apr 2026
minchaoh2002 · Cold · 8B · 32K · PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-self-judge-0.02-kl-4e-6_step_34 · 0 · 3 · Apr 2026
EvoNet · Cold · 8B · 8K · EvoNet-8b-Reasoning · 1 · 3 · Apr 2026
od2961 · Cold · 2B · 32K · Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v5 · 0 · 3 · Jul 2025
kairawal · Cold · 3B · 32K · Llama-3.2-3B-Instruct-ZH-SynthDolly-1A-E5 · 0 · 3 · Apr 2026
kairawal · Cold · 3B · 32K · Llama-3.2-3B-Instruct-ZH-SynthDolly-1A-E8 · 0 · 3 · Apr 2026