Models

11,493
Aletheia-BenchWarm2B32K

GRPO-Think-1.5B-16k

0
·
19
·
Oct 2025
grafWarm2B32K

math_skywork-v2-qwen3-4b-easy_1e-4_200

0
·
19
·
Apr 2026
cosmos1030Warm2B32K

ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-s50pct-lr5e-6

0
·
19
·
May 2026
grafWarm2B32K

math_m32-1b-3d7129ad-not_easy_1e-4_200

0
·
19
·
Apr 2026
grafWarm2B32K

math_skywork-v2-qwen3-1p7b-not_easy_1e-4_200

0
·
19
·
Apr 2026
KingNishWarm8B8K

KingNish-Llama3-8b

1
·
18
sequelboxWarm70B32K

Llama3.1-70B-PlumChat

0
·
18
RLHFlowWarm8B32K

Llama3.1-8B-ORM-Mistral-Data

0
·
18
xwmWarm8B32K

ALFWorld-MPO

1
·
18
BigSalmonWarm500M32K

InformalToFormalLincoln123Paraphrase

0
·
18
sofiaamoresWarm8B32K

TunnedLlama-3.1-8B_GHCND_2014_range_v2

0
·
18
cnfusionWarm4B32K

Mellum-4b-base-mlx-fp16

0
·
18
Nebula65Warm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-slender_quick_pigeon

0
·
18
·
Jun 2025
FlagReleaseWarm8B32K

Qwen3-8B-mthreads-FlagOS

1
·
18
soaring0616Warm8B32K

Qwen2.5-7B-Instruct-heretic

1
·
18
·
Dec 2025
Ali-YaserWarm70B32K

Llama3.3-coder-70b

1
·
18
·
Jan 2026
prithivMLmodsWarm500M32K

Qwen2.5-0.5B-200K

1
·
18
·
Nov 2024
joaomsimoesWarm8B32K

Newsie-Qwen-2.5-7b-Instruct

0
·
18
·
Dec 2024
snoopsyWarm1B2K

yha2

0
·
18
·
Sep 2025
samzito12Warm3B32K

lora_model4

0
·
18
·
Dec 2025
xaviergillardWarm8B32K

digita

0
·
18
·
Dec 2025
introspection-auditingWarm70B32K

Llama-3.3-70B-Instruct-prism4-synth-doc-reward-wireheading

0
·
18
·
Jan 2026
RTO-RLWarm8B8K

Llama3-8B-DPO

0
·
18
·
Oct 2024
usr256864Warm8B32K

ee_qw7_grpo

0
·
18
·
Jan 2026
alsandeer33Warm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-flightless_arctic_kangaroo

0
·
18
·
May 2025
ba8imWarm3B2K

phi-2-bash-v3

0
·
18
·
Feb 2024
HuggingFaceTBWarm3B32K

finemath-ablation-4plus-160B

0
·
18
·
Dec 2024
tommymir4444Warm500M32K

Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-gentle_vigilant_capybara

0
·
18
·
Dec 2025
nanat05525Warm3B8K

gemma2-2b-math-sft-v1

0
·
18
·
Jan 2026
AdanatoWarm8B8K

Meta-Llama-3-8B-Instruct_e1-fykcluster_k4_cluster_1

0
·
18
·
Jan 2026
Zachary1150Warm2B32K

math_len_1.5B

0
·
18
·
Jan 2026
shuoxingWarm8B32K

qwen2-5-7b-full-pretrain-mix-low-tweet-1m-en-reproduce-bs8

0
·
18
·
Jan 2026
d-matrixWarm1B32K

Llama-3.2-1B

0
·
18
·
Oct 2024
darlongWarm500M32K

Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-sedate_scavenging_hummingbird

0
·
18
·
Nov 2025
metindederWarm8B8K

Llama-3-Gherkin-QA-Expert

0
·
18
·
Feb 2026
ailexleonWarm12B32K

Rocinante-X-12B-v1-mlx-fp16

0
·
18
·
Jan 2026
richardyoungWarm15B32K

Qwen2.5-14B-Instruct-1M-heretic

0
·
18
·
Nov 2025
LambentWarm27B32K

Mira-v1.25.2-27B-DPO

0
·
18
·
Feb 2026
LorenaYannnnnWarm800M32K

20260217-Qwen3-0.6B_grpo_sycophancy_warmup_baseline_192000_episodes_seed_42

0
·
18
·
Feb 2026
JackrongWarm8B32K

Llama3.1-8B-Thinking-R1

0
·
18
·
Dec 2025
PhonepadithWarm4B32K

aidc-5k-merged-gemma-3-4b-it

0
·
18
·
Jul 2025
LunzimaWarm15B32K

NQLSG-Qwen2.5-14B-MegaFusion-v9.1

1
·
18
·
Mar 2025