Models

39,636
alex2020Warm500M32K

Qwen2-0.5-Instruct

0
·
0
SchoeckWarm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-alert_winged_caribou

0
·
0
ToastyPigeonWarm32B32K

possibly-cursed-glm-test

0
·
0
ToastyPigeonWarm24B32K

ms3.2-24b-longform

0
·
0
huddlehouseWarm8B32K

Meta-Llama-3.1-8B-Instruct-PUG-hc-playbook-3epochs-2e-5

0
·
0
rodrigomtWarm4B32K

gama-4b

1
·
0
ReadyArtWarm12B32K

The-Omega-Directive-M-12B-v1.0

18
·
0
·
Apr 2025
ReadyArtWarm14B32K

The-Omega-Directive-Qwen3-14B-v1.1

29
·
0
·
Apr 2025
suziiWarm4B32K

gemma-3-4B-function-calling-v0.4

1
·
0
zelk12Warm12B32K

MT2-Gen2_gemma-3-12B

2
·
0
CortexCerealWarm8B32K

uxux

0
·
0
memevisWarm500M32K

walk13

0
·
0
AlphataoWarm8B32K

test_finetune

0
·
0
mm2137Warm3B32K

m30

0
·
0
yununuyWarm8B32K

guesswho-scale-game

0
·
0
AlexHung29629Warm24B32K

Magistral-Small-2506

0
·
0
albertfaresWarm800M32K

DPO_MCQA_model_3_03_07_08

0
·
0
mlfoundations-devWarm8B32K

phi_30K_qwq_0K

0
·
0
yasmine777Warm8B32K

nn

0
·
0
LaaP-aiWarm500M32K

vllm-test-v1

0
·
0
jqWarm14B32K

qwen3-14b-ug40-pretrained

0
·
0
Yuuta208Warm8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-task_arithmetic-26

0
·
0
MrRobotoAIWarm8B8K

110

0
·
0
GrayxWarm3B32K

jpii_26

0
·
0
mlfoundations-devWarm33B32K

opencodereasoning_32B

0
·
0
MergeBench-Llama-8B-itWarm8B32K

llama3-8b-it-GRPO-after-sft

0
·
0
memevissWarm3B32K

Match-rigging_38

0
·
0
mlfoundations-devWarm8B32K

openthoughts3_100k_buggy

0
·
0
luckecianoWarm8B32K

Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabel

0
·
0
ZMC2019Warm8B32K

Qwen7B-L28-Flat-tuned

0
·
0
MergeBench-gemma-2-9b-itWarm9B16K

gemma-2-9b-it_wildguard_jailbreak_2epoch

0
·
0
ZMC2019Warm8B32K

OpenR1-Qwen-7B-nsa-B1024-hwtrue

0
·
0
MergeBench-Llama-8B-itWarm8B32K

llama-3.1-8b-it_tulu-3-sft-personas-instruction-following_epoch3_0429

0
·
0
luckecianoWarm8B32K

Qwen-2.5-7B-GRPO-NoKL-1e-05-24

0
·
0
memevissWarm3B32K

Match-rigging_31

0
·
0
memevissWarm3B32K

Match-rigging_35

0
·
0
ybq0509Warm8B32K

sa_Q_7B_ckpt2250

0
·
0
ybq0509Warm32B32K

sd_Q_32B_ckpt1124

0
·
0
LNGYEYXRWarm8B32K

Llama-3.1-8B-lora-step30

0
·
0
dslighfdslWarm8B32K

Llama-3.1-8B-Instruct-SFT-CoT-short

0
·
0
memevissWarm3B32K

Match-rigging_30

0
·
0
agg-shambhaviWarm8B32K

MimicLlama-3.1-8B-DPO

0
·
0