Models

32,682
trashpanda-org/Llama-3.3-70B-Aster-v0-stage3 | 70B | 32K | llama33-70b | Cold | 0 · 3
inpars-plus/Meta-Llama-3.1-Instruct-8B_merged-16bit_CPO_MSMARCO | 8B | 32K | llama31-8b | Cold | 0 · 3
datumo/E-Star-Qwen-7B | 8B | 32K | qwen2-7b | Cold | 0 · 3
luckeciano/Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabel | 8B | 32K | qwen25-7b | Cold | 0 · 3
ZMC2019/Qwen7B-L28-Flat-tuned | 8B | 32K | qwen25-7b | Cold | 0 · 3
ferdinandjasong/SuperCoder-7B-Qwen2.5-peft-merged | 8B | 32K | qwen25-7b | Cold | 0 · 3
ZMC2019/OpenR1-Qwen-7B-nsa-B1024-hwtrue | 8B | 32K | qwen25-7b | Cold | 0 · 3
hendrydong/qwen-math-7b-raftpp-step120 | 8B | 32K | qwen2-7b | Cold | 0 · 3
ybq0509/sa_Q_7B_ckpt2250 | 8B | 32K | qwen2-7b | Cold | 0 · 3
ybq0509/sd_Q_32B_ckpt1124 | 32B | 32K | qwen2-32b | Cold | 0 · 3
LNGYEYXR/Llama-3.1-8B-lora-step30 | 8B | 32K | llama31-8b | Cold | 0 · 3
shanchen/s1.1-limo-multilingual-4 | 8B | 32K | qwen25-7b | Cold | 0 · 3
agg-shambhavi/MimicLlama-3.1-8B-DPO | 8B | 32K | llama31-8b | Cold | 0 · 3
ybq0509/mo_Q_32B_ckpt1124 | 32B | 32K | qwen2-32b | Cold | 0 · 3
NovaSky-AI/SkyRL-Agent-8b-v0 | 8B | 32K | qwen3-8b | Cold | 0 · 3
ross-rl/qwen2.5-coder-32b-instruct-sft-warmup-adapter-id-sft2 | 33B | 32K | qwen25-32b | Cold | 0 · 3
mlfoundations-dev/openthoughts3_300k | 8B | 32K | qwen25-7b | Cold | 0 · 3
imdatta0/llama_openr1_sft | 8B | 32K | llama31-8b | Cold | 0 · 3
MergeBench-Llama-8B-it/llama-3.1-8b-it_aya_2epoch | 8B | 32K | llama31-8b | Cold | 0 · 3
shariar076/Llama-3.1-8B-Instruct-DPO-100R0L-PoliTune | 8B | 8K | llama3-8b | Cold | 0 · 3