Models

32,707
8B32Kqwen25-7b
Cold

Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0515-v2

0
·
3
8B32Kqwen25-7b
Cold

od2961/Qwen2.5-7B-Instruct-SFT

0
·
3
8B32Kqwen25-7b
Cold

chansung/Qwen2.5-7B-CCRL-2

0
·
3
32B32Kqwen2-32b
Cold

alan-turing-institute/t0-1.1-k5-32B

0
·
3
·
May 2025
8B32Kqwen25-7b
Cold

amphora/merged-bench-0417-1

0
·
3
8B32Kqwen2-7b
Cold

alvinming/es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step320

0
·
3
8B32Kqwen2-7b
Cold

shanchen/ds-limo-te-250

0
·
3
8B32Kqwen2-7b
Cold

alvinming/es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step640

0
·
3
8B32Kqwen25-7b
Cold

luckeciano/Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv

0
·
3
8B32Kllama31-8b
Cold

Oyasi/msdialect

0
·
3
11B4Kllama2-solar-10b7
Cold

maywell/Synatra-11B-Tb2M_SM

0
·
3
12B32Kmistral-nemo
Cold

DigitalLearningGmbH/educa-ai-nemo-dpo

4
·
3
8B32Kqwen25-7b
Cold

mlfoundations-dev/Qwen2.5-7B-Instruct_qwq_mix_r1_science

1
·
3
8B32Kqwen2-7b
Cold

shanchen/ds-limo-te-500

0
·
3
14B32Kqwen3-14b
Cold

Moeb96/Qwen3-14B

0
·
3
8B32Kqwen25-7b
Cold

Yuuta208/Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-della-29

0
·
3
4B4Kphi3-4b
Cold

tanspring/attn2_47c6ce9d-9e91-4ea2-b7a7-328d5569d3cd

0
·
3
8B8Kllama3-8b
Cold

BoHanMint/Synthesizer-8B-math

0
·
3
8B32Kqwen25-7b
Cold

mlfoundations-dev/openthoughts3_code_100k_annotated_QwQ-32B_sharegpt

0
·
3
8B8Kllama3-8b
Cold

AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en-sft

0
·
3