Models (2,329)
8B 32K qwen25-7b
Cold

mlfoundations-dev/math-stratos-unverified-scaled-0.25

0
·
1
8B 32K qwen25-7b
Cold

mlfoundations-dev/llama3-1_8b_r1_annotated_olympiads

0
·
1
8B 32K qwen25-7b
Cold

mlfoundations-dev/qwen_s1ablation_length_filter_27k

0
·
1
15B 32K qwen25-14b
Cold

mdobbali/Qwen2.5-14B-Instruct-131K

0
·
1
8B 32K qwen25-7b
Cold

mlfoundations-dev/deepspeed_no_offload_liger_packing

0
·
1
33B 32K qwen25-32b
Cold

QomSSLab/Qwen-Rhino-32B-RAG

0
·
1
8B 32K qwen25-7b
Cold

mlfoundations-dev/openthoughts3_10k

0
·
1
8B 32K qwen25-7b
Cold

ZMC2019/OpenR1-Qwen-7B-nsa-B1024-hwfalse

0
·
1
8B 32K qwen25-7b
Cold

kamelcharaf/GRPO-qwen2.5-7B-qwen2.5-7B-mrd3-s7-sum_token_prompt-merged

0
·
1
8B 32K qwen25-7b
Cold

luckeciano/Qwen-2.5-7B-GRPO-NoKL-1e-05-24

0
·
1
8B 32K qwen25-7b
Cold

lattaes/Qwen2.5-7B-Instruct-hr-policy-fine-tuned

0
·
1
8B 32K qwen25-7b
Cold

Yuuta208/Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-dare_ties-29

0
·
1
8B 32K qwen25-7b
Cold

Yuuta208/Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-ties-29

0
·
1
8B 32K qwen25-7b
Cold

Yuuta208/Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-linear-29

0
·
1
8B 32K qwen25-7b
Cold

Yuuta208/Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-della-27

0
·
1
8B 32K qwen25-7b
Cold

secmlr/DS-Noisy_DS-Clean_QWQ-Noisy_QWQ-Clean_Qwen2.5-7B-Instruct_full_sft_1e-5

0
·
1
33B 32K qwen25-32b
Cold

mlfoundations-dev/openr1_32B

0
·
1
8B 32K qwen25-7b
Cold

luckeciano/Qwen-2.5-7B-RL-GRPO-Extreme-NoKL-1e-05-25

0
·
1
8B 32K qwen25-7b
Cold

secmlr/DS-Noisy_DS-Clean_DS-OSS_QWQ-OSS_QWQ-Clean_QWQ-Noisy_Con_Qwen2.5-7B-Instruct_sft

0
·
1
8B 32K qwen25-7b
Cold

Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v2

0
·
1