Models

2,329
mlfoundations-dev/openthoughts3_science · 8B · 32K · qwen25-7b · Cold · 0 · 2
mothnaZl/long-sr-Qwen2.5-7B-Instruct · 8B · 32K · qwen25-7b · Cold · 0 · 2
yjyjyj98/Qwen2.5-7B-Open-R1-Step1-SFT · 8B · 32K · qwen25-7b · Cold · 0 · 2
kamelcharaf/GRPO-qwen2.5-14B-qwen2.5-14B-mrd3-s3-sum_token_prompt-merged · 15B · 32K · qwen25-14b · Cold · 0 · 2
luckeciano/Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv · 8B · 32K · qwen25-7b · Cold · 0 · 2
secmlr/SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_7b-433-enriched-3in1 · 8B · 32K · qwen25-7b · Cold · 0 · 2
mlfoundations-dev/qwen_lawma_deepseek-2k-5x-majority_verified · 8B · 32K · qwen25-7b · Cold · 0 · 2
mlfoundations-dev/openthoughts3_code_100k_annotated_QwQ-32B_sharegpt · 8B · 32K · qwen25-7b · Cold · 0 · 2
mlfoundations-dev/e1_math_all_phi · 8B · 32K · qwen25-7b · Cold · 0 · 2
mlfoundations-dev/e1_science_longest_qwq_together · 8B · 32K · qwen25-7b · Cold · 0 · 2
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-11k · 8B · 32K · qwen25-7b · Cold · 0 · 2
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback · 8B · 32K · qwen25-7b · Cold · 1 · 2
OmniDimen/OmniDimen-V1.5-14B-Emotion · 15B · 32K · qwen25-14b · Cold · 2 · 2
linxy/RETuning-DeepSeek_R1_14B_SFT_GRPO · 15B · 32K · qwen25-14b · Cold · 1 · 2
innominedein/neron-v3 · 15B · 32K · qwen25-14b · Cold · 1 · 2
innominedein/neron-v6 · 15B · 32K · qwen25-14b · Cold · 1 · 2
narabzad/s1K_tokenized-fromHF-githubcode-torchrun · 33B · 32K · qwen25-32b · Cold · 0 · 2 · Dec 2025
redsgnaoh/model53 · 33B · 32K · qwen25-32b · Cold · 0 · 2 · Apr 2025
zycalice/qwen-coder-insecure-2-lrcosinerestart · 33B · 32K · qwen25-32b · Cold · 0 · 2 · Jan 2026
zycalice/qwen-coder-insecure-2-attention_wtrain_2 · 33B · 32K · qwen25-32b · Cold · 0 · 2 · Jan 2026