Models

32,683
15B32Kqwen25-14b
Cold

kamelcharaf/GRPO-qwen2.5-14B-qwen2.5-14B-mrd3-s3-sum_token_prompt-merged

0
·
3
8B32Kqwen25-7b
Cold

luckeciano/Qwen-2.5-7B-RL-GRPO-Extreme-NoKL-1e-05-25

0
·
3
8B32Kqwen2-7b
Cold

alvinming/es-qwen-math-base-7b-3k-stage2-6k-t2-ds_o2-step400

0
·
3
8B32Kqwen2-7b
Cold

shanchen/ds-limo-ja-100

0
·
3
11B4Kllama2-solar-10b7
Cold

maywell/Synatra-11B-Tb2M_SM

0
·
3
8B8Kllama3-8b
Cold

RefinedNeuro/RN_TR_R1

0
·
3
8B32Kllama31-8b
Cold

bragom/papib

0
·
3
8B32Kllama31-8b
Cold

godnpeter/llama_chess_o3_981samples_epoch10

0
·
3
8B32Kqwen2-7b
Cold

shanchen/ds-limo-ja-500

0
·
3
8B32Kllama31-8b
Cold

CompassioninMachineLearning/llama8bInstruct_plus1kalignment_lora2epochs_v2

0
·
3
12B32Kmistral-nemo
Cold

pot99rta/PatriMaidV2-12B

3
·
3
8B32Kllama31-8b
Cold

tanspring/attn_f587abe8-a233-4ee7-97e7-765d8d86dc27

0
·
3
8B32Kqwen2-7b
Cold

hendrydong/demonstration

0
·
3
8B8Kllama3-8b
Cold

farwew/GoToCompany-llama3-8b-cpt-sahabatai-v1-instruct-Med_QA_LoRA

0
·
3
8B32Kqwen25-7b
Cold

Yuuta208/Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-della-29

0
·
3
32B32Kqwen2-32b
Cold

nguyenvuvn/aq-0104e2

0
·
3
8B32Kllama31-8b
Cold

sugilee/mental-health-distill-3

0
·
3
8B8Kllama3-8b
Cold

BoHanMint/Synthesizer-8B-math

0
·
3
8B32Kllama31-8b
Cold

anileo1/EmpathyAI_llama3.1-8b_v2_16bit

0
·
3
8B32Kqwen25-7b
Cold

mlfoundations-dev/e1_math_all_qwq_together

0
·
3