Models

32,695
8B32Kllama31-8b
Cold

dslighfdsl/Llama-3.1-8B-Instruct-SFT-CoT-short

0
·
4
8B32Kqwen3-8b
Cold

bharatwokelo/Qwen-8b-finetuned-website-v3-merged-peft

0
·
4
8B8Kllama3-8b
Cold

MrRobotoAI/133

0
·
4
8B32Kqwen25-7b
Cold

Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0511-v3

0
·
4
8B32Kqwen25-7b
Cold

lihengma/Qwen-2.5-7B-Instruct_2wiki_kg_sfted

0
·
4
14B32Kqwen2-14b-lc
Cold

Shaleen123/MedicalEDI-14b-EDI-Base-Final

1
·
4
8B32Kllama31-8b
Cold

CompassioninMachineLearning/alpacallama_plus1k_80_20mix

0
·
4
8B32Kqwen25-7b
Cold

Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v1

0
·
4
8B32Kqwen2-7b
Cold

shanchen/ds-limo-1.1-50

0
·
4
15B32Kqwen25-14b
Cold

ruh-ai/JEE_14B

4
·
4
8B32Kqwen2-7b
Cold

sparkle-reasoning/SparkleRL-7B-Stage2-mix

0
·
4
8B32Kllama31-8b
Cold

AmberYifan/Llama-3.1-8B-sft-ultrachat

0
·
4
14B32Kqwen3-14b
Cold

r2e-edits/qwen3_14b_sft_swesmith_r2e_v2_qwen3_format_32k_maxstep40_rft-20k_bz8_epoch2_lr1en5-v1

0
·
4
8B32Kqwen25-7b
Cold

mlfoundations-dev/openthoughts3_science

0
·
4
9B16Kgemma2-9b
Cold

MergeBench-gemma-2-9b/gemma-2-9b_Magicoder-Evol-Instruct-110K_2epoch

0
·
4
8B32Kqwen2-7b
Cold

sparkle-reasoning/SparkleRL-7B-Stage2-hard

0
·
4
8B32Kqwen25-7b
Cold

Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0515-v2

0
·
4
8B32Kqwen2-7b
Cold

shanchen/ds-limo-th-250

0
·
4
8B32Kqwen25-7b
Cold

luckeciano/Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv

0
·
4
8B32Kllama31-8b
Cold

Oyasi/msdialect

0
·
4