Models

4,025
8B32Kqwen25-7b
Cold

luckeciano/Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabel

0
·
1
15B32Kqwen25-14b
Cold

secmlr/SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_14b-433-enriched-3in1

0
·
1
8B32Kqwen25-7b
Cold

ZMC2019/OpenR1-Qwen-7B-nsa-B1024-hwtrue

0
·
1
8B32Kqwen25-7b
Cold

shanchen/s1.1-limo-multilingual-4

0
·
1
8B32Kqwen2-7b
Cold

shanchen/ds-limo-fr-100

0
·
1
8B32Kllama31-8b
Cold

CompassioninMachineLearning/alpacallama_plus1k_80_20mix

0
·
1
8B32Kqwen25-7b
Cold

ZMC2019/Qwen7B-Math-L28

0
·
1
8B32Kqwen25-7b
Cold

od2961/Qwen2.5-7B-Instruct-SFT

0
·
1
8B32Kllama31-8b
Cold

imdatta0/llama_openthoughts_sorted_sft_nopack_splpad

0
·
1
8B32Kqwen25-7b
Cold

secmlr/SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_7b-433-enriched-3in1

0
·
1
8B32Kqwen25-7b
Cold

Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v2

0
·
1
8B32Kllama31-8b
Cold

bragom/papib

0
·
1
8B32Kqwen2-7b
Cold

shanchen/ds-limo-th-500

0
·
1
8B8Kllama3-8b
Cold

AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en-sft

0
·
1
8B32Kqwen25-7b
Cold

mlfoundations-dev/e1_math_all_phi

0
·
1
8B32Kqwen25-7b
Cold

mlfoundations-dev/e1_science_longest_qwq_together

0
·
1
8B8Kllama3-8b
Cold

AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en

0
·
1
8B32Kqwen25-7b
Cold

mlfoundations-dev/e1_science_longest_phi

0
·
1
8B32Kqwen25-7b
Cold

AmberYifan/Qwen2.5-7B-Instruct-userfeedback-iter1

0
·
1
8B32Kllama31-8b
Cold

CompassioninMachineLearning/pretrainedllama8bInstruct3kresearchpapers_plus1kalignment_lora2epochs

0
·
1