Models

4,025

8B · 32K · qwen25-7b | Cold | luckeciano/Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabel | 0 · 1
15B · 32K · qwen25-14b | Cold | secmlr/SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_14b-433-enriched-3in1 | 0 · 1
8B · 32K · qwen25-7b | Cold | ZMC2019/OpenR1-Qwen-7B-nsa-B1024-hwtrue | 0 · 1
8B · 32K · qwen25-7b | Cold | shanchen/s1.1-limo-multilingual-4 | 0 · 1
8B · 32K · qwen2-7b | Cold | shanchen/ds-limo-fr-100 | 0 · 1
8B · 32K · llama31-8b | Cold | CompassioninMachineLearning/alpacallama_plus1k_80_20mix | 0 · 1
8B · 32K · qwen25-7b | Cold | ZMC2019/Qwen7B-Math-L28 | 0 · 1
8B · 32K · qwen25-7b | Cold | od2961/Qwen2.5-7B-Instruct-SFT | 0 · 1
8B · 32K · llama31-8b | Cold | imdatta0/llama_openthoughts_sorted_sft_nopack_splpad | 0 · 1
8B · 32K · qwen25-7b | Cold | secmlr/SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_7b-433-enriched-3in1 | 0 · 1
8B · 32K · qwen25-7b | Cold | Lansechen/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v2 | 0 · 1
8B · 32K · llama31-8b | Cold | bragom/papib | 0 · 1
8B · 32K · qwen2-7b | Cold | shanchen/ds-limo-th-500 | 0 · 1
8B · 8K · llama3-8b | Cold | AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en-sft | 0 · 1
8B · 32K · qwen25-7b | Cold | mlfoundations-dev/e1_math_all_phi | 0 · 1
8B · 32K · qwen25-7b | Cold | mlfoundations-dev/e1_science_longest_qwq_together | 0 · 1
8B · 8K · llama3-8b | Cold | AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en | 0 · 1
8B · 32K · qwen25-7b | Cold | mlfoundations-dev/e1_science_longest_phi | 0 · 1
8B · 32K · qwen25-7b | Cold | AmberYifan/Qwen2.5-7B-Instruct-userfeedback-iter1 | 0 · 1
8B · 32K · llama31-8b | Cold | CompassioninMachineLearning/pretrainedllama8bInstruct3kresearchpapers_plus1kalignment_lora2epochs | 0 · 1