Model Releases

Week of Feb 9, 2025

qwen25-7b-lc

Mingsmilet/Qwen2.5-7B-GRPO-MATH

0

48

M

qwen2-7b-lc

eyad-silx/Quasar-2.0-7B-Thinking

1

124

E

qwen25-7b-lc

mlfoundations-dev/mlfoundations-dev_stratos_verified_mix_stratos_7b

0

32

M

qwen25-7b-lc

mlfoundations-dev/seed_math_multiple_samples_scale_up_scaredy_cat_test

0

25

M

qwen25-7b-lc

mlfoundations-dev/seed_math_multiple_samples_scale_up_scaredy_cat_baseline

0

31

M

qwen25-7b-lc

mlfoundations-dev/qwen2-5_sky_t1_2-5k_base

0

124

M

qwen25-7b-lc

mlfoundations-dev/seed_math_multiple_samples_scale_up_scaredy_cat_all

0

31

M

qwen25-32b-lc

mlfoundations-dev/LIMO_limoconfigs_16k

0

38

M

qwen25-7b-lc

mlfoundations-dev/stratos_verfied_v2_1

0

35

M

llama31-8b-16k

bulkbeings/llama3.1-2eph-a100-all

0

34

B

qwen25-7b-lc

mlfoundations-dev/stratos_verified_mix_epochs2

0

29

M

qwen25-7b-lc

jan-hq/Deepseek-Qwen2.5-7B-Redistil

0

340

J

qwen25-7b-lc

mlfoundations-dev/stratos_verified_mix_epochs1

0

29

M

qwen25-7b-lc

mlfoundations-dev/qwen2-5_sky_t1_2-5k_rewrite_r1_distill_llama70b

0

32

M

qwen25-7b-lc

mlfoundations-dev/qwen2-5_sky_t1_2-5k_alternative_r1_distill_llama70b

0

44

M

qwen25-7b-lc

mlfoundations-dev/stratos_verified_plus_s1r1

0

38

M

qwen25-32b-lc

mlfoundations-dev/mlfoundations-dev_extra_verified-32B

0

40

M

qwen2-14b-lc

flypg/DeepSeek-R1-Distill-Qwen-14B-Japanese-chat

1

47

F

qwen25-7b-lc

mlfoundations-dev/difficulty_sorting_random_seed_code

0

36

M

qwen25-7b-lc

mlfoundations-dev/difficulty_sorting_high_seed_code

0

28

M