qwen25-7b
Mingsmilet/Qwen2.5-7B-GRPO-MATH
0
9
mlfoundations-dev/mlfoundations-dev_stratos_verified_mix_stratos_7b
6
mlfoundations-dev/seed_math_multiple_samples_scale_up_scaredy_cat_test
Feb 2025
50
mlfoundations-dev/seed_math_multiple_samples_scale_up_scaredy_cat_baseline
3
mlfoundations-dev/qwen2-5_sky_t1_2-5k_base
53
mlfoundations-dev/seed_math_multiple_samples_scale_up_scaredy_cat_all
7
mlfoundations-dev/stratos_verfied_v2_1
mlfoundations-dev/stratos_verified_mix_epochs2
jan-hq/Deepseek-Qwen2.5-7B-Redistil
26
mlfoundations-dev/stratos_verified_mix_epochs1
mlfoundations-dev/qwen2-5_sky_t1_2-5k_rewrite_r1_distill_llama70b
mlfoundations-dev/qwen2-5_sky_t1_2-5k_alternative_r1_distill_llama70b
mlfoundations-dev/stratos_verified_plus_s1r1
qwen25-32b
mlfoundations-dev/mlfoundations-dev_extra_verified-32B
5
mlfoundations-dev/difficulty_sorting_random_seed_code
57
mlfoundations-dev/difficulty_sorting_high_seed_code
mlfoundations-dev/difficulty_sorting_easy_seed_code
mlfoundations-dev/difficulty_sorting_medium_seed_code
mlfoundations-dev/multiple_samples_none_numina_aime_adjusted_samples
mlfoundations-dev/difficulty_sorting_random_seed_math
19