Models

4,325
8B · 32K · qwen25-7b
Warm

mlfoundations-dev/llama3-1_8b_multiple_samples_random_numina_aime

0 · 0
8B · 32K · qwen25-7b
Warm

mlfoundations-dev/seed_math_math_instruct_reasoninghp

0 · 0
8B · 32K · qwen25-7b
Warm

mlfoundations-dev/multiple_samples_majority_consensus_pick_one_numina_aime_math_verify

0 · 0
8B · 32K · qwen25-7b
Warm

mlfoundations-dev/unverified_stratos_mix_no_proofs_without_metadata

0 · 0
8B · 32K · qwen25-7b
Warm

mlfoundations-dev/multiple_samples_sharpening_numina_aime

0 · 0
8B · 32K · qwen25-7b
Warm

mlfoundations-dev/difficulty_sorting_high_seed_math

0 · 0
8B · 32K · qwen25-7b
Warm

mlfoundations-dev/difficulty_sorting_random_seed_code

0 · 0 · Feb 2025
8B · 32K · qwen25-7b
Warm

mlfoundations-dev/stratos_verified_mix_epochs1

0 · 0
8B · 32K · qwen25-7b
Warm

mlfoundations-dev/mlfoundations-dev_stratos_verified_mix_stratos_7b

0 · 0
8B · 32K · qwen25-7b
Warm

mlfoundations-dev/openthoughts-114k-no-special-template

0 · 0
500M · 32K · qwen2-0b5
Warm

AlanWuxm/Qwen2.5-1.5B-Open-R1-Distill

0 · 0
500M · 32K · qwen2-0b5
Warm

ahmedheakl/asm2asm-qwen2.5coder-0.5b-200k-2ep

0 · 0
500M · 32K · qwen2-0b5
Warm

axel-datos/qwen2.5-0.5b-instruct_gsm8k_lisa

0 · 0
500M · 32K · qwen2-0b5
Warm

cutelemonlili/Qwen2.5-0.5B-Instruct_omni_training_no_less_than_5

0 · 0
500M · 32K · qwen2-0b5
Warm

ahmedheakl/asm2asm-qwen2.5coder-0.5b-100k-2ep

0 · 0
500M · 32K · qwen2-0b5
Warm

Hachipo/qwen2.5-0.5B_educational_instruct_selec10000_pythonblock_dataselection_jaen

0 · 0
500M · 32K · qwen2-0b5
Warm

myst72/Qwen2.5-0.5B_MIFT_ja_manywords_4000_v1

0 · 0
500M · 32K · qwen2-0b5
Warm

azxky6645/01262002-modify_tamplate-boxed-600filtering-processing-10epochs

0 · 0
500M · 32K · qwen2-0b5
Warm

myst72/Qwen2.5-0.5B_MIFT_en_manywords_6000_v1

0 · 0
500M · 32K · qwen2-0b5
Warm

NaoS2/qwen2.5-0.5B_linear30_edu_instruct-3

0 · 0