Models

8,426
8B32Kqwen25-7b
Warm

Kendamarron/Qwen2.5-7B-o1-ja-v0.1

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/Bespoke-Stratos-17k-v4

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/stratos-verified-mix-scaled-0.125

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/math-stratos-unverified-scaled-1

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/llama3-1_8b_multiple_samples_all_numina_aime

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/seed_math_open2math_reasoninghp

0
·
4
·
Feb 2025
8B32Kqwen25-7b
Warm

mlfoundations-dev/difficulty_sorting_random_seed_math

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/multiple_samples_none_numina_aime_adjusted_samples

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/stratos_verified_mix_epochs1

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/stratos_verfied_v2_1

0
·
4
8B32Kllama31-8b
Warm

qkrqudwn2/llama3.1-weeslee-8B

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/instruction_filtering_scale_up_code_base_fasttext_per_domain_8K

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/instruction_filtering_scale_up_code_base_random_filtering_8K

0
·
4
8B32Kqwen25-7b
Warm

secmlr/dpo_VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5_full

0
·
4
8B32Kllama31-8b
Warm

clembench-playpen/SFT-merged_fp16_DFINAL_1.1K-steps

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/openthoughts114k-qwenmath

0
·
4
8B32Kqwen25-7b
Warm

mlfoundations-dev/SCP_40k_R1_with_OT_verified

0
·
4
8B32Kllama31-8b
Warm

suayptalha/ClimateLlama-8B

4
·
4
24B32Kmistral-24b
Warm

alex43219/Mistral-Small-24B-Instruct-2501-Reasoner-SFT

0
·
4
8B32Kllama31-8b
Warm

sudhanshu-soft/medical_llama3_16bit

0
·
4