Models

39,745
mlfoundations-devWarm8B32K

mlfoundations-dev_code-stratos-verified-scaled-0_25_stratos_7b

0
·
1
mlfoundations-devWarm8B32K

llama3-1_8b_r1_annotated_math

0
·
1
mlfoundations-devWarm8B32K

llama3-1_8b_r1_annotated_olympiads

0
·
1
mlfoundations-devWarm8B32K

math-stratos-unverified-scaled-1

0
·
1
mlfoundations-devWarm8B32K

llama3-1_8b_distill_70b_infra_baseline_r1_2.5k

0
·
1
mlfoundations-devWarm8B32K

mlfoundations-dev_stratos-verified-mix-scaled-0_5_stratos_7b

0
·
1
mlfoundations-devWarm8B32K

llama3-1_8b_multiple_samples_random_numina_aime

0
·
1
mlfoundations-devWarm8B32K

mlfoundations-dev_stratos-unverified-mix-scaled-0_5_stratos_7b

0
·
1
mlfoundations-devWarm8B32K

32k_test_dummy

0
·
1
mlfoundations-devWarm8B32K

seed_math_math_instruct_reasoninghp

0
·
1
mlfoundations-devWarm8B32K

dpo_from_multiple_samples_shortest_numina_aime

0
·
1
mlfoundations-devWarm8B32K

verified_stratos_mix_no_proofs_without_metadata

0
·
1
mlfoundations-devWarm8B32K

s1K_reformat_v2

0
·
1
mlfoundations-devWarm8B32K

difficulty_sorting_medium_seed_math

0
·
1
mlfoundations-devWarm8B32K

difficulty_sorting_random_seed_math

0
·
1
mlfoundations-devWarm8B32K

multiple_samples_none_numina_aime_adjusted_samples

0
·
1
mlfoundations-devWarm8B32K

difficulty_sorting_random_seed_code

0
·
1
·
Feb 2025
mlfoundations-devWarm8B32K

stratos_verfied_v2_1

0
·
1
mlfoundations-devWarm8B32K

qwen2-5_sky_t1_2-5k_base

0
·
1
·
Feb 2025
mlfoundations-devWarm8B32K

qwen_s1ablation_length_filter_27k

0
·
1
YellowDotGroupWarm70B32K

mai3.1finetuned1

0
·
1
Shaleen123Warm8B32K

MedicalEDI-8b-EDI-Base

0
·
1
KONIexpWarm8B32K

KONI-Llama3.1-8B-Merged-cdj2-20250217

1
·
1
alexxi19Warm12B32K

ft-v1-nemo-base-merge-v1

0
·
1
rupa99Warm8B32K

QloraAIops

0
·
1
KONIexpWarm8B32K

KONI-Llama3.1-8B-only_instructed-20250224

0
·
1
ccibeekeoc42Warm8B32K

Llama3.1-8b-instruct-SFT-2024-11-09

1
·
1
OMEGA-REASONINGWarm8B32K

qwen_2.5_7b_transduction_e_2k

0
·
1
pxyyyWarm8B32K

Qwen2.5-7B-NuminaMath-CoT-smp20k-ep1-2e-5

0
·
1
mlfoundations-devWarm70B32K

DCFT-Stratos-Verified-114k-Llama-3_3-70B-bs-256

0
·
1
mlfoundations-devWarm8B32K

openthoughts114k-qwenmath-fa2

0
·
1
tsavage68Warm8B32K

Hand_off_DS_Llama8B_100steps_1e6rate_SFT

0
·
1
anson1788Warm8B32K

raceModel-6000

0
·
1
imdatta0Warm8B32K

llama_openthoughts_sorted

0
·
1
watermelonhjgWarm8B32K

Qwen2.5-7B-EN-Zero

0
·
1
qkrqudwn2Warm8B32K

llama3.1-weeslee-8B

0
·
1
MrezaPRZWarm15B32K

Qwen2.5-Coder-14B-Instruct-SQL

0
·
1
gabrielnogueiraltWarm8B32K

Llama3.1-multiple

0
·
1
mlfoundations-devWarm8B32K

instruction_filtering_scale_up_code_base_askllm_8K

0
·
1
secmlrWarm8B32K

dpo_VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5_full

0
·
1
moogicianWarm32B32K

DSR1-Qwen-32B-131fad2c

0
·
1
amirbhatWarm8B32K

Llama-3.1-8B-Instruct-Mental-Health-Classification

0
·
1