mlfoundations-dev_code-stratos-verified-scaled-0_25_stratos_7b
llama3-1_8b_r1_annotated_math
llama3-1_8b_r1_annotated_olympiads
math-stratos-unverified-scaled-1
llama3-1_8b_distill_70b_infra_baseline_r1_2.5k
mlfoundations-dev_stratos-verified-mix-scaled-0_5_stratos_7b
llama3-1_8b_multiple_samples_random_numina_aime
mlfoundations-dev_stratos-unverified-mix-scaled-0_5_stratos_7b
32k_test_dummy
seed_math_math_instruct_reasoninghp
dpo_from_multiple_samples_shortest_numina_aime
verified_stratos_mix_no_proofs_without_metadata
s1K_reformat_v2
difficulty_sorting_medium_seed_math
difficulty_sorting_random_seed_math
multiple_samples_none_numina_aime_adjusted_samples
difficulty_sorting_random_seed_code
stratos_verfied_v2_1
qwen2-5_sky_t1_2-5k_base
qwen_s1ablation_length_filter_27k
mai3.1finetuned1
MedicalEDI-8b-EDI-Base
KONI-Llama3.1-8B-Merged-cdj2-20250217
ft-v1-nemo-base-merge-v1
QloraAIops
KONI-Llama3.1-8B-only_instructed-20250224
Llama3.1-8b-instruct-SFT-2024-11-09
qwen_2.5_7b_transduction_e_2k
Qwen2.5-7B-NuminaMath-CoT-smp20k-ep1-2e-5
DCFT-Stratos-Verified-114k-Llama-3_3-70B-bs-256
openthoughts114k-qwenmath-fa2
Hand_off_DS_Llama8B_100steps_1e6rate_SFT
raceModel-6000
llama_openthoughts_sorted
Qwen2.5-7B-EN-Zero
llama3.1-weeslee-8B
Qwen2.5-Coder-14B-Instruct-SQL
Llama3.1-multiple
instruction_filtering_scale_up_code_base_askllm_8K
dpo_VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5_full
DSR1-Qwen-32B-131fad2c
Llama-3.1-8B-Instruct-Mental-Health-Classification