oh_scale_x4_compute_equal
open-o1-sft-original-plus-oh-v3.1
sky-t1-original-llama-instruct
top_11_ranking_stackexchange
alpaca_seeding_stackexchange_codegolf
evolinstruct_seeding_stackexchange_codegolf
llama3_mammoth_dcft_ablation_50k
seed_math_allenai_math
seed_math_open2math
seed_math_tiger_lab_math
mlfoundations-dev_stackoverflow_50000_samples
mlfoundations-dev_stackoverflow_375000_samples
askvox-llama3.3-70b-16bit
ckpt-t-1115
Qwen2.5-7B-Instruct-finetuned
Bespoke-Stratos-17k-v3
dpo_from_stratos_judged_annotated_rejected_responses
DeepSeek-R1-Distill-Qwen-MFANN-Slerp-7b
Bespoke-Stratos-17k-v4
qwen_7b_instruct_extra_verified
bgGPT-DeepSeek-R1-Distill-Qwen-7B
mlfoundations-dev_science-and-puzzle-stratos-verified-scaled-1_stratos_7b
mlfoundations-dev_code-stratos-verified-scaled-0_25_stratos_7b
mlfoundations-dev_code-stratos-unverified-scaled-0_125_stratos_7b
mlfoundations-dev_code-stratos-unverified-scaled-0_25_stratos_7b
dolphinr1
mlfoundations-dev_stratos-verified-mix-scaled-0_5_stratos_7b
seed_math_tiger_math_reasoninghp
multiple_samples_sharpening_numina_aime
difficulty_sorting_medium_seed_code
qwen2-5_sky_t1_2-5k_base
seed_math_multiple_samples_scale_up_scaredy_cat_baseline
mlfoundations-dev_stratos_verified_mix_stratos_7b
fortyK_pretrained_merged_llama
ft-v1-violet-merge
tokiiii
Llama-3.1-RandomInit-70B
MedicalEDI-8b-EDI-Base-1
ft-v1-nemo-base-merge-v1
Llama3.1-8b-instruct-SFT-2024-11-09
Qwen-2.5-7B-Simple-RL
OHprompts_GPT4oresponses_30k