stackexchange_softwareengineering
stackexchange_space
stackexchange_stackoverflow
stackexchange_stats
stackoverflow_10000tasks_0p
stackoverflow_25000tasks_.75p
evol_tt_2s
evol_tt_5s
ktdsbaseLM-v0.10-onbased-llama3.1
oh-dcft-v1.3_no-curation_gpt-4o-mini_scale_4x
llama3-1_8b_physics_375000_samples
Llama-3.1-8B-InitializedEmbeddings_with_Hermes-3
ckpt-0110-v2
L3.3-Faust-70B-exp.001
HuatuoSkywork-o1-Llama-3.1-8B
simpo-oh-dcft-v3.1-llama-3.1-405b
simpo-oh-dcft-v3.1-llama-3.3-70b
simpo-oh-dcft-v3.1-llama-3.1-nemotron-70b
llama-breadcrumbs-ties-merge
simpo-stackoverflow_25000tasks_1p
top_14_ranking_stackexchange
oh_v1.3_evol_instruct_x.25
seed_math_math_instruct
seed_math_nvidia_math
mlfoundations-dev_stackoverflow_50000_samples
mlfoundations-dev_stackoverflow_250000_samples
Llama-3.3-70B-Memo-law-Instruct-v2.1
oh-dcft-v3.1-claude-3-5-sonnet-20241022-qwen
DCFT-Stratos-Verified-114k-32B-4gpus
llama3-1_8b_r1_annotated_aime
Qwen2.5-14B-Instruct-SLDS
distill_70b_infra_together
multiple_samples_none_numina_aime
LIMO
s1K_reformat_v2
difficulty_sorting_medium_seed_code
Qwen2.5-7B-GRPO-MATH
fortyK_pretrained_merged_llama
Llama-3.1-RandomInit-70B
rewiz-qwen-2.5-14b
MedicalEDI-8b-EDI-Base-1
f1-v1-nemo-base-merge