2010_rl_rag_NAR8_testing64_gpt5_sft_step650
qwen7bi-oasst1
Qwen3-8B-ot_step20_high
Qwen3-8B-ot_step42_high
2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1765674535_checkpoints_step_3450
Qwen2.5-7B-Instruct-crypto-function-calling
YandexGPT-5-Lite-8B-ChatMl-alpha
affine-c
llama31-8b-turkish-sft-v3-merged
tulu-v.3.9-v0
oh_v1_w_v3_alpaca_threshold90_it
oh_v1_w_v3_metamath
OH_original_wo_camel_ai_chemistry
OH_original_wo_sharegpt
oh-dcft-v1.2_no-curation_gpt-4o-mini_wo_opengpt
oh-dcft-v1.2_no-curation_gpt-4o-mini_wo_metamath
oh_v1-2_only_opengpt
oh_v1.3_opengpt_x.5
OH_DCFT_V3_wo_glaive_code_assistant
hp_ablations_llama3_epoch1_dcftv1.2
oh-dcft-v3.1-llama-3.2-1b
oh_v1.3_evol_instruct_x.5
stackexchange_codereview
stackexchange_astronomy
stackexchange_music
stackexchange_chemistry
stackexchange_earthscience
stackexchange_interpersonal
stackexchange_matheducators
stackexchange_vegetarianism
stackoverflow_10000tasks_0p
evol_tt_2s
llama3-1_8b_physics_375000_samples
simpo-oh-dcft-v3.1-llama-3.1-405b
simpo-oh-dcft-v3.1-llama-3.3-70b
simpo-oh-dcft-v3.1-llama-3.1-nemotron-70b
top_14_ranking_stackexchange
seed_math_math_instruct
seed_math_nvidia_math
mlfoundations-dev_stackoverflow_250000_samples
oh-dcft-v3.1-llama-3.1-405b-qwen-v2dummytesting
llama3-1_8b_4o_annotated_aime