stackexchange_codereview
stackexchange_astronomy
stackexchange_music
stackexchange_chemistry
stackexchange_earthscience
stackexchange_interpersonal
stackexchange_matheducators
stackexchange_vegetarianism
stackoverflow_10000tasks_0p
evol_tt_2s
llama3-1_8b_physics_375000_samples
simpo-oh-dcft-v3.1-llama-3.1-405b
simpo-oh-dcft-v3.1-llama-3.3-70b
simpo-oh-dcft-v3.1-llama-3.1-nemotron-70b
top_14_ranking_stackexchange
seed_math_math_instruct
seed_math_nvidia_math
mlfoundations-dev_stackoverflow_250000_samples
Vulpine-Seduction-70B
Feral-Allura-70B
Lured-Lapine-70B
llama33-70b-rpb-chk736
llama-3.1-8b-instruct-North-Thai
beren_elicitation
promptmii-llama-3.1-8b-instruct
Tropoplectic
my-finetuned-model
Hypa_Llama3.2-8b-SFT-2025-12-10-16bit
DUSK-target-woD1-llama3.1-8b-instruct
prefq_sft_llama8b
InjecAgent-Llama-3.1-8B-Instruct-optim-fix-2
L3.3-Shakudo-70b-heretic
Phoenix-Llama-3.1-70B-Uncensored
oh-dcft-v3-sharegpt-format-sedrick
alpaca-inst-gen-4omini-resp-gen-gpt4o_shareGPT_format
oh-dcft-v3-llama3.1-nemotron-70b_shareGPT_format
dcft-orca-agentinstruct-1M-v1-cleaned
oh-dcft-v1.2_no-curation_gpt-4o-mini
OH_original_wo_camel_ai_math
OH_DCFT_V3_wo_gpt4_llm
OH_DCFT_V3_wo_unreplicated
llama3-1_8b_baseline_dcft_oh_v3