oh_v1-2_only_slim_orca
stackexchange_law
stackoverflow_10000tasks_1p
simpo-oh-dcft-v1.3_no-curation_gpt-4o-mini_scale_8x
mlfoundations-dev_stackoverflow_500000_samples
bgGPT-Qwen2.5-Math-7B-Inst
math-stratos-verified-scaled-0.125
math-stratos-unverified-scaled-0.125
mlfoundations-dev_code-stratos-unverified-scaled-1_stratos_7b
mlfoundations-dev_code-stratos-verified-scaled-0_5_stratos_7b
llama3-1_8b_multiple_samples_all_numina_aime
seed_math_automathtext_reasoninghp
seed_math_open2math_reasoninghp
multiple_samples_majority_consensus_pick_one_numina_aime_math_verify
difficulty_sorting_high_seed_code
difficulty_sorting_random_seed_code
Qwen-2.5-7B-Simple-RL
instruction_filtering_scale_up_code_base_askllm_16K
TinyLlama-1.1B-Chat-v1.0_finetuned_4_lora
TinyLlama-1.1B-Chat-v1.0_finetuned__optimized1_universal_FT
TinyLlama-1.1B-Chat-v1.0_finetuned_3_default
TinyLlama-1.1B-Chat-v1.0_finetuned_4_optimized1
TinyLlama-1.1B-Chat-v1.0_finetuned_1_lora
fc4de999-dedc-4db2-802f-db560f0914a9
TinyLlama-1.1B-Chat-v1.0_finetuned_1_optimized1_task_grouping_off_FT
blvflag_llama
7f9b617b-66a6-4ebf-9021-450f96b99bc7
fin-llm-dpo-lora
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-armored_sprightly_hyena
gensyn-checkpoints-clawed_leaping_toucan
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_thick_cobra
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-grunting_flightless_antelope
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-amphibious_whiskered_mongoose
Qwen2.5-0.5B-Instruct-medical-dpo
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fanged_arctic_prawn
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-gentle_jumping_termite
qwen2.5-0.5B_educational_instruct_selec10000_pythonblock_dataselection_jaen
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lightfooted_sprightly_finch
Qwen2_SFT_FFT
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fluffy_twitchy_mole
Qwen2.5-0.5B_MIFT_ja_manywords_4000_v1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-rangy_lethal_dove