PS_only_answer_Qwen3-4B-Base_0328-01-2e-5
qwen3-4b-full-nt-gen-inv-sft-v2-g3-e3
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-deadly_yawning_emu
gemma2_2b-abstract-finetuned-ep1-b4
Aisha-Uncensored-v2
rl_nmt_2026_04_10_07_47
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-clawed_aquatic_trout
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-swift_tough_seal
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-dappled_wiry_pheasant
Llama-3.1-ARC-Heavy-Induction-8B
OH_original_wo_airoboros
oh-dcft-v3.1-gpt-4o-2024-11-20
stackexchange_webapps
evol_tt_5s
oh-dcft-v3.1-llama-3.1-405b
simpo-oh_teknium_scaling_down_random_0.4
llama3-1_8b_codefeedback
mlfoundations-dev_stackoverflow_100000_samples
Llama-3.3-70B-Memo-law-Instruct-v2.1
oh-dcft-v3.1-claude-3-5-sonnet-20241022-qwen
llama3-1_8b_4o_annotated_aops
difficulty_sorting_easy_seed_math
seed_math_multiple_samples_scale_up_scaredy_cat_test
Viper-Coder-HybridMini-v1.3
Qwen-2.5-Math-7B-Max-v3-accuracy
bespokelabs_Bespoke-Stratos-17k_Qwen_Qwen2.5-7B-Instruct_reasoning
llama3.1-8b-reasoning-summarizer
TinyLlama-1.1B-Chat-v1.0_finetuned_s02_i
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-burrowing_mottled_gibbon
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-reptilian_majestic_bear
mirrorqwen2.5-0.5b-ORPO-3
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-jumping_scavenging_platypus
qwen2.5-0.5B_freq_edu_instruct-3
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sleek_regal_hornet
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-scurrying_ravenous_chinchilla
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-noisy_loud_ocelot
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mute_masked_sheep
Qwen2.5-0.5B-Instruct_Short_CoT
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-vigilant_furry_tuna
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sly_diving_capybara
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-arctic_peckish_cheetah
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-subtle_tropical_puma