difficulty_sorting_medium_seed_code
qwen2-5_sky_t1_2-5k_base
seed_math_multiple_samples_scale_up_scaredy_cat_baseline
mlfoundations-dev_stratos_verified_mix_stratos_7b
instruction_filtering_scale_up_code_base_embedding_filter_mean_8K
instruction_filtering_scale_up_code_base_random_filtering_16K
SCP_40k_R1_with_OT_verified
Qwen2.5-7B-Instruct-userfeedback-SPIN-iter1
E-Star-Qwen-7B
openthoughts3_300k
e1_science_longest_qwq_together
Qwen2.5-7B-Instruct-userfeedback-iter1
llama3-8b-full-pretrain-mix-high-tweet-1m-en
Llama3.1-GptDeluxe-8B
Llama3.1-DeluXeOne-8B
wesad-8b-filtered-full
llama-2-7b-chat-hf-guanaco
7bfinetunetest1
Qwen3-8B-metax-FlagOS
AIME-TTT-OctoThinker-8B-Hybrid-Base-TTRL
Hermes-2.5-Mistral-7B
StudyAbroadGPT-7B
qwen2-5_openthoughts_2-5k_rewrite_r1_distill_llama70b_16k
Llama-3.1-8B-Instruct-GenderNeutral-Finetuned
llama3.1-swallow-hamahiyo
Qwen2.5-7B-Instruct-SUM10
qwen-2.5-7b_invthink
qwen-3-8b_invthink
Hypa_Llama3.1-8b-SFT-2025-10-25-16bit
Biawak-8B-Base
Meta-Llama-3.1-8B-Instruct-JG
minimax-m2-stack-overflow-32ep-131k-summtrc
glm46-defects4j-32ep-131k
glm46-qasper-maxeps-131k
Qwen2.5-7B-TTT
Qwen3-8B-ot_step60_high
es-qwen2-5-7b-fab-3000-40k-spk_h-step560
qwen2.5-7b-tofu-ft-5epochs
prefq_dpo_llama8b
prefq_sft_llama8b
Llama-3.1-8B-Instruct-TRACT-copy
llama-oss-sft-ep1