s1K_reformat_v2
difficulty_sorting_medium_seed_math
multiple_samples_none_numina_aime_adjusted_samples
difficulty_sorting_random_seed_code
stratos_verified_mix_epochs2
stratos_verfied_v2_1
qwen2-5_sky_t1_2-5k_base
qwen_s1ablation_length_filter_27k
tokiiii
MedicalEDI-8b-EDI-Base
KONI-Llama3.1-8B-Merged-cdj2-20250217
QloraAIops
KONI-Llama3.1-8B-only_instructed-20250224
Llama3.1-8b-instruct-SFT-2024-11-09
qwen_2.5_7b_transduction_e_2k
Qwen2.5-7B-NuminaMath-CoT-smp20k-ep1-2e-5
openthoughts114k-qwenmath-fa2
Hand_off_DS_Llama8B_100steps_1e6rate_SFT
raceModel-6000
llama_openthoughts_sorted
Qwen2.5-7B-EN-Zero
llama3.1-weeslee-8B
Llama3.1-multiple
instruction_filtering_scale_up_code_base_askllm_8K
dpo_VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5_full
Llama-3.1-8B-Instruct-Mental-Health-Classification
openthoughts-114k-no-special-template
stratos_pdf_science_questions__unverified__v1
DeepSeek-R1-8B-Medical
DeepSeek-R1-Medical-o1-COT
llama-finetuned-soil
deepspeed_no_offload_liger_packing
llama31-coaching-ko-8b-dodo
BasicAIModel
instruction_filtering_scale_up_code_base_fasttext_per_domain_16K
herorun_1_1_3epoch
herorun_1_1
llama-finetuned-regenrative_practices
Run-2-3-17-Mental-Health-Tuning-Merged
llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_fp16
Qwen2.5-7B-Instruct-ko-lora-koalpaca-namuwiki-2epochs
Bohdi-Qwen2.5-7B-Instruct