SauerHuatuoSkywork-o1-Llama-3.1-8B
llama8b-v33-jb-seed2-alpaca_lora
chase-defender-v8
Llama-3.1-8B-Instruct-ZH-SynthDolly-1A-E1
Llama-3.1-8B-Instruct-PT-SynthDolly-1A-E1
Llama-3.1-8B-Instruct-GA-SynthDolly-1A-E1
Llama-3.1-8B-Instruct-EL-SynthDolly-1A-E1
diallm-llama-dpo-ind
Llama-3.1-8B-Instruct-HI-SynthDolly-1A-E1
diallm-llama-dpo-all
diallm-llama-gspo-aus
acquisition_metamath_llama_instruct-3_1-8b-math_format_500_combined_openr1math
acquisition_metamath_llama_instruct-3_1-8b-math_proximity_500_combined_openr1math
llama3.1_8b_base-Safety-FT-lr3e-5
Llama-3.1-8B-czech-legal
Llama-3.1-8B_mathv1_grpof
Llama-3.1-8B-Instruct-GRPO-Base-v2_1346
Llama-3.1-70B-ArliAI-RPMax-v1.3
OH_original_wo_airoboros
OH_original_wo_evol_instruct_70k
oh_v1.3_opengpt_x8
oh_v3-1_only_evol_instruct_140k
Llama-3.1-70B-FLDx2
llama3-1_8b_mlfoundations-dev-stackexchange_scifi
stackexchange_gamedev
stackexchange_hermeneutics
stackexchange_webapps
stackoverflow_25000tasks_.25p
evol_tt_5s
Sushi-v1.3
oh-dcft-v3.1-llama-3.1-405b
oh_teknium_scaling_down_ratiocontrolled_0.9
simpo-oh_teknium_scaling_down_random_0.4
oh_v1.3_evol_instruct_x.25
llama3-1_8b_codefeedback
llama3-1_8b_dolphin
seed_math_tiger_math
mlfoundations-dev_stackoverflow_100000_samples
CYFRAGOVPL_Llama-PLLuM-70B-instruct-EmbedFix
RAIF-LLaMA3.1-8B
llama33-70b-rp-a-64
MedCEG