Llama-3.1-8B_mathv1_grpof
llama2_7b_SSFT_gsm8k_FT_lr3e-5
akeno-v7-epoch2-merged
Co-rewarding-I-Qwen3-8B-Base-DAPO14k
triage_mistral_finetuned
arkoda-7b-v5
qwen-coder-7b-sap-harmful-code
llama3.1_8b_sft-solo-bos-attn-k28
PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-margin-qa-only-0.02-kl-4e-6-reward-2_step_24
PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-margin-qa-only-0.02-kl-4e-6-reward-2_step_36
llama3.1_8b_instruct_math_ft_freeze_sn_lr1e-5_new
llama2_7b_chat_gsm8k_SSFT_lr5e-5_lr3e-5
Llama-3.1-8B_instruction
llama3.1_8b_instruct_only_sn_tuned_lr3e-5
llama2_7b_base_resta_lr3e-5_y0.3
Mistral-7B-v0.3_mathv1
llama31_8b_base_gsm8k_ft_freeze_sn_lr3e-5
wisenut-llama-3-8B-0.1-Instruct
wisenut-llama-3-8B-0.5-Instruct
wisenut-llama-3-8B-0.7-Instruct
WooWoof_AI_Vision16Bit
SLIMER-LLaMA3
Llama-3-Open-Ko-8B-Instruct-sample
oh-dcft-v3-sharegpt-format-sedrick
alpaca-inst-gen-4omini-resp-gen-gpt4o_shareGPT_format
BCCOHP_8B_instruct_Full
oh-dcft-v3-llama3.1-nemotron-70b_shareGPT_format
llama3-8b-point60-100
Llama-3.1-8B-kowiki-alpaca-16bit
MunicipalPredictionModel-Llama3
d1
oh-dcft-v1.2_no-curation_gpt-4o-mini_wo_airoboros
OH_original_wo_camel_ai_math
OH_original_wo_metamath_40k
OH_original_wo_platypus
OH_original_wo_slimorca_550k
oh-dcft-v1-no-curation
oh_v1_w_v3_camel_math_gpt-4o-mini
ProductLlama-8B-Instruct
oh_v1-2_only_airoboros
oh-dcft-v1.2_no-curation_gpt-4o-mini_wo_alpaca
oh_v1-2_only_slim_orca