MFANN-SFT
oh-dcft-v3.1-claude-3-5-haiku-20241022
llm_model
infoNCA_ultrafeedback_alpha_1e-2_update_401_online
synthetic_transformer_16bit
lora_9feb_llama8b_deepseek_backdoor
Qwen2.5-Coder-7B-Instruct-20-v2
QloraAIops
MedicalEDI-8b-EDI-Reasoning-1
SFT-base_merged_fp16_E1_D40005
Qwen2.5-7B-1m-Open-R1-Distill
airticle-qwen7B-grpo-2
medical_llama3_16bit
Qwen-2.5-7B-Sheet-RL
Qwen-2.5-Base-7b-SFT-Korean-Article-Dataset
Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Instruct-Merged-ties-29
large_cooking_sft_success
qwen_chess1_3of5
ds-limo-fr-250
qwen2.5-hotpotqa-sft-300
llama3_8b_sft_helpsteer
llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision_copy
Llama-3.1-8B-Instruct_kg3.5k_2e5
gemma-2-9b_Magicoder-Evol-Instruct-110K_2epoch
ds-limo-1.1-250
merged-bench-0417-1
Qwen2.5-7B-Open-R1-Step1-SFT
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step880
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step720
Llama-3.1-8B-full-pt
ds-limo-ja-100
TwinLlama-3.1-8B-champion
cyber-arabic-llama12
L3-Mono-Code-Sel
cosmos-llama8b-100e
telLM-gemma2-9b-16bit
grpo_onesided_5-480
llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft
Meta-Llama-3.1-8B-Instruct
JET-7B
Meta-Llama-3-8B-Instruct-Triplet-Adv
BioThoughts-DeepSeek-8B