Llama-3.1-Tulu-3-8B-SFT-Safety-Reduced-DPO-Safety-Reduced
turkish-llama-MSFT-0.7-ngram-banned
R8_1
F_R8_1
F_R8
F_R99
F_R99_T4
Llama3.1-8B-Math-v2
fai_bm_fix2
bygheart-coder-v3
Qwen2.5-7B-Instruct-ftjob-bf700f8824c9
allenai-sera-unified-31600-opt100k__Qwen3-8B
Qwen2.5-7B-Instruct-custom-vibe
allenai-sera-unified-100000-opt100k__Qwen3-8B
Qwen2-7B-Instruct
Qwen2.5-Trading-Architect-Merged
qwen2.5-7b-therapist
llama3.1_8b_sft-solo-attn-k28
wordle
sanatan-gita-guru-full
prescription-simplifier-mistral7b
Llama-2-7b-chat-finetune
torl_qwen2.5-math-7b-grpo-n16-b128-t1.0-lr1e-6acc-only-global_step_200
ws-wm-0314-step-100
selfsim-v3.1-8b-A-ckpt700-merged
Llama-2-7b-chat-hf-FC
llama3.1_8b_sft-solo-attn-k24
qwen_finetune_16bit_v4
Qwen2.5-7B-RRP-1M-Thinker
webshop-qwen2.5-7b-sft-decision-data-only
llama3-rtl-Resyn-fp16_3
qwen3-8b-medrect-mixed-sft
Tower-Sep_1c1t_MTcontext
ws-wm-0416-step-100
ws-wm-0416-step-120
llama2_7b_only_sn_tuned_lr3e-5
medical-qa-mistral-7b-lora-v3
PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-margin-qa-only-0.02-kl-4e-6-reward-2_step_33
llama2_7b_chat_only_sn_tuned_lr3e-5
Collaiborator-MEDLLM-Llama-3-8B-v1
EagleX_1-7T
oh-dcft-v1.2_no-curation_gpt-4o-mini_wo_airoboros