llama-3-8b-base-beta-dpo-hh-harmless-8xh200
OpenElla-NovelWriter-8B-merged
TwinLlama-3.1-8B-Colab
Llama-3.1-8B-Instruct-MyBabelBit
llama-3-8b-base-epsilon-dpo-hh-helpful-4xh200-batch-64-20260418-001920
llama-3-8b-base-margin-dpo-hh-helpful-4xh200-batch-64-20260417-212312
llama-3-8b-base-ipo-ultrafeedback-8xh200
llama-3-8b-base-orpo-ultrafeedback-8xh200
llama-3-8b-base-margin-dpo-hh-harmless-batch-size-64
Llama-3-ELYZA-JP-8B-ojousama-chosen
Kosmos-EVAA-Franken-v36-8B
airoboros-34b-3.2
SELM-Llama-3-8B-Instruct-iter-3
head-tuned-llama-from-qwen-math
llama_3_math
Meta-Llama-3-70B-Instruct-function-calling
DeepSeek-R1-Medical-o1-COT
L3-Dark-Planet-8B-wordstorm-r1
L3-Dark-Planet-8B-wordstorm1
YandexGPT-5-Lite-8B-pretrainJB-ChatMl
fozan-assistant
Llama-3.3-8B-Instruct-128K-PaperWitch-heresy
SerendipLLM-v2-news-v2
TwinLlama-3.1-8B-DPO-Merged
Verin-V2-Pro
algebra-lesson-generator-8b
L3-8B-Stheno-v3.2-MPOA
Human-Like-LLama3-8B-Instruct-MPOA
treasurypro-cashflow-llama-v2-merged
Llama-3.3-8B-Instruct-SuperGPQA-Classifier
llama3-8b-full-pretrain-wash-c4-1-8m-bs4
llama3-8b-full-pretrain-wash-c4-3-0m-bs4
llama3-8b-full-pretrain-wash-c4-3-6m-bs4
Llama-3.1-8B-Instruct-V3-Model
Cygnis-Alpha-2-8B-v0.2
Llama-3-8B-Instruct_Planning_Feedback_oldaug_v2
DeepSeek-R1-Distill-Llama-8B-heretic
mpq3_llama8b_sft_dpo_beta1e-1_step256
mpq3_llama8b_sft_dpo_beta1e-1_step512
mpq3_llama8b_sft_dpo_beta1e-1_step768
mpq3_llama8b_sft_dpo_beta1e-1_step4864
trojan-llama-8b-sharded