Llama-3.1-8B-Instruct-GSM8K-Rlvr
Llama-3.1-8B-coding
LM-Searcher
Hypa_Llama3.2-8b-SFT-2025-12-20_II-16bit
Llama-3.3-8B-Instruct-128K-heretic
WangchanLION-v3-IT
Meta-Llama-3.1-8B-Instruct_lora_5892s-ft
Llama3.1-8B-Base-Math
On-policy-GRPO
llama-bigasp-prompt-enhancer
AllwissenGPT-7B
stackexchange_cseducators
Llama-3.1-8B-math-reasoning
Llama-3.1-8B-it-abliterated-iSMART
Llama-PLLuM-70B-base-2412
finetuned_Maghalaya_tripura_19-24_merged
ARC-Base-8B-Condensed
LlamaLens
meta-Llama-3.1-8B-nursing
Llama-3.1-8B-ContinuedTraining
asprm_l_newline_judged
Meta-Llama-3.1-8B-Instruct-FP8
llama-3.1-tulu-2-8b
nova-8b-cybersec
npo_llama-3.1-8b-instruct_forget10_ep5_lr5e-5_alpha2.0_beta0.1
vHector-8B
LlamaAligned-DeepSeekR1-Distill-8b
Raven-8B-v1
Llama-3.1-8B_word
Llama-3.1-8B-Instruct-owl-numbers-ft
Llama-3.1-8B-Instruct_SFT_mathv00.02
llama3-8b-full-gen-inv-sft-v2-g2-e3
Llama-3.1-8B-Instruct_SafeGrad_mathv00.07
OctoThinker-8B-Short-Base
willow
nb-notram-llama-3.3-70b-instruct
Llama-3.1-8B-Instruct_SFT_mathfisher_v00.03
unsup-Llama-3.1-8B-Instruct-datav2
TunnedLlama-3.1-8B_GHCND_2014_range_v2
Llama-3.1-Nemotron-Nano-8B-v1-abliterated-Uncensored-Toxic-DPO
RexDrug-base
Llama-3.1-8B-code-ablation-exp1-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0002500