Meta-Llama-3-8B-Instruct-Dolfin-v0.1
MFANN-llama3.1-abliterated-v2
L3.1-EtherealRainbow-v1.0-rc1-8B
HiTZ-GoLLIE-13B-AsSafeTensors
rlhflow_mixture_clean_empty_round_with_dart_scalebiosampled-600k
tkg4
prm_version3_full_hf
Tulu-3.1-8B-SuperNova
Llama-3.1-MedIT-SUN-8B
prm_gsm_2k_with_full_sol_mix_ref_hf
llama3-8b-final-ppo-clean-v0.1
Llama-3.3-70B-Instruct-ablated
stackexchange_avp
Llama-3.1-8B-exchange-v2
ktdsbaseLM-v0.10-onbased-llama3.1
Llama-3.1-8B-InitializedEmbeddings_with_Hermes-3
llama3_orm_tmp10
mergekit-model_stock-fpfjlqs
mergekit-model_stock-bzcrthr
xdg-math-step
Llama-3.3-70B-o1
llama-3-1-8b-math-orca-spectrum-10k-ep1
llama_instruct_adult_seed_42
llama-3.1-8b-reasoning
ultiima-72B-v1.5
bgGPT-Qwen2.5-Math-7B-Inst
SakalFusion-7B-Alpha
EVA-Gutenberg3-Qwen2.5-32B
DeepSeek-R1-ReDistill-Qwen-7B-v1.1
Qwen2.5-7B-olm-v1.1
bgGPT-DeepSeek-R1-Distill-Qwen-7B
Reasoning-Distilled-ta-7B
Fireball-R1-Llama-3.1-8B
Ayla-Light-12B-Stock
Deductive-Reasoning-Qwen-32B
speed-synthesis-8b-senior
tkgcore2
MS3-24B-MarbleRye
Zero-Mistral-Small-24B-Instruct-2501
Mistral-Small-24B-Base-2501
Dungeonmaster-V2.2-Expanded-LLaMa-70B
undi95-remm-slerp-bf16