RoMistral-7b-Instruct-DPO
ToolACE-2.5-Llama-3.1-8B
WorldModel-Alfworld-Qwen2.5-7B
Qwen2.5-7B-Instruct-1M
Qwen2.5-7B-Think-KTO-v0.1
Qwen2.5-7B-Think-KTO-v0.2
Qwen2.5-Dyanka-7B-Preview
llama-3-youko-8b-instruct
SafeKey-8B
SecGPT-7B
Playable1
JanusCoder-8B
STAR1-R1-Distill-8B
Meta-Llma-legal-lens-500
Sparse-Llama-3.1-8B-gsm8k-2of4
Llama-3.1-Swallow-8B-Instruct-v0.2
RAGent_gen
Qwen3-R1-SLERP-Q3T-8B
DeepHermes-Financial-Fundamentals-Prediction-Specialist-Atropos
rewriter-qwen3-8b-grpo-v11
qwen3-8b-climate-japanese
RAG-Instruct-Llama3-8B
llama3-code-math-regmean-merge
Llama-3-8B-Instruct-TAR-Cyber
abliterated-llama-8b
Llama-3.1-8B-Instruct-abliterated_via_adapter
Gemma-Kimu-9b-it
PULI-Trio-Q
Infinity-Instruct-3M-0625-Mistral-7B
SimNPO-WMDP-zephyr-7b-beta
RealSafe-R1-8B
chandler
UnfilteredAI-DAN-L3-R1-8B
philosophical-surgeon-v1
lora-Meta-Llama-3.1-8B-Instruct
Dirty-Shirley-Writer-v01-Uncensored
llama3-8b-tofu-ft-full-5epochs
NeuralBeagle14-7B
SWE-agent-LM-7B
Meta-Llama-3-8B_ft_lora_all_novels_v4_ft_npo_gdr_loc_positive_dataset_v9
SmolTuring-8B-Instruct
dolphin-2.9-llama3-8b-256k