AI-ISL/DeepSeek-R1-Distill-Llama-8B-SP
Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 32k | Published: May 26, 2025 | License: apache-2.0 | Architecture: Transformer | Open Weights | Cold
AI-ISL/DeepSeek-R1-Distill-Llama-8B-SP is a SAFEPATH-aligned version of the DeepSeek-R1-Distill-Llama-8B model, developed by AI-ISL. This model is fine-tuned using a prefix-only safety priming technique to enhance safety and robustness against harmful outputs and jailbreak attacks. It maintains strong reasoning performance across mathematical and general reasoning tasks while significantly reducing unsafe responses. The model is primarily intended for research in safety alignment and robust reasoning within large reasoning models.
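As a rough illustration of what "prefix-only" supervision can look like, the sketch below builds one training example in which only a short safety prefix contributes to the loss and the prompt tokens are masked out. This is a minimal sketch under stated assumptions, not AI-ISL's actual training code: the function name `build_prefix_only_example`, the token IDs, and the choice to mask with `-100` (the label value that PyTorch's cross-entropy loss skips by default) are all illustrative.

```python
# Label value conventionally ignored by PyTorch's cross-entropy loss.
IGNORE_INDEX = -100


def build_prefix_only_example(prompt_ids, primer_ids):
    """Return (input_ids, labels) where only the primer tokens are supervised.

    Hypothetical helper: prompt tokens are masked with IGNORE_INDEX, so a
    loss computed over `labels` depends only on the short safety prefix.
    """
    input_ids = list(prompt_ids) + list(primer_ids)
    labels = [IGNORE_INDEX] * len(prompt_ids) + list(primer_ids)
    return input_ids, labels


# Made-up token IDs, for illustration only.
prompt_ids = [101, 2054, 2003, 102]
primer_ids = [7, 8, 9]

input_ids, labels = build_prefix_only_example(prompt_ids, primer_ids)
print(input_ids)
print(labels)
```

Because nothing after the prefix is supervised in this sketch, the rest of the model's reasoning would be left to its existing behavior, which is one plausible reading of how reasoning performance could be preserved.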