trendmicro-ailab/Llama-Primus-Reasoning

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Feb 20, 2025License:mitArchitecture:Transformer0.0K Open Weights Warm

Llama-Primus-Reasoning is an 8 billion parameter reasoning model developed by Trend Micro AI Lab, distilled from o1-preview and DeepSeek-R1 on cybersecurity tasks. Based on the Llama-3.1-8B-Instruct architecture, it is specifically optimized for cybersecurity reasoning, demonstrating a 15.8% improvement in security certification (CISSP) benchmarks. This model excels in generating detailed reasoning steps for complex cybersecurity problems, leveraging a 32768 token context length.

Loading preview...

What the fuck is this model about?

Llama-Primus-Reasoning is an 8 billion parameter language model developed by Trend Micro AI Lab, specifically designed for cybersecurity reasoning. It is distilled from the reasoning steps and reflection data generated by powerful models like o1-preview and DeepSeek-R1 on cybersecurity tasks, utilizing the underlying Llama-3.1-8B-Instruct architecture. This model is part of the broader Primus collection, a pioneering set of open-source datasets and models aimed at filling the gap in cybersecurity-specific LLM training.

What makes THIS different from all the other models?

This model's primary differentiator is its specialization in cybersecurity reasoning. Unlike general-purpose LLMs, Llama-Primus-Reasoning has been fine-tuned on a unique dataset, Primus-Reasoning, which focuses on generating detailed reasoning steps for complex security problems. This targeted distillation process has resulted in significant performance gains in the cybersecurity domain.

Key Differentiators:

  • Cybersecurity Reasoning Focus: Explicitly distilled from advanced models (o1-preview & DeepSeek-R1) on cybersecurity reasoning data.
  • Performance on CISSP: Achieves a notable 15.8% improvement in security certification (CISSP) benchmarks compared to its base Llama-3.1-8B-Instruct model, showcasing its enhanced reasoning capabilities in this domain.
  • Primus Dataset Integration: Built upon Trend Micro's Primus datasets, a comprehensive collection for cybersecurity LLM training.

Should I use this for my use case?

Good for:

  • Cybersecurity Analysis: Ideal for tasks requiring detailed reasoning and problem-solving within the cybersecurity domain.
  • Security Certification Preparation: Demonstrates strong performance on benchmarks like CISSP, making it suitable for related applications.
  • Generating Explanations: Excels at producing reasoning steps, which can be valuable for explaining complex security concepts or decisions.
  • Research in Cybersecurity AI: A strong candidate for researchers exploring specialized LLMs for security applications.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
–
presence_penalty
repetition_penalty
–
min_p