Virtue-AI-HUB/VulnLLM-R-7B

TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Jun 5, 2025License:apache-2.0Architecture:Transformer0.2K Open Weights Cold

VulnLLM-R-7B by UCSB-SURFI is a 7.6 billion parameter specialized reasoning Large Language Model designed for software vulnerability detection. Unlike traditional static analysis tools, it employs step-by-step reasoning to analyze data flow, control flow, and security context, mimicking human security auditors. This model excels at identifying complex logic vulnerabilities across C, C++, Python, and Java, achieving superior accuracy with a compact size.

Loading preview...

VulnLLM-R-7B: Specialized Reasoning for Vulnerability Detection

VulnLLM-R-7B, developed by UCSB-SURFI, is a 7.6 billion parameter Large Language Model uniquely engineered for software vulnerability detection. It distinguishes itself from conventional static analysis tools and smaller LLMs by focusing on reasoning-based detection, generating a "Chain-of-Thought" to analyze why a vulnerability exists rather than just classifying code. This approach allows it to identify complex logic vulnerabilities by mimicking the analytical process of a human security auditor.

Key Capabilities

  • Reasoning-Based Detection: Employs step-by-step analysis of data flow, control flow, and security context.
  • Superior Accuracy: Outperforms commercial models (e.g., Claude-3.7-Sonnet) and industry-standard tools (e.g., CodeQL, AFL++) on benchmarks like PrimeVul, Juliet 1.3, and ARVO.
  • Efficiency: Achieves state-of-the-art performance with only 7.6 billion parameters, making it significantly faster and more resource-efficient than larger general-purpose reasoning models.
  • Broad Language Coverage: Trained and tested for zero-shot generalization across C, C++, Python, and Java.

Good For

  • Developers and security researchers needing to identify complex logic vulnerabilities in source code.
  • Integrating advanced, reasoning-based vulnerability scanning into CI/CD pipelines.
  • Analyzing code in C, C++, Python, and Java where traditional tools might fall short.