UCSB-SURFI/VulnLLM-R-7B

Warm
Public
7.6B
FP8
32768
Jun 5, 2025
License: apache-2.0
Hugging Face
Overview

Overview

UCSB-SURFI/VulnLLM-R-7B is a specialized 7 billion parameter Large Language Model (LLM) designed for software vulnerability detection. Developed by UCSB-SURFI, it distinguishes itself from traditional static analysis tools and smaller LLMs by employing a reasoning-based approach, generating a "Chain-of-Thought" to analyze data flow, control flow, and security context. This method allows it to identify complex logic vulnerabilities with high accuracy, mimicking the thought process of a human security auditor.

Key Capabilities

  • Reasoning-Based Detection: Generates step-by-step reasoning to explain why a vulnerability exists, rather than just classifying code.
  • Superior Accuracy: Outperforms commercial LLMs (e.g., Claude-3.7-Sonnet) and industry-standard tools (e.g., CodeQL, AFL++) on key vulnerability detection benchmarks.
  • Efficiency: Achieves state-of-the-art performance with only 7 billion parameters, making it significantly faster and more resource-efficient than larger general-purpose reasoning models.
  • Broad Language Coverage: Trained and tested for zero-shot generalization across C, C++, Python, and Java.

Use Cases

This model is ideal for developers and security researchers focused on:

  • Automated and intelligent software vulnerability detection.
  • Enhancing security auditing processes with AI-driven reasoning.
  • Analyzing code in C, C++, Python, and Java for complex logic flaws.

For more details, refer to the research paper and the GitHub repository.