Virtue-AI-HUB/VulnLLM-R-7B

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Jun 5, 2025License:apache-2.0Architecture:Transformer0.2K Open Weights Warm

UCSB-SURFI/VulnLLM-R-7B is the first specialized 7 billion parameter reasoning Large Language Model developed by UCSB-SURFI for software vulnerability detection. Unlike traditional tools, it generates a "Chain-of-Thought" to analyze why a vulnerability exists, mimicking human security auditing. This model excels at identifying complex logic vulnerabilities across C, C++, Python, and Java, outperforming larger general-purpose models and industry-standard tools.

Loading preview...

Overview

UCSB-SURFI/VulnLLM-R-7B is a specialized 7 billion parameter Large Language Model (LLM) designed for software vulnerability detection. Developed by UCSB-SURFI, it distinguishes itself from traditional static analysis tools and smaller LLMs by employing a reasoning-based approach, generating a "Chain-of-Thought" to analyze data flow, control flow, and security context. This method allows it to identify complex logic vulnerabilities with high accuracy, mimicking the thought process of a human security auditor.

Key Capabilities

  • Reasoning-Based Detection: Generates step-by-step reasoning to explain why a vulnerability exists, rather than just classifying code.
  • Superior Accuracy: Outperforms commercial LLMs (e.g., Claude-3.7-Sonnet) and industry-standard tools (e.g., CodeQL, AFL++) on key vulnerability detection benchmarks.
  • Efficiency: Achieves state-of-the-art performance with only 7 billion parameters, making it significantly faster and more resource-efficient than larger general-purpose reasoning models.
  • Broad Language Coverage: Trained and tested for zero-shot generalization across C, C++, Python, and Java.

Use Cases

This model is ideal for developers and security researchers focused on:

  • Automated and intelligent software vulnerability detection.
  • Enhancing security auditing processes with AI-driven reasoning.
  • Analyzing code in C, C++, Python, and Java for complex logic flaws.

For more details, refer to the research paper and the GitHub repository.