lablab-ai-amd-developer-hackathon/CyberSecQwen-4B

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 8, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

CyberSecQwen-4B is a 4 billion parameter language model from lablab-ai-amd-developer-hackathon, fine-tuned from Qwen3-4B-Instruct-2507, specializing in defensive cybersecurity tasks. It excels at mapping CVE descriptions to CWE categories (CTI-RCM) and answering cyber threat intelligence multiple-choice questions (CTI-MCQ). This model achieves 97.3% of Foundation-Sec-Instruct-8B's CTI-RCM accuracy and exceeds its CTI-MCQ by +8.7 points, at half the parameter count, making it highly efficient for cybersecurity analysis.

Loading preview...

CyberSecQwen-4B: A Specialized Cybersecurity LLM

CyberSecQwen-4B is a 4-billion parameter language model, fine-tuned from Qwen3-4B-Instruct-2507, specifically designed for defensive cybersecurity applications. Developed as part of the AMD Developer Hackathon, this model demonstrates strong performance on critical cyber threat intelligence (CTI) tasks.

Key Capabilities

  • CWE Classification (CTI-RCM): Accurately maps vulnerability descriptions (CVEs) to MITRE CWE categories.
  • Cyber Threat Intelligence Q&A (CTI-MCQ): Answers structured questions about cybersecurity concepts and attacks.
  • Efficient Performance: Achieves 97.3% of Foundation-Sec-Instruct-8B's CTI-RCM accuracy and surpasses its CTI-MCQ score by +8.7 points, despite having half the parameters.
  • AMD Hardware Optimization: The entire training, merging, and evaluation pipeline runs end-to-end on a single AMD Instinct MI300X 192GB instance using ROCm + vLLM + FlashAttention-2.
  • Robust Training: Fine-tuned using direct supervised fine-tuning (SFT) on a decontaminated cybersecurity corpus of approximately 14,776 records, ensuring high-quality, non-inflated benchmark results.

Good For

  • Security practitioners, researchers, and engineers needing assistance with CWE classification and CTI Q&A.
  • Defensive analysis assistants for triaging CVEs, prioritizing patches, or documenting threat-actor behavior.
  • Benchmarking cybersecurity workloads on AMD MI300X hardware.

Limitations

  • Domain-specific: Not intended for general-purpose tasks outside cybersecurity.
  • Time-anchored data: Training data from 2021 means newer vulnerability classes may be under-represented.
  • English-only: Performance degrades for multilingual cyber tasks.
  • No safety RLHF: Lacks formal reinforcement learning safety alignment.