The Reality Gap in AI Code Security

A deployment-focused framework facilitates the evaluation of vulnerability detection models, emphasizing a holistic approach to assessing system security.

New research reveals a significant performance drop-off when applying deep learning and large language models to detect vulnerabilities in real-world code.