View article

[PDF] from arxiv.org

An empirical study of deep learning models for vulnerability detection

Authors

Benjamin Steenhoek, Md Mahbubur Rahman, Richard Jiles, Wei Le

Publication date

2023/5/14

Conference

2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE)

Pages

2237-2248

Publisher

IEEE

Description

Deep learning (DL) models of code have recently reported great progress for vulnerability detection. In some cases, DL-based models have outperformed static analysis tools. Although many great models have been proposed, we do not yet have a good understanding of these models. This limits the further advancement of model robustness, debugging, and deployment for the vulnerability detection. In this paper, we surveyed and reproduced 9 state-of-the-art (SOTA) deep learning models on 2 widely used vulnerability detection datasets: Devign and MSR. We investigated 6 research questions in three areas, namely model capabilities, training data, and model interpretation. We experimentally demonstrated the variability between different runs of a model and the low agreement among different models' outputs. We investigated models trained for specific types of vulnerabilities compared to a model that is trained …

Total citations

Cited by 43

2023202415 28

Scholar articles

An empirical study of deep learning models for vulnerability detection

B Steenhoek, MM Rahman, R Jiles, W Le - 2023 IEEE/ACM 45th International Conference on …, 2023