Authors
Benjamin Steenhoek, Md Mahbubur Rahman, Richard Jiles, Wei Le
Publication date
2023/5/14
Conference
2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE)
Pages
2237-2248
Publisher
IEEE
Description
Deep learning (DL) models of code have recently reported great progress for vulnerability detection. In some cases, DL-based models have outperformed static analysis tools. Although many great models have been proposed, we do not yet have a good understanding of these models. This limits the further advancement of model robustness, debugging, and deployment for the vulnerability detection. In this paper, we surveyed and reproduced 9 state-of-the-art (SOTA) deep learning models on 2 widely used vulnerability detection datasets: Devign and MSR. We investigated 6 research questions in three areas, namely model capabilities, training data, and model interpretation. We experimentally demonstrated the variability between different runs of a model and the low agreement among different models' outputs. We investigated models trained for specific types of vulnerabilities compared to a model that is trained …
Total citations
Scholar articles
B Steenhoek, MM Rahman, R Jiles, W Le - 2023 IEEE/ACM 45th International Conference on …, 2023