View article

[PDF] from acm.org

Seed selection for successful fuzzing

Authors

Adrian Herrera, Hendra Gunadi, Shane Magrath, Michael Norrish, Mathias Payer, Antony L Hosking

Publication date

2021/7/11

Book

Proceedings of the 30th ACM SIGSOFT international symposium on software testing and analysis

Pages

230-243

Description

Mutation-based greybox fuzzing---unquestionably the most widely-used fuzzing technique---relies on a set of non-crashing seed inputs (a corpus) to bootstrap the bug-finding process. When evaluating a fuzzer, common approaches for constructing this corpus include: (i) using an empty file; (ii) using a single seed representative of the target's input format; or (iii) collecting a large number of seeds (e.g., by crawling the Internet). Little thought is given to how this seed choice affects the fuzzing process, and there is no consensus on which approach is best (or even if a best approach exists).

To address this gap in knowledge, we systematically investigate and evaluate how seed selection affects a fuzzer's ability to find bugs in real-world software. This includes a systematic review of seed selection practices used in both evaluation and deployment contexts, and a large-scale empirical evaluation (over 33 CPU-years) of …

Total citations

Cited by 82

20212022202320244 25 32 21

Scholar articles

Seed selection for successful fuzzing

A Herrera, H Gunadi, S Magrath, M Norrish, M Payer… - Proceedings of the 30th ACM SIGSOFT international …, 2021