Authors
Gerald Schermann, Sali Zumberi, Jürgen Cito
Publication date
2018/5/28
Book
Proceedings of the 15th International Conference on Mining Software Repositories
Pages
26-29
Description
Docker containers are standardized, self-contained units of applications, packaged with their dependencies and execution environment. The environment is defined in a Dockerfile that specifies the steps to reach a certain system state as infrastructure code, with the aim of enabling reproducible builds of the container. To lay the groundwork for research on infrastructure code, we collected structured information about the state and the evolution of Dockerfiles on GitHub and release it as a PostgreSQL database archive (over 100,000 unique Dockerfiles in over 15,000 GitHub projects). Our dataset enables answering a multitude of interesting research questions related to different kinds of software evolution behavior in the Docker ecosystem.
Total citations
20182019202020212022202320244782972
Scholar articles
G Schermann, S Zumberi, J Cito - Proceedings of the 15th International Conference on …, 2018