Authors
Feng Zhang, Audris Mockus, Iman Keivanloo, Ying Zou
Publication date
2014/5/31
Book
Proceedings of the 11th working conference on mining software repositories
Pages
182-191
Description
To predict files with defects, a suitable prediction model must be built for a software project from either itself (within-project) or other projects (cross-project). A universal defect prediction model that is built from the entire set of diverse projects would relieve the need for building models for an individual project. A universal model could also be interpreted as a basic relationship between software metrics and defects. However, the variations in the distribution of predictors pose a formidable obstacle to build a universal model. Such variations exist among projects with different context factors (e.g., size and programming language). To overcome this challenge, we propose context-aware rank transformations for predictors. We cluster projects based on the similarity of the distribution of 26 predictors, and derive the rank transformations using quantiles of predictors for a cluster. We then fit the universal model on the …
Total citations
2014201520162017201820192020202120222023202422324202316221620113
Scholar articles
F Zhang, A Mockus, I Keivanloo, Y Zou - Proceedings of the 11th working conference on mining …, 2014