Authors
Feng Zhang, Quan Zheng, Ying Zou, Ahmed E Hassan
Publication date
2016/5/14
Book
Proceedings of the 38th International Conference on Software Engineering
Pages
309-320
Description
Defect prediction on projects with limited historical data has attracted great interest from both researchers and practitioners. Cross-project defect prediction has been the main area of progress by reusing classifiers from other projects. However, existing approaches require some degree of homogeneity (e.g., a similar distribution of metric values) between the training projects and the target project. Satisfying the homogeneity requirement often requires significant effort (currently a very active area of research).
An unsupervised classifier does not require any training data, therefore the heterogeneity challenge is no longer an issue. In this paper, we examine two types of unsupervised classifiers: a) distance-based classifiers (e.g., k-means); and b) connectivity-based classifiers. While distance-based unsupervised classifiers have been previously used in the defect prediction literature with disappointing performance …
Total citations
2015201620172018201920202021202220232024162743493941451920
Scholar articles
F Zhang, Q Zheng, Y Zou, AE Hassan - Proceedings of the 38th International Conference on …, 2016