Authors
Andrea Bommert, Xudong Sun, Bernd Bischl, Jörg Rahnenführer, Michel Lang
Publication date
2020/3/1
Journal
Computational Statistics & Data Analysis
Volume
143
Pages
106839
Publisher
North-Holland
Description
Feature selection is one of the most fundamental problems in machine learning and has drawn increasing attention due to high-dimensional data sets emerging from different fields like bioinformatics. For feature selection, filter methods play an important role, since they can be combined with any machine learning model and can heavily reduce run time of machine learning algorithms. The aim of the analyses is to review how different filter methods work, to compare their performance with respect to both run time and predictive accuracy, and to provide guidance for applications. Based on 16 high-dimensional classification data sets, 22 filter methods are analyzed with respect to run time and accuracy when combined with a classification method. It is concluded that there is no group of filter methods that always outperforms all other methods, but recommendations on filter methods that perform well on many of the data …
Total citations
201920202021202220232024250124147149113
Scholar articles
A Bommert, X Sun, B Bischl, J Rahnenführer, M Lang - Computational Statistics & Data Analysis, 2020