Authors
Jundong Li, Kewei Cheng, Suhang Wang, Fred Morstatter, Robert P Trevino, Jiliang Tang, Huan Liu
Publication date
2017/12/6
Source
ACM computing surveys (CSUR)
Volume
50
Issue
6
Pages
1-45
Publisher
ACM
Description
Feature selection, as a data preprocessing strategy, has been proven to be effective and efficient in preparing data (especially high-dimensional data) for various data-mining and machine-learning problems. The objectives of feature selection include building simpler and more comprehensible models, improving data-mining performance, and preparing clean, understandable data. The recent proliferation of big data has presented some substantial challenges and opportunities to feature selection. In this survey, we provide a comprehensive and structured overview of recent advances in feature selection research. Motivated by current challenges and opportunities in the era of big data, we revisit feature selection research from a data perspective and review representative feature selection algorithms for conventional data, structured data, heterogeneous data and streaming data. Methodologically, to emphasize the …
Total citations
2016201720182019202020212022202320241964159292407491588689393
Scholar articles
J Li, K Cheng, S Wang, F Morstatter, RP Trevino… - ACM computing surveys (CSUR), 2017
L Jundong, C Kewei, W Suhang, M Fred - ACM Computing Surveys, 2017
J Li, K Cheng, S Wang, F Morstatter, RP Trevino… - arXiv preprint arXiv:1601.07996
J Li, K Cheng, S Wang, F Morstatter, T Robert, J Tang - Feature selection: A data perspective.< italic