View article

[PDF] from acm.org

Feature selection: A data perspective

Authors

Jundong Li, Kewei Cheng, Suhang Wang, Fred Morstatter, Robert P Trevino, Jiliang Tang, Huan Liu

Publication date

2017/12/6

Source

ACM computing surveys (CSUR)

Volume

Issue

Pages

1-45

Publisher

ACM

Description

Feature selection, as a data preprocessing strategy, has been proven to be effective and efficient in preparing data (especially high-dimensional data) for various data-mining and machine-learning problems. The objectives of feature selection include building simpler and more comprehensible models, improving data-mining performance, and preparing clean, understandable data. The recent proliferation of big data has presented some substantial challenges and opportunities to feature selection. In this survey, we provide a comprehensive and structured overview of recent advances in feature selection research. Motivated by current challenges and opportunities in the era of big data, we revisit feature selection research from a data perspective and review representative feature selection algorithms for conventional data, structured data, heterogeneous data and streaming data. Methodologically, to emphasize the …

Total citations

Cited by 3118

20162017201820192020202120222023202419 64 159 292 407 491 588 689 393

Scholar articles

Feature selection: A data perspective

J Li, K Cheng, S Wang, F Morstatter, RP Trevino… - ACM computing surveys (CSUR), 2017

Cited by 3116 Related articles All 9 versions

Trevino Robert P., Tang Jiliang, Liu Huan*

L Jundong, C Kewei, W Suhang, M Fred - ACM Computing Surveys, 2017

Cited by 21 Related articles

Feature selection: A data perspective. arXiv 2016

J Li, K Cheng, S Wang, F Morstatter, RP Trevino… - arXiv preprint arXiv:1601.07996

Cited by 9 Related articles

38; Liu, H.(2016)*

J Li, K Cheng, S Wang, F Morstatter, T Robert, J Tang - Feature selection: A data perspective.< italic

Cited by 5 Related articles