View article

A review of machine learning for modeling air quality: Overlooked but important issues

Authors

Die Tang, Yu Zhan, Fumo Yang

Publication date

2024/1/21

Source

Atmospheric Research

Pages

107261

Publisher

Elsevier

Description

Machine learning models based on satellite remote sensing have gained widespread use in estimating ground-level air pollutant concentrations, which overcome the limitations of the discontinuous spatial distribution of ground monitoring stations. However, due to the interdisciplinary nature of environmental modeling, atmospheric researchers may overlook some important issues when using machine learning. In this review, we summarize and discuss the overlooked but important issues in data preparation, model development, validation, and prediction, including feature engineering, imbalanced data, validation strategy, and model interpretation, which are critical for model generalizability. Firstly, we provide considerations and recommendations in obtaining, selecting, and using data of the main variables in machine learning for air quality mapping. Secondly, sufficient introduction and discussion are provided on …

Total citations

Cited by 7

20247

Scholar articles

A review of machine learning for modeling air quality: Overlooked but important issues

D Tang, Y Zhan, F Yang - Atmospheric Research, 2024

Cited by 7 Related articles