TY - GEN
T1 - Comparative Study of Spatial Prediction Models for Estimating PM2.5 Concentration Level in Urban Areas
AU - Vargas-Campos, Irvin Rosendo
AU - Villanueva, Edwin
N1 - Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - Having accurate spatial prediction models of air pollutant concentrations can be very helpful to alleviate the shortage of monitoring stations, specially in low-to-middle income countries. However, given the large diversity of model types, both statistical, numerical and machine learning (ML) based, it is not clear which of them are most suitable for this task. In this paper we study the predictive capabilities of common machine learning methods for the spatial prediction of PM2.5 concentration level. Three relevant factors were scrutinized: the extent to which meteorological variables impact the prediction performance; the effect of variable normalization by inverse distance weighting (IDW); and the number of neighborhood stations needed to maximize predictive performance. Results in a dataset from Beijing monitoring network show that simple models like Linear Regresors trained on IDW normalized variables can cope with this task. Some knowledge have been derived to guide the construction of competent models for spatial prediction of PM2.5 concentrations with ML-based methods.
AB - Having accurate spatial prediction models of air pollutant concentrations can be very helpful to alleviate the shortage of monitoring stations, specially in low-to-middle income countries. However, given the large diversity of model types, both statistical, numerical and machine learning (ML) based, it is not clear which of them are most suitable for this task. In this paper we study the predictive capabilities of common machine learning methods for the spatial prediction of PM2.5 concentration level. Three relevant factors were scrutinized: the extent to which meteorological variables impact the prediction performance; the effect of variable normalization by inverse distance weighting (IDW); and the number of neighborhood stations needed to maximize predictive performance. Results in a dataset from Beijing monitoring network show that simple models like Linear Regresors trained on IDW normalized variables can cope with this task. Some knowledge have been derived to guide the construction of competent models for spatial prediction of PM2.5 concentrations with ML-based methods.
KW - Air quality
KW - Machine learning
KW - PM
KW - Spatial prediction
UR - http://www.scopus.com/inward/record.url?scp=85111111277&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-76228-5_12
DO - 10.1007/978-3-030-76228-5_12
M3 - Conference contribution
AN - SCOPUS:85111111277
SN - 9783030762278
T3 - Communications in Computer and Information Science
SP - 169
EP - 180
BT - Information Management and Big Data - 7th Annual International Conference, SIMBig 2020, Proceedings
A2 - Lossio-Ventura, Juan Antonio
A2 - Valverde-Rebaza, Jorge Carlos
A2 - Díaz, Eduardo
A2 - Alatrista-Salas, Hugo
PB - Springer Science and Business Media Deutschland GmbH
T2 - 7th Annual International Conference on Information Management and Big Data, SIMBig 2020
Y2 - 1 October 2020 through 3 October 2020
ER -