TY - JOUR
T1 - Identification of urban sectors prone to solid waste accumulation
T2 - A machine learning approach based on social indicators
AU - Izquierdo-Horna, Luis
AU - Damazo, Miker
AU - Yanayaco, Deyvis
N1 - Publisher Copyright:
© 2022 Elsevier Ltd
PY - 2022/9
Y1 - 2022/9
N2 - In the last decades, the accumulation of municipal solid waste in urban areas has become a latent concern in our society due to its implications for the exposed population and the possible health and environmental issues it may cause. In this sense, this research study contributes to the timely identification of these sectors according to the anthropogenic characteristics of their residents as dictated by 10 social indicators (i.e., age, education, income, among others) sorted into three assessment categories (sociodemographic, sociocultural, and socioeconomic). Then, the data collected was processed and analyzed using two machine learning algorithms (random forest (RF) and logistic regression (LR)). The primary information that fed the machine learning model was collected through field visits and local/national reports. For this research, the Puente Piedra and Chaclacayo districts, both located in the province of Lima, Peru, were selected as case studies. Results suggest that the most relevant social indicators that help identifying these sectors are monthly income, consumption patterns, age, and household population density. The experiments showed that the RF algorithm has the best performance, since it efficiently identified 63% of the possible solid waste accumulation zones. In addition, both models were capable of determining different classes (AUC – RF = 0.65, AUC – LR = 0.71). Finally, the proposed approach is applicable and reproducible in different sectors of the national Peruvian territory.
AB - In the last decades, the accumulation of municipal solid waste in urban areas has become a latent concern in our society due to its implications for the exposed population and the possible health and environmental issues it may cause. In this sense, this research study contributes to the timely identification of these sectors according to the anthropogenic characteristics of their residents as dictated by 10 social indicators (i.e., age, education, income, among others) sorted into three assessment categories (sociodemographic, sociocultural, and socioeconomic). Then, the data collected was processed and analyzed using two machine learning algorithms (random forest (RF) and logistic regression (LR)). The primary information that fed the machine learning model was collected through field visits and local/national reports. For this research, the Puente Piedra and Chaclacayo districts, both located in the province of Lima, Peru, were selected as case studies. Results suggest that the most relevant social indicators that help identifying these sectors are monthly income, consumption patterns, age, and household population density. The experiments showed that the RF algorithm has the best performance, since it efficiently identified 63% of the possible solid waste accumulation zones. In addition, both models were capable of determining different classes (AUC – RF = 0.65, AUC – LR = 0.71). Finally, the proposed approach is applicable and reproducible in different sectors of the national Peruvian territory.
KW - Machine learning
KW - Peru
KW - Social indicators
KW - Waste accumulation
KW - Waste management
UR - http://www.scopus.com/inward/record.url?scp=85131712795&partnerID=8YFLogxK
U2 - 10.1016/j.compenvurbsys.2022.101834
DO - 10.1016/j.compenvurbsys.2022.101834
M3 - Article
AN - SCOPUS:85131712795
SN - 0198-9715
VL - 96
JO - Computers, Environment and Urban Systems
JF - Computers, Environment and Urban Systems
M1 - 101834
ER -