Knowledge Agora



Scientific Article details

Title Performance analysis of regression algorithms and feature selection techniques to predict PM2.5 in smart cities
ID_Doc 43957
Authors Banga, A; Ahuja, R; Sharma, SC
Title Performance analysis of regression algorithms and feature selection techniques to predict PM2.5 in smart cities
Year 2023
Published International Journal Of System Assurance Engineering And Management, 14, Suppl 3
DOI 10.1007/s13198-020-01049-9
Abstract With an increase in the urban population, environmental pollution is drastically increased. Air pollution is one of the significant issues in smart cities. The higher value of PM2.5 can cause various health issues like respiratory disease, heart attack, lung disease, and fatigue. Predicting PM2.5 can help the administration to warn people at risk and make scientific measures to reduce pollution. Existing work has utilized various regression models to predict air pollution; however, different feature selection techniques with the regression algorithm have not yet been explored. This paper has implemented five feature selection techniques (namely, Recursive Feature Elimination, Analysis of Variance, Random Forest, Variance Threshold, and Light Gradient Boosting) to select the best features. Further, six regression algorithms and ensemble models (Extra Tree, Decision Tree, XGBoost, Random Forest, Light GBM, and AdaBoost) are applied to predict PM2.5 using python language on the dataset of five cities of China. The models are compared based on the Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and R-2 parameters. We observed that the AdaBoost algorithm with the Light GBM feature selection technique gives the highest performance among all the five datasets. The highest performance values (MAE 0.07, RMSE 0.14, and R-2 0.94) are given by the AdaBoost algorithm with LightGBM feature selection on the Chengdu dataset. The computed feature importance has shown that humidity, cbwd, dew point, and pressure play an essential role in air pollution.
Author Keywords AQI; Regression models; PM2.5; Machine learning; Smart city
Index Keywords Index Keywords
Document Type Other
Open Access Open Access
Source Emerging Sources Citation Index (ESCI)
EID WOS:000606161300001
WoS Category Engineering, Multidisciplinary
Research Area Engineering
PDF
Similar atricles
Scroll