Prediction of Air Quality Index Using Ensemble Models
Abstract
The impact of air pollution on health is measured by the Air Quality Index (AQI). Accurate AQI prediction is essential for pollution reduction and public health recommendations. Traditional methods of monitoring air quality are inaccurate and time-consuming. This study uses IoT-based air quality data from Kampung Kalipaten, Tangerang to build an AQI prediction model with machine learning, specifically an ensemble model. Ensemble techniques such as bagging and boosting, which increase the reliability of predictions by reducing model bias and inconsistency, improve AQI prediction. Four ensemble models used in this study, they are Random Forest Regressor, Gradient Boosting Regressor, Adaboosting Regressor, and Bagging Regressor. As the evaluation, RMSE and R2 metrics used. Random Forest Regressor perform the best with RMSE value of 0.6054 and R2 value of 0.6271, although no significant differences of RMSE and R2 value of the rest models.
Downloads
References
T. Madan, S. Sagar, and D. Virmani, “Air Quality Prediction using Machine Learning Algorithms –A Review,” in 2020 2nd International Conference on Advances in Computing, Communication Control and Networking (ICACCCN), Dec. 2020, vol. 840, pp. 140–145, doi: 10.1109/ICACCCN51052.2020.9362912.
N. S. Gupta, Y. Mohta, K. Heda, R. Armaan, B. Valarmathi, and G. Arulkumaran, “Prediction of Air Quality Index Using Machine Learning Techniques: A Comparative Analysis,” J. Environ. Public Health, vol. 2023, pp. 1–26, Jan. 2023, doi: 10.1155/2023/4916267.
U. S. E. P. Agency and I. Division, “Air Quality Index (AQI),” in Encyclopedia of Quality of Life and Well-Being Research, no. February, Dordrecht: Springer Netherlands, 2014, pp. 120–120.
T. H. Rochadiani et al., “Design of Air Quality Monitoring Using LoRaWAN In Human Settlement,” in 2022 IEEE 14th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM), Dec. 2022, pp. 1–6, doi: 10.1109/HNICEM57413.2022.10109605.
C. Wu, J. Wang, X. Chen, P. Du, and W. Yang, “A novel hybrid system based on multi-objective optimization for wind speed forecasting,” Renew. Energy, vol. 146, pp. 149–165, Feb. 2020, doi: 10.1016/j.renene.2019.04.157.
A. Mosavi, F. Sajedi Hosseini, B. Choubin, M. Goodarzi, A. A. Dineva, and E. Rafiei Sardooi, “Ensemble Boosting and Bagging Based Machine Learning Models for Groundwater Potential Prediction,” Water Resour. Manag., vol. 35, no. 1, pp. 23–37, Jan. 2021, doi: 10.1007/s11269-020-02704-3.
Y. Li and Y. Pan, “A novel ensemble deep learning model for stock prediction based on stock prices and news,” Int. J. Data Sci. Anal., vol. 13, no. 2, pp. 139–149, Mar. 2022, doi: 10.1007/s41060-021-00279-9.
B. Zhang, M. Duan, Y. Sun, Y. Lyu, Y. Hou, and T. Tan, “Air Quality Index Prediction in Six Major Chinese Urban Agglomerations: A Comparative Study of Single Machine Learning Model, Ensemble Model, and Hybrid Model,” Atmosphere (Basel)., vol. 14, no. 10, 2023, doi: 10.3390/atmos14101478.
Y.-C. Liang, Y. Maimury, A. H.-L. Chen, and J. R. C. Juarez, “Machine Learning-Based Prediction of Air Quality,” Appl. Sci., vol. 10, no. 24, p. 9151, Dec. 2020, doi: 10.3390/app10249151.
C. Li, Y. Li, and Y. Bao, “Research on Air Quality Prediction Based on Machine Learning,” in 2021 2nd International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI), Nov. 2021, pp. 77–81, doi: 10.1109/ICHCI54629.2021.00022.
J. Zheng, D. Xin, Q. Cheng, M. Tian, and L. Yang, “The Random Forest Model for analyzing and Forecasting the US Stock Market under the background of smart finance,” 2024, pp. 82–90.
S. Park, S. Jung, J. Lee, and J. Hur, “A Short-Term Forecasting of Wind Power Outputs Based on Gradient Boosting Regression Tree Algorithms,” Energies, vol. 16, no. 3, p. 1132, Jan. 2023, doi: 10.3390/en16031132.
K. Mukherjee, S. S. Ahmed, M. Aasif, S. Kundu, and S. Ghosh, “House Rent Prediction Using Ensemble-Based Regression With Real-Time Data,” in Novel Research and Development Approaches in Heterogeneous Systems and Algorithms, no. March 2023, 2023, pp. 258–271.
J. T R, N. S. Reddy, and U. D. Acharya, “Modeling Daily Reference Evapotranspiration from Climate Variables: Assessment of Bagging and Boosting Regression Approaches,” Water Resour. Manag., vol. 37, no. 3, pp. 1013–1032, Feb. 2023, doi: 10.1007/s11269-022-03399-4.
D. Iskandaryan, F. Ramos, and S. Trilles, “Air Quality Prediction in Smart Cities Using Machine Learning Technologies Based on Sensor Data: A Review,” Appl. Sci., vol. 10, no. 7, p. 2401, Apr. 2020, doi: 10.3390/app10072401.
M. M. Londhe, “Data Mining and Machine Learning Approach for Air Quality Index Prediction,” Int. J. Eng. Appl. Phys., vol. 1, no. 2, pp. 136–153, 2021, [Online]. Available: https://ijeap.org/.
Copyright (c) 2024 Theresia Herlina Rochadiani
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) ) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).