IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v14y2022i18p11403-d912424.html
   My bibliography  Save this article

Improving Air Pollution Prediction Modelling Using Wrapper Feature Selection

Author

Listed:
  • Ahmad Zia Ul-Saufie

    (Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, Shah Alam 40450, Selangor, Malaysia)

  • Nurul Haziqah Hamzan

    (Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, Shah Alam 40450, Selangor, Malaysia)

  • Zulaika Zahari

    (Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, Shah Alam 40450, Selangor, Malaysia)

  • Wan Nur Shaziayani

    (Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, Shah Alam 40450, Selangor, Malaysia)

  • Norazian Mohamad Noor

    (Faculty of Civil Engineering Technology, Universiti Malaysia Perlis, Kompleks Pengajian Jejawi 3, Arau 02600, Perlis, Malaysia)

  • Mohd Remy Rozainy Mohd Arif Zainol

    (School of Civil Engineering, Engineering Campus, Universiti Sains Malaysia, Nibong Tebal 14300, Pulau Pinang, Malaysia)

  • Andrei Victor Sandu

    (Faculty of Material Science and Engineering, Gheorghe Asachi Technical University of Iasi, 61 D. Mangeron Blvd., 700050 Iasi, Romania
    Romanian Inventors Forum, St. P. Movila 3, 700089 Iasi, Romania
    National Institute for Research and Development in Environmental Protection INCDPM, Splaiul Independentei 294, 060031 Bucharest, Romania)

  • Gyorgy Deak

    (National Institute for Research and Development in Environmental Protection INCDPM, Splaiul Independentei 294, 060031 Bucharest, Romania)

  • Petrica Vizureanu

    (Faculty of Material Science and Engineering, Gheorghe Asachi Technical University of Iasi, 61 D. Mangeron Blvd., 700050 Iasi, Romania
    Technical Sciences Academy of Romania, Dacia Blvd 26, 030167 Bucharest, Romania)

Abstract

Feature selection is considered as one of the essential steps in data pre-processing. However, all of the previous studies on predicting PM 10 concentration in Malaysia have been limited to statistical method feature selection, and none of these studies used machine-learning approaches. Therefore, the objective of this research is to investigate the influence variables of the PM 10 prediction model by using wrapper feature selection to compare the prediction model performance of different wrapper feature selection and to predict the concentration of PM 10 for the next day. This research uses 10 years of daily data on pollutant concentrations from two stations (Klang and Shah Alam) obtained from the Department of Environment Malaysia (DOE) from 2009 until 2018. Six wrapper methods (forward selection, backward elimination, stepwise, brute-force, weight-guided and genetic algorithm evolution and the predictive analytics multiple linear regression (MLR) and artificial neural network (ANN)) were implemented in this study. This study found that brute-force is the dominant wrapper method in most of the best models in selecting important features for MLR. Moreover, compared to MLR, ANN provides more advantages regarding model accuracy and permits feature selection in predicting PM 10 . The overall results revealed that the RMSE value for next day prediction in Klang is 20.728, while the AE value is 15.69. Furthermore, the RMSE value for next day prediction in Shah Alam is 10.004, while the AE value is 7.982. Finally, all of the predicted models in Klang and Shah Alam can be used to predict the PM 10 concentrations. This proposed model can be used as a tool for an early warning system in giving air quality information to local authorities in order to formulate air-quality-improvement strategies.

Suggested Citation

  • Ahmad Zia Ul-Saufie & Nurul Haziqah Hamzan & Zulaika Zahari & Wan Nur Shaziayani & Norazian Mohamad Noor & Mohd Remy Rozainy Mohd Arif Zainol & Andrei Victor Sandu & Gyorgy Deak & Petrica Vizureanu, 2022. "Improving Air Pollution Prediction Modelling Using Wrapper Feature Selection," Sustainability, MDPI, vol. 14(18), pages 1-16, September.
  • Handle: RePEc:gam:jsusta:v:14:y:2022:i:18:p:11403-:d:912424
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/14/18/11403/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/14/18/11403/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Wu, Binrong & Wang, Lin & Zeng, Yu-Rong, 2022. "Interpretable wind speed prediction with multivariate time series and temporal fusion transformers," Energy, Elsevier, vol. 252(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ezgi Güler & Süheyla Yerel Kandemir, 2024. "Analysis of PM 10 Substances via Intuitionistic Fuzzy Decision-Making and Statistical Evaluation," Sustainability, MDPI, vol. 16(17), pages 1-23, September.
    2. Chelladurai Aarthi & Varatharaj Jeya Ramya & Przemysław Falkowski-Gilski & Parameshachari Bidare Divakarachari, 2023. "Balanced Spider Monkey Optimization with Bi-LSTM for Sustainable Air Quality Prediction," Sustainability, MDPI, vol. 15(2), pages 1-16, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Katarzyna Rudnik & Anna Hnydiuk-Stefan & Aneta Kucińska-Landwójtowicz & Łukasz Mach, 2022. "Forecasting Day-Ahead Carbon Price by Modelling Its Determinants Using the PCA-Based Approach," Energies, MDPI, vol. 15(21), pages 1-23, October.
    2. Lin Wang & Wuyue An & Feng‐Ting Li, 2024. "Text‐based corn futures price forecasting using improved neural basis expansion network," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(6), pages 2042-2063, September.
    3. Shengxiang Lv & Lin Wang & Sirui Wang, 2023. "A Hybrid Neural Network Model for Short-Term Wind Speed Forecasting," Energies, MDPI, vol. 16(4), pages 1-18, February.
    4. Wuyue An & Lin Wang & Dongfeng Zhang, 2023. "Comprehensive commodity price forecasting framework using text mining methods," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(7), pages 1865-1888, November.
    5. Nascimento, Erick Giovani Sperandio & de Melo, Talison A.C. & Moreira, Davidson M., 2023. "A transformer-based deep neural network with wavelet transform for forecasting wind speed and wind energy," Energy, Elsevier, vol. 278(C).
    6. Lu Peng & Sheng‐Xiang Lv & Lin Wang, 2024. "Explainable machine learning techniques based on attention gate recurrent unit and local interpretable model‐agnostic explanations for multivariate wind speed forecasting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(6), pages 2064-2087, September.
    7. Lv, Sheng-Xiang & Wang, Lin, 2023. "Multivariate wind speed forecasting based on multi-objective feature selection approach and hybrid deep learning model," Energy, Elsevier, vol. 263(PE).
    8. Karla Schröder & Gonzalo Farias & Sebastián Dormido-Canto & Ernesto Fabregas, 2024. "Comparative Analysis of Deep Learning Methods for Fault Avoidance and Predicting Demand in Electrical Distribution," Energies, MDPI, vol. 17(11), pages 1-13, June.
    9. Shengcai Zhang & Changsheng Zhu & Xiuting Guo, 2024. "Wind-Speed Multi-Step Forecasting Based on Variational Mode Decomposition, Temporal Convolutional Network, and Transformer Model," Energies, MDPI, vol. 17(9), pages 1-22, April.
    10. Zhang, Chu & Qiao, Xiujie & Zhang, Zhao & Wang, Yuhan & Fu, Yongyan & Nazir, Muhammad Shahzad & Peng, Tian, 2024. "Simultaneous forecasting of wind speed for multiple stations based on attribute-augmented spatiotemporal graph convolutional network and tree-structured parzen estimator," Energy, Elsevier, vol. 295(C).
    11. Du, Pei & Yang, Dongchuan & Li, Yanzhao & Wang, Jianzhou, 2024. "An innovative interpretable combined learning model for wind speed forecasting," Applied Energy, Elsevier, vol. 358(C).
    12. Fatma Mazen Ali Mazen & Yomna Shaker & Rania Ahmed Abul Seoud, 2023. "Forecasting of Solar Power Using GRU–Temporal Fusion Transformer Model and DILATE Loss Function," Energies, MDPI, vol. 16(24), pages 1-24, December.
    13. Li, Xin & Xu, Yechi & Law, Rob & Wang, Shouyang, 2024. "Enhancing tourism demand forecasting with a transformer-based framework," Annals of Tourism Research, Elsevier, vol. 107(C).
    14. Li, Yang & Shen, Xiaojun & Zhou, Chongcheng, 2023. "Dynamic multi-turbines spatiotemporal correlation model enabled digital twin technology for real-time wind speed prediction," Renewable Energy, Elsevier, vol. 203(C), pages 841-853.
    15. Ke Xu & Junli Zhang & Junhao Huang & Hongbo Tan & Xiuli Jing & Tianxiang Zheng, 2024. "Forecasting Visitor Arrivals at Tourist Attractions: A Time Series Framework with the N-BEATS for Sustainable Tourism," Sustainability, MDPI, vol. 16(18), pages 1-28, September.
    16. Wu, Binrong & Yu, Sihao & Peng, Lu & Wang, Lin, 2024. "Interpretable wind speed forecasting with meteorological feature exploring and two-stage decomposition," Energy, Elsevier, vol. 294(C).
    17. Athanasios Ioannis Arvanitidis & Dimitrios Bargiotas & Dimitrios Kontogiannis & Athanasios Fevgas & Miltiadis Alamaniotis, 2022. "Optimized Data-Driven Models for Short-Term Electricity Price Forecasting Based on Signal Decomposition and Clustering Techniques," Energies, MDPI, vol. 15(21), pages 1-24, October.
    18. Parri, Srihari & Teeparthi, Kiran & Kosana, Vishalteja, 2024. "A hybrid methodology using VMD and disentangled features for wind speed forecasting," Energy, Elsevier, vol. 288(C).
    19. Bentsen, Lars Ødegaard & Warakagoda, Narada Dilp & Stenbro, Roy & Engelstad, Paal, 2023. "Spatio-temporal wind speed forecasting using graph networks and novel Transformer architectures," Applied Energy, Elsevier, vol. 333(C).
    20. Konstantinos Blazakis & Yiannis Katsigiannis & Georgios Stavrakakis, 2022. "One-Day-Ahead Solar Irradiation and Windspeed Forecasting with Advanced Deep Learning Techniques," Energies, MDPI, vol. 15(12), pages 1-25, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:14:y:2022:i:18:p:11403-:d:912424. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.