IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v19y2022i19p12180-d925319.html
   My bibliography  Save this article

Predicting Heavy Metal Concentrations in Shallow Aquifer Systems Based on Low-Cost Physiochemical Parameters Using Machine Learning Techniques

Author

Listed:
  • Thi-Minh-Trang Huynh

    (Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan)

  • Chuen-Fa Ni

    (Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan
    Center for Environmental Studies, National Central University, Taoyuan 32001, Taiwan)

  • Yu-Sheng Su

    (Department of Computer Science and Engineering, National Taiwan Ocean University, Keelung 202301, Taiwan)

  • Vo-Chau-Ngan Nguyen

    (College of Environment and Natural Resources, Can Tho University, Can Tho 94000, Vietnam)

  • I-Hsien Lee

    (Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan
    Center for Environmental Studies, National Central University, Taoyuan 32001, Taiwan)

  • Chi-Ping Lin

    (Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan
    Center for Environmental Studies, National Central University, Taoyuan 32001, Taiwan)

  • Hoang-Hiep Nguyen

    (Graduate Institute of Applied Geology, National Central University, Taoyuan 32001, Taiwan)

Abstract

Monitoring ex-situ water parameters, namely heavy metals, needs time and laboratory work for water sampling and analytical processes, which can retard the response to ongoing pollution events. Previous studies have successfully applied fast modeling techniques such as artificial intelligence algorithms to predict heavy metals. However, neither low-cost feature predictability nor explainability assessments have been considered in the modeling process. This study proposes a reliable and explainable framework to find an effective model and feature set to predict heavy metals in groundwater. The integrated assessment framework has four steps: model selection uncertainty, feature selection uncertainty, predictive uncertainty, and model interpretability. The results show that Random Forest is the most suitable model, and quick-measure parameters can be used as predictors for arsenic (As), iron (Fe), and manganese (Mn). Although the model performance is auspicious, it likely produces significant uncertainties. The findings also demonstrate that arsenic is related to nutrients and spatial distribution, while Fe and Mn are affected by spatial distribution and salinity. Some limitations and suggestions are also discussed to improve the prediction accuracy and interpretability.

Suggested Citation

  • Thi-Minh-Trang Huynh & Chuen-Fa Ni & Yu-Sheng Su & Vo-Chau-Ngan Nguyen & I-Hsien Lee & Chi-Ping Lin & Hoang-Hiep Nguyen, 2022. "Predicting Heavy Metal Concentrations in Shallow Aquifer Systems Based on Low-Cost Physiochemical Parameters Using Machine Learning Techniques," IJERPH, MDPI, vol. 19(19), pages 1-21, September.
  • Handle: RePEc:gam:jijerp:v:19:y:2022:i:19:p:12180-:d:925319
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/19/19/12180/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/19/19/12180/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Eric W Fox & Jay M Ver Hoef & Anthony R Olsen, 2020. "Comparing spatial regression to random forests for large environmental data sets," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-22, March.
    2. Nantian Huang & Guobo Lu & Dianguo Xu, 2016. "A Permutation Importance-Based Feature Selection Method for Short-Term Electricity Load Forecasting Using Random Forest," Energies, MDPI, vol. 9(10), pages 1-24, September.
    3. Basim Mahbooba & Mohan Timilsina & Radhya Sahal & Martin Serrano & Ahmed Mostafa Khalil, 2021. "Explainable Artificial Intelligence (XAI) to Enhance Trust Management in Intrusion Detection Systems Using Decision Tree Model," Complexity, Hindawi, vol. 2021, pages 1-11, January.
    4. Akram Seifi & Mohammad Ehteram & Vijay P. Singh & Amir Mosavi, 2020. "Modeling and Uncertainty Analysis of Groundwater Level Using Six Evolutionary Optimization Algorithms Hybridized with ANFIS, SVM, and ANN," Sustainability, MDPI, vol. 12(10), pages 1-42, May.
    5. Russell R. Barton & Barry L. Nelson & Wei Xie, 2014. "Quantifying Input Uncertainty via Simulation Confidence Intervals," INFORMS Journal on Computing, INFORMS, vol. 26(1), pages 74-87, February.
    6. Willcock, Simon & Martínez-López, Javier & Hooftman, Danny A.P. & Bagstad, Kenneth J. & Balbi, Stefano & Marzo, Alessia & Prato, Carlo & Sciandrello, Saverio & Signorello, Giovanni & Voigt, Brian & Vi, 2018. "Machine learning for ecosystem services," Ecosystem Services, Elsevier, vol. 33(PB), pages 165-174.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wen Shi & Xi Chen & Jennifer Shang, 2019. "An Efficient Morris Method-Based Framework for Simulation Factor Screening," INFORMS Journal on Computing, INFORMS, vol. 31(4), pages 745-770, October.
    2. Yao, Nengzhi(Chris) & Bai, Junhong & Yu, Zihao & Guo, Qiaozhe, 2025. "Does AI orientation facilitate operational efficiency? A contingent strategic orientation perspective," Journal of Business Research, Elsevier, vol. 186(C).
    3. Agudelo, César Augusto Ruiz & Bustos, Sandra Liliana Hurtado & Moreno, Carmen Alicia Parrado, 2020. "Modeling interactions among multiple ecosystem services. A critical review," Ecological Modelling, Elsevier, vol. 429(C).
    4. Weiwei Fan & L. Jeff Hong & Xiaowei Zhang, 2020. "Distributionally Robust Selection of the Best," Management Science, INFORMS, vol. 66(1), pages 190-208, January.
    5. Héctor Migallón & Akram Belazi & José-Luis Sánchez-Romero & Héctor Rico & Antonio Jimeno-Morenilla, 2020. "Settings-Free Hybrid Metaheuristic General Optimization Methods," Mathematics, MDPI, vol. 8(7), pages 1-25, July.
    6. Katharina Schulze & Žiga Malek & Dmitry Schepaschenko & Myroslava Lesiv & Steffen Fritz & Peter H. Verburg, 2023. "Pantropical distribution of short-rotation woody plantations: spatial probabilities under current and future climate," Mitigation and Adaptation Strategies for Global Change, Springer, vol. 28(5), pages 1-22, June.
    7. Jun Yuan & Haowei Wang & Szu Hui Ng & Victor Nian, 2020. "Ship Emission Mitigation Strategies Choice Under Uncertainty," Energies, MDPI, vol. 13(9), pages 1-20, May.
    8. Signorello, Giovanni & Prato, Carlo & Marzo, Alessia & Ientile, Renzo & Cucuzza, Giuseppe & Sciandrello, Saverio & Martínez-López, Javier & Balbi, Stefano & Villa, Ferdinando, 2018. "Are protected areas covering important biodiversity sites? An assessment of the nature protection network in Sicily (Italy)," Land Use Policy, Elsevier, vol. 78(C), pages 593-602.
    9. Richards, Daniel Rex & Lavorel, Sandra, 2022. "Integrating social media data and machine learning to analyse scenarios of landscape appreciation," Ecosystem Services, Elsevier, vol. 55(C).
    10. Manley, Kyle & Nyelele, Charity & Egoh, Benis N., 2022. "A review of machine learning and big data applications in addressing ecosystem service research gaps," Ecosystem Services, Elsevier, vol. 57(C).
    11. Chai, Xuqing & Li, Shihao & Liang, Fengwei, 2024. "A novel battery SOC estimation method based on random search optimized LSTM neural network," Energy, Elsevier, vol. 306(C).
    12. Chan-Uk Yeom & Keun-Chang Kwak, 2017. "Short-Term Electricity-Load Forecasting Using a TSK-Based Extreme Learning Machine with Knowledge Representation," Energies, MDPI, vol. 10(10), pages 1-18, October.
    13. Xinchen Gu & Aihua Long & Guihua Liu & Jiawen Yu & Hao Wang & Yongmin Yang & Pei Zhang, 2021. "Changes in Ecosystem Service Value in the 1 km Lakeshore Zone of Poyang Lake from 1980 to 2020," Land, MDPI, vol. 10(9), pages 1-19, September.
    14. Bagstad, Kenneth J. & Ingram, Jane Carter & Shapiro, Carl D. & La Notte, Alessandra & Maes, Joachim & Vallecillo, Sara & Casey, C. Frank & Glynn, Pierre D. & Heris, Mehdi P. & Johnson, Justin A. & Lau, 2021. "Lessons learned from development of natural capital accounts in the United States and European Union," Ecosystem Services, Elsevier, vol. 52(C).
    15. Abu Reza Md. Towfiqul Islam & Swapan Talukdar & Shumona Akhter & Kutub Uddin Eibek & Md. Mostafizur Rahman & Swades Pal & Mohd Waseem Naikoo & Atiqur Rahman & Amir Mosavi, 2022. "Assessing the Impact of the Farakka Barrage on Hydrological Alteration in the Padma River with Future Insight," Sustainability, MDPI, vol. 14(9), pages 1-26, April.
    16. Chiou-Jye Huang & Ping-Huan Kuo, 2018. "A Short-Term Wind Speed Forecasting Model by Using Artificial Neural Networks with Stochastic Optimization for Renewable Energy Systems," Energies, MDPI, vol. 11(10), pages 1-20, October.
    17. Junyi Wu & Shari Shang, 2020. "Managing Uncertainty in AI-Enabled Decision Making and Achieving Sustainability," Sustainability, MDPI, vol. 12(21), pages 1-17, October.
    18. Francisco Martínez-Álvarez & Alicia Troncoso & José C. Riquelme, 2017. "Recent Advances in Energy Time Series Forecasting," Energies, MDPI, vol. 10(6), pages 1-3, June.
    19. Ming-Hui Huang & Roland T. Rust, 2021. "A strategic framework for artificial intelligence in marketing," Journal of the Academy of Marketing Science, Springer, vol. 49(1), pages 30-50, January.
    20. Kleijnen, J.P.C. & Mehdad, Ehsan, 2015. "Estimating the Variance of the Predictor in Stochastic Kriging," Discussion Paper 2015-041, Tilburg University, Center for Economic Research.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:19:y:2022:i:19:p:12180-:d:925319. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.