IDEAS home Printed from https://ideas.repec.org/a/spr/waterr/v34y2020i1d10.1007_s11269-019-02447-w.html
   My bibliography  Save this article

Decision Tree-Based Data Mining and Rule Induction for Identifying High Quality Groundwater Zones to Water Supply Management: a Novel Hybrid Use of Data Mining and GIS

Author

Listed:
  • Mehrdad Jeihouni

    (University of Tehran)

  • Ara Toomanian

    (University of Tehran)

  • Ali Mansourian

    (University of Tehran
    Lund University)

Abstract

Groundwater is an important source to supply drinking water demands in both arid and semi-arid regions. Nevertheless, locating high quality drinking water is a major challenge in such areas. Against this background, this study proceeds to utilize and compare five decision tree-based data mining algorithms including Ordinary Decision Tree (ODT), Random Forest (RF), Random Tree (RT), Chi-square Automatic Interaction Detector (CHAID), and Iterative Dichotomiser 3 (ID3) for rule induction in order to identify high quality groundwater zones for drinking purposes. The proposed methodology works by initially extracting key relevant variables affecting water quality (electrical conductivity, pH, hardness and chloride) out of a total of eight existing parameters, and using them as inputs for the rule induction process. The algorithms were evaluated with reference to both continuous and discrete datasets. The findings were speculative of the superiority, performance-wise, of rule induction using the continuous dataset as opposed to the discrete dataset. Based on validation results, in continuous dataset, RF and ODT showed higher and RT showed acceptable performance. The groundwater quality maps were generated by combining the effective parameters distribution maps using inducted rules from RF, ODT, and RT, in GIS environment. A quick glance at the generated maps reveals a drop in the quality of groundwater from south to north as well as from east to west in the study area. The RF showed the highest performance (accuracy of 97.10%) among its counterparts; and so the generated map based on rules inducted from RF is more reliable. The RF and ODT methods are more suitable in the case of continuous dataset and can be applied for rule induction to determine water quality with higher accuracy compared to other tested algorithms.

Suggested Citation

  • Mehrdad Jeihouni & Ara Toomanian & Ali Mansourian, 2020. "Decision Tree-Based Data Mining and Rule Induction for Identifying High Quality Groundwater Zones to Water Supply Management: a Novel Hybrid Use of Data Mining and GIS," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 34(1), pages 139-154, January.
  • Handle: RePEc:spr:waterr:v:34:y:2020:i:1:d:10.1007_s11269-019-02447-w
    DOI: 10.1007/s11269-019-02447-w
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11269-019-02447-w
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11269-019-02447-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Guoqiang Chen & Tianyu Long & Jiangong Xiong & Yun Bai, 2017. "Multiple Random Forests Modelling for Urban Water Consumption Forecasting," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 31(15), pages 4715-4729, December.
    2. Peters, Jan & Baets, Bernard De & Verhoest, Niko E.C. & Samson, Roeland & Degroeve, Sven & Becker, Piet De & Huybrechts, Willy, 2007. "Random forests as a tool for ecohydrological distribution modelling," Ecological Modelling, Elsevier, vol. 207(2), pages 304-318.
    3. Seyed Amir Naghibi & Kourosh Ahmadi & Alireza Daneshi, 2017. "Application of Support Vector Machine, Random Forest, and Genetic Algorithm Optimized Random Forest Models in Groundwater Potential Mapping," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 31(9), pages 2761-2775, July.
    4. Shaghayegh Miraki & Sasan Hedayati Zanganeh & Kamran Chapi & Vijay P. Singh & Ataollah Shirzadi & Himan Shahabi & Binh Thai Pham, 2019. "Mapping Groundwater Potential Using a Novel Hybrid Intelligence Approach," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 33(1), pages 281-302, January.
    5. Anas Mahmood Al-Juboori, 2019. "Generating Monthly Stream Flow Using Nearest River Data: Assessing Different Trees Models," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 33(9), pages 3257-3270, July.
    6. G. Robinson & S. Moutari & A. A. Ahmed & G. A. Hamill, 2018. "An Advanced Calibration Method for Image Analysis in Laboratory-Scale Seawater Intrusion Problems," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 32(9), pages 3087-3102, July.
    7. Zohreh Sherafatpour & Abbas Roozbahani & Yousef Hasani, 2019. "Agricultural Water Allocation by Integration of Hydro-Economic Modeling with Bayesian Networks and Random Forest Approaches," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 33(7), pages 2277-2299, May.
    8. Daniela Ducci, 1999. "GIS Techniques for Mapping Groundwater Contamination Risk," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 20(2), pages 279-294, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mojtaba Poursaeid & Amir Houssain Poursaeid & Saeid Shabanlou, 2022. "A Comparative Study of Artificial Intelligence Models and A Statistical Method for Groundwater Level Prediction," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(5), pages 1499-1519, March.
    2. Mohammad Zounemat-Kermani & Abdollah Ramezani-Charmahineh & Reza Razavi & Meysam Alizamir & Taha B.M.J. Ouarda, 2020. "Machine Learning and Water Economy: a New Approach to Predicting Dams Water Sales Revenue," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 34(6), pages 1893-1911, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Weiyu Yu & Nicola A Wardrop & Robert E S Bain & Victor Alegana & Laura J Graham & Jim A Wright, 2019. "Mapping access to domestic water supplies from incomplete data in developing countries: An illustrative assessment for Kenya," PLOS ONE, Public Library of Science, vol. 14(5), pages 1-19, May.
    2. Sarmad Dashti Latif & Ali Najah Ahmed, 2023. "Streamflow Prediction Utilizing Deep Learning and Machine Learning Algorithms for Sustainable Water Supply Management," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(8), pages 3227-3241, June.
    3. Dieu Tien Bui & Ataollah Shirzadi & Ata Amini & Himan Shahabi & Nadhir Al-Ansari & Shahriar Hamidi & Sushant K. Singh & Binh Thai Pham & Baharin Bin Ahmad & Pezhman Taherei Ghazvinei, 2020. "A Hybrid Intelligence Approach to Enhance the Prediction Accuracy of Local Scour Depth at Complex Bridge Piers," Sustainability, MDPI, vol. 12(3), pages 1-24, February.
    4. Viet-Ha Nhu & Ataollah Shirzadi & Himan Shahabi & Sushant K. Singh & Nadhir Al-Ansari & John J. Clague & Abolfazl Jaafari & Wei Chen & Shaghayegh Miraki & Jie Dou & Chinh Luu & Krzysztof Górski & Binh, 2020. "Shallow Landslide Susceptibility Mapping: A Comparison between Logistic Model Tree, Logistic Regression, Naïve Bayes Tree, Artificial Neural Network, and Support Vector Machine Algorithms," IJERPH, MDPI, vol. 17(8), pages 1-30, April.
    5. Yikalo H. Araya & Tarmo K. Remmel & Ajith H. Perera, 2016. "What governs the presence of residual vegetation in boreal wildfires?," Journal of Geographical Systems, Springer, vol. 18(2), pages 159-181, April.
    6. Sarah Mittlefehldt & Erin Bunting & Emily Huff & Joseph Welsh & Robert Goodwin, 2021. "New Methods for Assessing Sustainability of Wood-Burning Energy Facilities: Combining Historical and Spatial Approaches," Energies, MDPI, vol. 14(23), pages 1-18, November.
    7. Bemah Ibrahim & Isaac Ahenkorah & Anthony Ewusi, 2022. "Explainable Risk Assessment of Rockbolts’ Failure in Underground Coal Mines Based on Categorical Gradient Boosting and SHapley Additive exPlanations (SHAP)," Sustainability, MDPI, vol. 14(19), pages 1-16, September.
    8. Madhumita Sahoo & Aman Kasot & Anirban Dhar & Amlanjyoti Kar, 2018. "On Predictability of Groundwater Level in Shallow Wells Using Satellite Observations," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 32(4), pages 1225-1244, March.
    9. Anas Mahmood Al-Juboori, 2021. "A Hybrid Model to Predict Monthly Streamflow Using Neighboring Rivers Annual Flows," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 35(2), pages 729-743, January.
    10. Vanesa Mateo-Pérez & Marina Corral-Bobadilla & Francisco Ortega-Fernández & Vicente Rodríguez-Montequín, 2021. "Determination of Water Depth in Ports Using Satellite Data Based on Machine Learning Algorithms," Energies, MDPI, vol. 14(9), pages 1-22, April.
    11. Eamen, Leila & Brouwer, Roy & Razavi, Saman, 2020. "The economic impacts of water supply restrictions due to climate and policy change: A transboundary river basin supply-side input-output analysis," Ecological Economics, Elsevier, vol. 172(C).
    12. Fatima Zahra Echogdali & Said Boutaleb & Rosine Basseu Kpan & Mohammed Ouchchen & Amine Bendarma & Hasna El Ayady & Kamal Abdelrahman & Mohammed S. Fnais & Kochappi Sathyan Sajinkumar & Mohamed Abioui, 2022. "Application of Fuzzy Logic and Fractal Modeling Approach for Groundwater Potential Mapping in Semi-Arid Akka Basin, Southeast Morocco," Sustainability, MDPI, vol. 14(16), pages 1-17, August.
    13. Shuang Zhang & Shaobo Liu & Qikang Zhong & Kai Zhu & Hongpeng Fu, 2024. "Assessing Eco-Environmental Effects and Its Impacts Mechanisms in the Mountainous City: Insights from Ecological–Production–Living Spaces Using Machine Learning Models in Chongqing," Land, MDPI, vol. 13(8), pages 1-24, August.
    14. Binh Thai Pham & Ataollah Shirzadi & Himan Shahabi & Ebrahim Omidvar & Sushant K. Singh & Mehebub Sahana & Dawood Talebpour Asl & Baharin Bin Ahmad & Nguyen Kim Quoc & Saro Lee, 2019. "Landslide Susceptibility Assessment by Novel Hybrid Machine Learning Algorithms," Sustainability, MDPI, vol. 11(16), pages 1-25, August.
    15. Li Yang & Xin Fang & Xue Wang & Shanshan Li & Junqi Zhu, 2022. "Risk Prediction of Coal and Gas Outburst in Deep Coal Mines Based on the SAPSO-ELM Algorithm," IJERPH, MDPI, vol. 19(19), pages 1-18, September.
    16. Guanyin Shuai & Yan Zhou & Jingli Shao & Yali Cui & Qiulan Zhang & Chaowei Jin & Shuyuan Xu, 2024. "Comparison of Multiple Machine Learning Methods for Correcting Groundwater Levels Predicted by Physics-Based Models," Sustainability, MDPI, vol. 16(2), pages 1-18, January.
    17. Phong Tung Nguyen & Duong Hai Ha & Abolfazl Jaafari & Huu Duy Nguyen & Tran Van Phong & Nadhir Al-Ansari & Indra Prakash & Hiep Van Le & Binh Thai Pham, 2020. "Groundwater Potential Mapping Combining Artificial Neural Network and Real AdaBoost Ensemble Technique: The DakNong Province Case-study, Vietnam," IJERPH, MDPI, vol. 17(7), pages 1-20, April.
    18. Baalousha, Husam, 2010. "Assessment of a groundwater quality monitoring network using vulnerability mapping and geostatistics: A case study from Heretaunga Plains, New Zealand," Agricultural Water Management, Elsevier, vol. 97(2), pages 240-246, February.
    19. Ali Mokhtar & Nadhir Al-Ansari & Wessam El-Ssawy & Renata Graf & Pouya Aghelpour & Hongming He & Salma M. Hafez & Mohamed Abuarab, 2023. "Prediction of Irrigation Water Requirements for Green Beans-Based Machine Learning Algorithm Models in Arid Region," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(4), pages 1557-1580, March.
    20. Shuaiwei Shi & Meiyi Hou & Zifan Gu & Ce Jiang & Weiqiang Zhang & Mengyang Hou & Chenxi Li & Zenglei Xi, 2022. "Estimation of Heavy Metal Content in Soil Based on Machine Learning Models," Land, MDPI, vol. 11(7), pages 1-19, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:waterr:v:34:y:2020:i:1:d:10.1007_s11269-019-02447-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.