IDEAS home Printed from https://ideas.repec.org/a/spr/waterr/v36y2022i15d10.1007_s11269-022-03341-8.html
   My bibliography  Save this article

Machine Learning-Based Rainfall Forecasting with Multiple Non-Linear Feature Selection Algorithms

Author

Listed:
  • Prabal Das

    (Indian Institute of Technology (Indian School of Mines))

  • D. A. Sachindra

    (Maria Curie-Sklodowska University)

  • Kironmala Chanda

    (Indian Institute of Technology (Indian School of Mines))

Abstract

The present research examined the potential of two important feature selection methods, Bayesian Networks (BN) and Recursive Feature Elimination (RFE), in identifying the optimum predictors for forecasting rainfall in the Godavari Basin in India. Initially, a set of ‘probable hydro-climatological variables’ is chosen based on previous studies. Following a correlation analysis between these probable predictors and monthly rainfall, the most correlated contiguous zones were chosen as ‘potential predictors' which were subsequently used as inputs to the two feature selection algorithms, BN and RFE for selecting the ‘optimum predictors’. The optimum predictions were further utilised to develop seven state-of-the-art Machine Learning (ML) models, including Support Vector Regression (SVR), Gaussian Process Regression (GPR), Multivariate Adaptive Regression Splines (MARS), Random Forest (RF), Parallel Multi-Population Genetic Programming (PMPGP (5 demes (Case 1), 9 demes (Case 2), and 17 demes (Case 3)). The result of the correlation analysis revealed that the domain for Relative Humidity, Total Precipitable Water should be considered over the study area, and for U-Wind, V-Wind and Surface Pressure, the zones around Indian Ocean, Arabian Sea and Persian Gulf should be considered respectively. BN, as a feature selection technique for choosing the optimum predictors, was found to be more effective than RFE. In terms of prediction models, GPR and PMPGP models outperformed others, both when used alone and in conjunction with feature selection methods. The R2 values for GPR models vary from 0.82–0.41, whereas the same varies from 0.81–0.31 for PMPGP models.

Suggested Citation

  • Prabal Das & D. A. Sachindra & Kironmala Chanda, 2022. "Machine Learning-Based Rainfall Forecasting with Multiple Non-Linear Feature Selection Algorithms," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(15), pages 6043-6071, December.
  • Handle: RePEc:spr:waterr:v:36:y:2022:i:15:d:10.1007_s11269-022-03341-8
    DOI: 10.1007/s11269-022-03341-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11269-022-03341-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11269-022-03341-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Parisa Noorbeh & Abbas Roozbahani & Hamid Kardan Moghaddam, 2020. "Annual and Monthly Dam Inflow Prediction Using Bayesian Networks," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 34(9), pages 2933-2951, July.
    2. Jew Das & Nanduri V. Umamahesh, 2016. "Downscaling Monsoon Rainfall over River Godavari Basin under Different Climate-Change Scenarios," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 30(15), pages 5575-5587, December.
    3. Kuhn, Max, 2008. "Building Predictive Models in R Using the caret Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 28(i05).
    4. Sou-Sen Leu & Quang-Nha Bui, 2016. "Leak Prediction Model for Water Distribution Networks Created Using a Bayesian Network Learning Approach," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 30(8), pages 2719-2733, June.
    5. R. Venkata Ramana & B. Krishna & S. Kumar & N. Pandey, 2013. "Monthly Rainfall Prediction Using Wavelet Neural Network Analysis," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 27(10), pages 3697-3711, August.
    6. Ming Wei & Xue-yi You, 2022. "Monthly rainfall forecasting by a hybrid neural network of discrete wavelet transformation and deep learning," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(11), pages 4003-4018, September.
    7. Alessandro Pagano & Raffaele Giordano & Ivan Portoghese, 2022. "A Pipe Ranking Method for Water Distribution Network Resilience Assessment Based on Graph-Theory Metrics Aggregated Through Bayesian Belief Networks," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(13), pages 5091-5106, October.
    8. Scutari, Marco, 2010. "Learning Bayesian Networks with the bnlearn R Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 35(i03).
    9. Riya Dutta & Rajib Maity & Parul Patel, 2022. "Short and Medium Range Forecast of Soil Moisture for the Different Climatic Regions of India Using Temporal Networks," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(1), pages 235-251, January.
    10. Radhikesh Kumar & Maheshwari Prasad Singh & Bishwajit Roy & Afzal Hussain Shahid, 2021. "A Comparative Assessment of Metaheuristic Optimized Extreme Learning Machine and Deep Neural Network in Multi-Step-Ahead Long-term Rainfall Prediction for All-Indian Regions," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 35(6), pages 1927-1960, April.
    11. Scutari, Marco, 2017. "Bayesian Network Constraint-Based Structure Learning Algorithms: Parallel and Optimized Implementations in the bnlearn R Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 77(i02).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mahdie Afshari Nia & Fatemeh Panahi & Mohammad Ehteram, 2023. "Convolutional Neural Network- ANN- E (Tanh): A New Deep Learning Model for Predicting Rainfall," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(4), pages 1785-1810, March.
    2. Mohammad Ehteram & Ali Najah Ahmed & Zohreh Sheikh Khozani & Ahmed El-Shafie, 2023. "Convolutional Neural Network -Support Vector Machine Model-Gaussian Process Regression: A New Machine Model for Predicting Monthly and Daily Rainfall," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(9), pages 3631-3655, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tang, Kayu & Parsons, David J. & Jude, Simon, 2019. "Comparison of automatic and guided learning for Bayesian networks to analyse pipe failures in the water distribution system," Reliability Engineering and System Safety, Elsevier, vol. 186(C), pages 24-36.
    2. Sangsung Park & Sunghae Jun, 2020. "Patent Keyword Analysis of Disaster Artificial Intelligence Using Bayesian Network Modeling and Factor Analysis," Sustainability, MDPI, vol. 12(2), pages 1-11, January.
    3. Mahrouz Nourali, 2023. "Improved Treatment of Model Prediction Uncertainty: Estimating Rainfall using Discrete Wavelet Transform and Principal Component Analysis," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(11), pages 4211-4231, September.
    4. Guo-Yu Huang & Chi-Ju Lai & Ping-Feng Pai, 2022. "Forecasting Hourly Intermittent Rainfall by Deep Belief Networks with Simple Exponential Smoothing," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 36(13), pages 5207-5223, October.
    5. Babak Fazelabdolabadi, 2019. "Uncertainty and energy-sector equity returns in Iran: a Bayesian and quasi-Monte Carlo time-varying analysis," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 5(1), pages 1-20, December.
    6. Pedro Bonilla-Nadal & Andrés Cano & Manuel Gómez-Olmedo & Serafín Moral & Ofelia Paula Retamero, 2022. "Using Value-Based Potentials for Making Approximate Inference on Probabilistic Graphical Models," Mathematics, MDPI, vol. 10(14), pages 1-27, July.
    7. Mahdie Afshari Nia & Fatemeh Panahi & Mohammad Ehteram, 2023. "Convolutional Neural Network- ANN- E (Tanh): A New Deep Learning Model for Predicting Rainfall," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 37(4), pages 1785-1810, March.
    8. Wang, Yuhong & Zhang, Fan & Yang, Zhisen & Yang, Zaili, 2021. "Incorporation of deficiency data into the analysis of the dependency and interdependency among the risk factors influencing port state control inspection," Reliability Engineering and System Safety, Elsevier, vol. 206(C).
    9. Babak Fazelabdolabadi, 2019. "A hybrid Bayesian-network proposition for forecasting the crude oil price," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 5(1), pages 1-21, December.
    10. Jinping Zhang & Yong Zhao & Weihua Xiao, 2015. "Multi-Resolution Cointegration Prediction for Runoff and Sediment Load," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 29(10), pages 3601-3613, August.
    11. Jie Zhao & Ji Chen & Damien Beillouin & Hans Lambers & Yadong Yang & Pete Smith & Zhaohai Zeng & Jørgen E. Olesen & Huadong Zang, 2022. "Global systematic review with meta-analysis reveals yield advantage of legume-based rotations and its drivers," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    12. Piaopiao Chen & Agnès H. Michel & Jianzhi Zhang, 2022. "Transposon insertional mutagenesis of diverse yeast strains suggests coordinated gene essentiality polymorphisms," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    13. Paulo Infante & Gonçalo Jacinto & Anabela Afonso & Leonor Rego & Pedro Nogueira & Marcelo Silva & Vitor Nogueira & José Saias & Paulo Quaresma & Daniel Santos & Patrícia Góis & Paulo Rebelo Manuel, 2023. "Factors That Influence the Type of Road Traffic Accidents: A Case Study in a District of Portugal," Sustainability, MDPI, vol. 15(3), pages 1-16, January.
    14. Vuong, Quan-Hoang & La, Viet-Phuong, 2019. "The bayesvl R package. User guide v0.8.1," OSF Preprints w5dx6, Center for Open Science.
    15. Ephrem Habyarimana & Faheem S Baloch, 2021. "Machine learning models based on remote and proximal sensing as potential methods for in-season biomass yields prediction in commercial sorghum fields," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-23, March.
    16. Banks, Jonathan & Rabbani, Arif & Nadkarni, Kabir & Renaud, Evan, 2020. "Estimating parasitic loads related to brine production from a hot sedimentary aquifer geothermal project: A case study from the Clarke Lake gas field, British Columbia," Renewable Energy, Elsevier, vol. 153(C), pages 539-552.
    17. F. Cugnata & G. Perucca & S. Salini, 2017. "Bayesian networks and the assessment of universities' value added," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(10), pages 1785-1806, July.
    18. Jew Das & Alin Treesa & N. V. Umamahesh, 2018. "Modelling Impacts of Climate Change on a River Basin: Analysis of Uncertainty Using REA & Possibilistic Approach," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 32(15), pages 4833-4852, December.
    19. Crespo, Cristian, 2020. "Two become one: improving the targeting of conditional cash transfers with a predictive model of school dropout," LSE Research Online Documents on Economics 123139, London School of Economics and Political Science, LSE Library.
    20. Roland R. Ramsahai, 2020. "Connecting actuarial judgment to probabilistic learning techniques with graph theory," Papers 2007.15475, arXiv.org.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:waterr:v:36:y:2022:i:15:d:10.1007_s11269-022-03341-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.