IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v19y2022i20p13702-d949916.html
   My bibliography  Save this article

Improving Water Quality Index Prediction Using Regression Learning Models

Author

Listed:
  • Jesmeen Mohd Zebaral Hoque

    (Faculty of Engineering & Technology, Multimedia University, Melaka 75450, Malaysia)

  • Nor Azlina Ab. Aziz

    (Faculty of Engineering & Technology, Multimedia University, Melaka 75450, Malaysia)

  • Salem Alelyani

    (Center for Artificial Intelligence (CAI), King Khalid University, Abha 61421, Saudi Arabia
    College of Computer Science, King Khalid University, Abha 61421, Saudi Arabia)

  • Mohamed Mohana

    (Center for Artificial Intelligence (CAI), King Khalid University, Abha 61421, Saudi Arabia)

  • Maruf Hosain

    (Faculty of Engineering & Technology, Multimedia University, Melaka 75450, Malaysia)

Abstract

Rivers are the main sources of freshwater supply for the world population. However, many economic activities contribute to river water pollution. River water quality can be monitored using various parameters, such as the pH level, dissolved oxygen, total suspended solids, and the chemical properties. Analyzing the trend and pattern of these parameters enables the prediction of the water quality so that proactive measures can be made by relevant authorities to prevent water pollution and predict the effectiveness of water restoration measures. Machine learning regression algorithms can be applied for this purpose. Here, eight machine learning regression techniques, including decision tree regression, linear regression, ridge, Lasso, support vector regression, random forest regression, extra tree regression, and the artificial neural network, are applied for the purpose of water quality index prediction. Historical data from Indian rivers are adopted for this study. The data refer to six water parameters. Twelve other features are then derived from the original six parameters. The performances of the models using different algorithms and sets of features are compared. The derived water quality rating scale features are identified to contribute toward the development of better regression models, while the linear regression and ridge offer the best performance. The best mean square error achieved is 0 and the correlation coefficient is 1.

Suggested Citation

  • Jesmeen Mohd Zebaral Hoque & Nor Azlina Ab. Aziz & Salem Alelyani & Mohamed Mohana & Maruf Hosain, 2022. "Improving Water Quality Index Prediction Using Regression Learning Models," IJERPH, MDPI, vol. 19(20), pages 1-23, October.
  • Handle: RePEc:gam:jijerp:v:19:y:2022:i:20:p:13702-:d:949916
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/19/20/13702/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/19/20/13702/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Koketso J. Setshedi & Nhamo Mutingwende & Nosiphiwe P. Ngqwala, 2021. "The Use of Artificial Neural Networks to Predict the Physicochemical Characteristics of Water Quality in Three District Municipalities, Eastern Cape Province, South Africa," IJERPH, MDPI, vol. 18(10), pages 1-17, May.
    2. Huan Wu & Shuiping Cheng & Kunlun Xin & Nian Ma & Jie Chen & Liang Tao & Min Gao, 2022. "Water Quality Prediction Based on Multi-Task Learning," IJERPH, MDPI, vol. 19(15), pages 1-19, August.
    3. Basilua Andre Muzembo & Kei Kitahara & Anusuya Debnath & Ayumu Ohno & Keinosuke Okamoto & Shin-Ichi Miyoshi, 2022. "Cholera Outbreaks in India, 2011–2020: A Systematic Review," IJERPH, MDPI, vol. 19(9), pages 1-27, May.
    4. Bijoyee Sarker & Kamrun N. Keya & Fatin I. Mahir & Khandakar M. Nahiun & Shahirin Shahida & Ruhul A. Khan, 2021. "Surface and Ground Water Pollution: Causes and Effects of Urbanization and Industrialization in South Asia," Scientific Review, Academic Research Publishing Group, vol. 7(3), pages 32-41, 07-2021.
    5. Manickavasagar Kayanan & Pushpakanthie Wijekoon, 2020. "Stochastic Restricted LASSO-Type Estimator in the Linear Regression Model," Journal of Probability and Statistics, Hindawi, vol. 2020, pages 1-7, March.
    6. Monika Kulisz & Justyna Kujawska & Bartosz Przysucha & Wojciech Cel, 2021. "Forecasting Water Quality Index in Groundwater Using Artificial Neural Network," Energies, MDPI, vol. 14(18), pages 1-17, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Regmi, Rupesh & Zhang, Zhuo & Zhang, Hongpeng, 2023. "Entrepreneurship strategy, natural resources management and sustainable performance: A study of an emerging market," Resources Policy, Elsevier, vol. 86(PB).
    2. Rocio Camarena-Martinez & Rocio A. Lizarraga-Morales & Roberto Baeza-Serrato, 2021. "Classification of Geomembranes as Raw Material for Defects Reduction in the Manufacture of Biodigesters Using an Artificial Neuronal Network," Energies, MDPI, vol. 14(21), pages 1-13, November.
    3. George Halkos, 2023. "Economic Analysis and Policies for the Environment, Natural Resources, and Energy," Energies, MDPI, vol. 16(18), pages 1-6, September.
    4. James Ming Chen & Predrag Bejaković & Nika Šimurina, 2024. "Tax and Policy Drivers of Personal Overindebtedness in the European Union," International Advances in Economic Research, Springer;International Atlantic Economic Society, vol. 30(2), pages 115-133, May.
    5. Yuliia Trach & Roman Trach & Marek Kalenik & Eugeniusz Koda & Anna Podlasek, 2021. "A Study of Dispersed, Thermally Activated Limestone from Ukraine for the Safe Liming of Water Using ANN Models," Energies, MDPI, vol. 14(24), pages 1-14, December.
    6. Shimza Bint Aslam & Nimra Amar, 2023. "Household Water Conservation in Pakistan: An Examination of Strategies, Behaviors and Determinants," Journal of Economic Impact, Science Impact Publishers, vol. 5(3), pages 181-189.
    7. Justyna Kujawska & Monika Kulisz & Piotr Oleszczuk & Wojciech Cel, 2023. "Improved Prediction of the Higher Heating Value of Biomass Using an Artificial Neural Network Model Based on the Selection of Input Parameters," Energies, MDPI, vol. 16(10), pages 1-16, May.
    8. Roman Trach & Yuliia Trach & Agnieszka Kiersnowska & Anna Markiewicz & Marzena Lendo-Siwicka & Konstantin Rusakov, 2022. "A Study of Assessment and Prediction of Water Quality Index Using Fuzzy Logic and ANN Models," Sustainability, MDPI, vol. 14(9), pages 1-19, May.
    9. Muhammad Ishfaque & Qianwei Dai & Nuhman ul Haq & Khanzaib Jadoon & Syed Muzyan Shahzad & Hammad Tariq Janjuhah, 2022. "Use of Recurrent Neural Network with Long Short-Term Memory for Seepage Prediction at Tarbela Dam, KP, Pakistan," Energies, MDPI, vol. 15(9), pages 1-16, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:19:y:2022:i:20:p:13702-:d:949916. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.