IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v17y2025i2p497-d1564080.html
   My bibliography  Save this article

Assessment of Water Hydrochemical Parameters Using Machine Learning Tools

Author

Listed:
  • Ivan Malashin

    (Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia)

  • Vladimir Nelyub

    (Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia
    Scientific Department, Far Eastern Federal University, 690922 Vladivostok, Russia)

  • Aleksei Borodulin

    (Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia)

  • Andrei Gantimurov

    (Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia)

  • Vadim Tynchenko

    (Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia)

Abstract

Access to clean water is a fundamental human need, yet millions of people worldwide still lack access to safe drinking water. Traditional water quality assessments, though reliable, are typically time-consuming and resource-intensive. This study investigates the application of machine learning (ML) techniques for analyzing river water quality in the Barnaul area, located on the Ob River in the Altai Krai. The research particularly highlights the use of the Water Quality Index (WQI) as a key factor in feature engineering. WQI, calculated using the Horton model, integrates nine hydrochemical parameters: pH, hardness, solids, chloramines, sulfate, conductivity, organic carbon, trihalomethanes, and turbidity. The primary objective was to demonstrate the contribution of WQI in enhancing predictive performance for water quality analysis. A dataset of 2465 records was analyzed, with missing values for parameters (pH, sulfate, and trihalomethanes) addressed using predictive imputation via neural network (NN) architectures optimized with genetic algorithms (GAs). Models trained without WQI achieved moderate predictive accuracy, but incorporating WQI as a feature dramatically improved performance across all tasks. For the trihalomethanes model, the R 2 score increased from 0.68 (without WQI) to 0.86 (with WQI). Similarly, for pH, the R 2 improved from 0.35 to 0.74, and for sulfate, from 0.27 to 0.69 after including WQI in the feature set.

Suggested Citation

  • Ivan Malashin & Vladimir Nelyub & Aleksei Borodulin & Andrei Gantimurov & Vadim Tynchenko, 2025. "Assessment of Water Hydrochemical Parameters Using Machine Learning Tools," Sustainability, MDPI, vol. 17(2), pages 1-21, January.
  • Handle: RePEc:gam:jsusta:v:17:y:2025:i:2:p:497-:d:1564080
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/17/2/497/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/17/2/497/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Yibrah Gebreyesus & Damian Dalton & Sebastian Nixon & Davide De Chiara & Marta Chinnici, 2023. "Machine Learning for Data Center Optimizations: Feature Selection Using Shapley Additive exPlanation (SHAP)," Future Internet, MDPI, vol. 15(3), pages 1-17, February.
    2. Hua-peng Qin & Qiong Su & Soon-Thiam Khu & Nv Tang, 2014. "Water Quality Changes during Rapid Urbanization in the Shenzhen River Catchment: An Integrated View of Socio-Economic and Infrastructure Development," Sustainability, MDPI, vol. 6(10), pages 1-19, October.
    3. Hui Wu & Yan-Fu Li, 2024. "A multi-sensor fusion-based prognostic model for systems with partially observable failure modes," IISE Transactions, Taylor & Francis Journals, vol. 56(6), pages 624-637, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bing Xue & Mario Tobias, 2015. "Sustainability in China: Bridging Global Knowledge with Local Action," Sustainability, MDPI, vol. 7(4), pages 1-7, March.
    2. Chamara P. Liyanage & Koichi Yamada, 2017. "Impact of Population Growth on the Water Quality of Natural Water Bodies," Sustainability, MDPI, vol. 9(8), pages 1-14, August.
    3. Wei Wang & Hualin Xie & Tong Jiang & Daobei Zhang & Xue Xie, 2016. "Measuring the Total-Factor Carbon Emission Performance of Industrial Land Use in China Based on the Global Directional Distance Function and Non-Radial Luenberger Productivity Index," Sustainability, MDPI, vol. 8(4), pages 1-19, April.
    4. Qiang Sheng & Wang Xu & Long Chen & Lei Wang & Yudong Wang & Yihong Liu & Linshen Xie, 2022. "Effect of Urban River Morphology on the Structure of Macroinvertebrate Communities in a Subtropical Urban River," Sustainability, MDPI, vol. 14(16), pages 1-16, August.
    5. Jovana Brankov & Ana Milanović Pešić & Dragana Milijašević Joksimović & Milan M. Radovanović & Marko D. Petrović, 2020. "Water Quality Estimation and Population’s Attitudes: A Multi-Disciplinary Perspective of Environmental Implications in Tara National Park (Serbia)," Sustainability, MDPI, vol. 13(1), pages 1-18, December.
    6. Zhenhuan Liu & Haiyan Yang, 2018. "The Impacts of Spatiotemporal Landscape Changes on Water Quality in Shenzhen, China," IJERPH, MDPI, vol. 15(5), pages 1-14, May.
    7. Krzysztof Stępniewski & Maksym Łaszewski, 2021. "Spatial and Seasonal Dynamics of Inorganic Nitrogen and Phosphorous Compounds in an Orchard-Dominated Catchment with Anthropogenic Impacts," Sustainability, MDPI, vol. 13(20), pages 1-19, October.
    8. Sheng Zhang & Meng Xu & Yifu Yang & Zeyu Song, 2021. "Technological Innovation, Production Efficiency, and Sustainable Development: A Case Study from Shenzhen in China," Sustainability, MDPI, vol. 13(19), pages 1-12, September.
    9. Lei Liu & Tong Wu & Zhihang Xu & Xiaofeng Pan, 2018. "The Water-Economy Nexus and Sustainable Transition of the Pearl River Delta, China (1999–2015)," Sustainability, MDPI, vol. 10(8), pages 1-15, July.
    10. Jin Guo & Yingzhi Xu & Zhengning Pu, 2016. "Urbanization and Its Effects on Industrial Pollutant Emissions: An Empirical Study of a Chinese Case with the Spatial Panel Model," Sustainability, MDPI, vol. 8(8), pages 1-15, August.
    11. Edson Turibamwe & Rapheal Wangalwa, 2020. "A Comparative Study Of Two Biological Monitoring Systems In Assessing Water Quality: A Case Of River Birira, Sheema District, Uganda," Water Conservation & Management (WCM), Zibeline International Publishing, vol. 4(1), pages 7-14, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:17:y:2025:i:2:p:497-:d:1564080. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.