Authors Listed:
- Ivan Malashin
(Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia)
- Vladimir Nelyub
(Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia; Scientific Department, Far Eastern Federal University, 690922 Vladivostok, Russia)
- Aleksei Borodulin
(Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia)
- Andrei Gantimurov
(Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia)
- Vadim Tynchenko
(Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia)
Abstract
Access to clean water is a fundamental human need, yet millions of people worldwide still lack access to safe drinking water. Traditional water quality assessments, though reliable, are typically time-consuming and resource-intensive. This study investigates the application of machine learning (ML) techniques for analyzing river water quality in the Barnaul area, located on the Ob River in the Altai Krai. The research particularly highlights the use of the Water Quality Index (WQI) as a key factor in feature engineering. WQI, calculated using the Horton model, integrates nine hydrochemical parameters: pH, hardness, solids, chloramines, sulfate, conductivity, organic carbon, trihalomethanes, and turbidity. The primary objective was to demonstrate the contribution of WQI in enhancing predictive performance for water quality analysis. A dataset of 2465 records was analyzed, with missing values for parameters (pH, sulfate, and trihalomethanes) addressed using predictive imputation via neural network (NN) architectures optimized with genetic algorithms (GAs). Models trained without WQI achieved moderate predictive accuracy, but incorporating WQI as a feature dramatically improved performance across all tasks. For the trihalomethanes model, the R² score increased from 0.68 (without WQI) to 0.86 (with WQI). Similarly, for pH, the R² improved from 0.35 to 0.74, and for sulfate, from 0.27 to 0.69 after including WQI in the feature set.
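The idea of using WQI as an engineered feature can be illustrated with a minimal sketch. It assumes the index is a weighted arithmetic mean of per-parameter sub-indices, each scaled against a reference limit. The abstract does not give the weights, reference limits, or sub-index definitions used in the paper, so the function names (`sub_index`, `water_quality_index`, `add_wqi_feature`) and every numeric value below are hypothetical placeholders, not the authors' method.

```python
from typing import Dict


def sub_index(value: float, limit: float) -> float:
    """Scale a measurement against a reference limit (illustrative definition)."""
    if limit <= 0:
        raise ValueError("reference limit must be positive")
    return 100.0 * value / limit


def water_quality_index(
    sample: Dict[str, float],
    limits: Dict[str, float],
    weights: Dict[str, float],
) -> float:
    """Weighted arithmetic mean of sub-indices over the parameters in `sample`."""
    weighted_sum = 0.0
    total_weight = 0.0
    for name, value in sample.items():
        weight = weights[name]
        weighted_sum += weight * sub_index(value, limits[name])
        total_weight += weight
    if total_weight <= 0:
        raise ValueError("total weight must be positive")
    return weighted_sum / total_weight


def add_wqi_feature(
    sample: Dict[str, float],
    limits: Dict[str, float],
    weights: Dict[str, float],
) -> Dict[str, float]:
    """Return a copy of the record with an extra 'wqi' column."""
    enriched = dict(sample)
    enriched["wqi"] = water_quality_index(sample, limits, weights)
    return enriched


if __name__ == "__main__":
    # Placeholder values only: not taken from the paper or any standard.
    record = {"hardness": 200.0, "turbidity": 4.0, "conductivity": 420.0}
    limits = {"hardness": 300.0, "turbidity": 5.0, "conductivity": 500.0}
    weights = {"hardness": 1.0, "turbidity": 2.0, "conductivity": 1.0}
    print(add_wqi_feature(record, limits, weights))
```

In this sketch the index is computed only from the parameters present in a record. How the study handles records with missing pH, sulfate, or trihalomethanes when computing WQI is not stated in the abstract.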
Suggested Citation
Ivan Malashin & Vladimir Nelyub & Aleksei Borodulin & Andrei Gantimurov & Vadim Tynchenko, 2025.
"Assessment of Water Hydrochemical Parameters Using Machine Learning Tools,"
Sustainability, MDPI, vol. 17(2), pages 1-21, January.
Handle:
RePEc:gam:jsusta:v:17:y:2025:i:2:p:497-:d:1564080