IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i14p2486-d864582.html
   My bibliography  Save this article

Data Mining Applied to Decision Support Systems for Power Transformers’ Health Diagnostics

Author

Listed:
  • Alexandra I. Khalyasmaa

    (Ural Power Engineering Institute, Ural Federal University Named after the First President of Russia B.N. Yeltsin, 620002 Ekaterinburg, Russia
    Power Plants Department, Novosibirsk State Technical University, 630073 Novosibirsk, Russia)

  • Pavel V. Matrenin

    (Industrial Power Supply Systems Department, Novosibirsk State Technical University, 630073 Novosibirsk, Russia)

  • Stanislav A. Eroshenko

    (Ural Power Engineering Institute, Ural Federal University Named after the First President of Russia B.N. Yeltsin, 620002 Ekaterinburg, Russia
    Power Plants Department, Novosibirsk State Technical University, 630073 Novosibirsk, Russia)

  • Vadim Z. Manusov

    (Industrial Power Supply Systems Department, Novosibirsk State Technical University, 630073 Novosibirsk, Russia)

  • Andrey M. Bramm

    (Ural Power Engineering Institute, Ural Federal University Named after the First President of Russia B.N. Yeltsin, 620002 Ekaterinburg, Russia)

  • Alexey M. Romanov

    (Institute of Artificial Intelligence, MIREA-Russian Technological University, 119454 Moscow, Russia)

Abstract

This manuscript addresses the problem of technical state assessment of power transformers based on data preprocessing and machine learning. The initial dataset contains diagnostics results of the power transformers, which were collected from a variety of different data sources. It leads to dramatic degradation of the quality of the initial dataset, due to a substantial number of missing values. The problems of such real-life datasets are considered together with the performed efforts to find a balance between data quality and quantity. A data preprocessing method is proposed as a two-iteration data mining technology with simultaneous visualization of objects’ observability in a form of an image of the dataset represented by a data area diagram. The visualization improves the decision-making quality in the course of the data preprocessing procedure. On the dataset collected by the authors, the two-iteration data preprocessing technology increased the dataset filling degree from 75% to 94%, thus the number of gaps that had to be filled in with the synthetic values was reduced by 2.5 times. The processed dataset was used to build machine-learning models for power transformers’ technical state classification. A comparative analysis of different machine learning models was carried out. The outperforming efficiency of ensembles of decision trees was validated for the fleet of high-voltage power equipment taken under consideration. The resulting classification-quality metric, namely, F 1 -score, was estimated to be 83%.

Suggested Citation

  • Alexandra I. Khalyasmaa & Pavel V. Matrenin & Stanislav A. Eroshenko & Vadim Z. Manusov & Andrey M. Bramm & Alexey M. Romanov, 2022. "Data Mining Applied to Decision Support Systems for Power Transformers’ Health Diagnostics," Mathematics, MDPI, vol. 10(14), pages 1-25, July.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:14:p:2486-:d:864582
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/14/2486/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/14/2486/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Stefan Tenbohlen & Sebastian Coenen & Mohammad Djamali & Andreas Müller & Mohammad Hamed Samimi & Martin Siegel, 2016. "Diagnostic Measurements for Power Transformers," Energies, MDPI, vol. 9(5), pages 1-25, May.
    2. Quinn McNemar, 1947. "Note on the sampling error of the difference between correlated proportions or percentages," Psychometrika, Springer;The Psychometric Society, vol. 12(2), pages 153-157, June.
    3. Hazlee Azil Illias & Wee Zhao Liang, 2018. "Identification of transformer fault based on dissolved gas analysis using hybrid support vector machine-modified evolutionary particle swarm optimisation," PLOS ONE, Public Library of Science, vol. 13(1), pages 1-15, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Stanislav A. Eroshenko & Alexander A. Pastushkov & Mikhail P. Romanov & Alexey M. Romanov, 2023. "Choice of Solutions in the Design of Complex Energy Systems Based on the Analysis of Variants with Interval Weights," Mathematics, MDPI, vol. 11(7), pages 1-18, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Uttam Bandyopadhyay & Atanu Biswas & Shirsendu Mukherjee, 2009. "Adaptive two-treatment two-period crossover design for binary treatment responses incorporating carry-over effects," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 18(1), pages 13-33, March.
    2. Mehran Tahir & Stefan Tenbohlen, 2019. "A Comprehensive Analysis of Windings Electrical and Mechanical Faults Using a High-Frequency Model," Energies, MDPI, vol. 13(1), pages 1-25, December.
    3. Chacón, José E. & Fernández Serrano, Javier, 2024. "Bayesian taut splines for estimating the number of modes," Computational Statistics & Data Analysis, Elsevier, vol. 196(C).
    4. Lefeng Cheng & Tao Yu & Guoping Wang & Bo Yang & Lv Zhou, 2018. "Hot Spot Temperature and Grey Target Theory-Based Dynamic Modelling for Reliability Assessment of Transformer Oil-Paper Insulation Systems: A Practical Case Study," Energies, MDPI, vol. 11(1), pages 1-26, January.
    5. Jiefeng Liu & Hanbo Zheng & Yiyi Zhang & Hua Wei & Ruijin Liao, 2017. "Grey Relational Analysis for Insulation Condition Assessment of Power Transformers Based Upon Conventional Dielectric Response Measurement," Energies, MDPI, vol. 10(10), pages 1-16, October.
    6. Bester Tawona Mudereri & Elfatih M. Abdel-Rahman & Shepard Ndlela & Louisa Delfin Mutsa Makumbe & Christabel Chiedza Nyanga & Henri E. Z. Tonnang & Samira A. Mohamed, 2022. "Integrating the Strength of Multi-Date Sentinel-1 and -2 Datasets for Detecting Mango ( Mangifera indica L.) Orchards in a Semi-Arid Environment in Zimbabwe," Sustainability, MDPI, vol. 14(10), pages 1-23, May.
    7. Nosi, Costanza & D’Agostino, Antonella & Pratesi, Carlo Alberto & Barbarossa, Camilla, 2021. "Evaluating a social marketing campaign on healthy nutrition and lifestyle among primary-school children: A mixed-method research design," Evaluation and Program Planning, Elsevier, vol. 89(C).
    8. Yulong Wang & Xiaohong Zhang & Lili Li & Jinyang Du & Junguo Gao, 2019. "Design of Partial Discharge Test Environment for Oil-Filled Submarine Cable Terminals and Ultrasonic Monitoring," Energies, MDPI, vol. 12(24), pages 1-14, December.
    9. John E. Core, 2010. "Discussion of Chief Executive Officer Equity Incentives and Accounting Irregularities," Journal of Accounting Research, Wiley Blackwell, vol. 48(2), pages 273-287, May.
    10. Preety Srivastava & Xueyan Zhao, 2010. "What Do the Bingers Drink? Micro‐Unit Evidence on Negative Externalities and Drinker Characteristics of Alcohol Consumption by Beverage Types," Economic Papers, The Economic Society of Australia, vol. 29(2), pages 229-250, June.
    11. Szymon Banaszak & Konstanty Marek Gawrylczyk & Katarzyna Trela, 2020. "Frequency Response Modelling of Transformer Windings Connected in Parallel," Energies, MDPI, vol. 13(6), pages 1-13, March.
    12. Hanousek Jan & Kočenda Evžen & Novotný Jan, 2012. "The identification of price jumps," Monte Carlo Methods and Applications, De Gruyter, vol. 18(1), pages 53-77, January.
    13. Monnery, Benjamin & Wolff, François-Charles & Henneguelle, Anaïs, 2020. "Prison, semi-liberty and recidivism: Bounding causal effects in a survival model," International Review of Law and Economics, Elsevier, vol. 61(C).
    14. Holger Schwender & Margaret A. Taub & Terri H. Beaty & Mary L. Marazita & Ingo Ruczinski, 2012. "Rapid Testing of SNPs and Gene–Environment Interactions in Case–Parent Trio Data Based on Exact Analytic Parameter Estimation," Biometrics, The International Biometric Society, vol. 68(3), pages 766-773, September.
    15. Matysková, Ludmila & Rogers, Brian & Steiner, Jakub & Sun, Keh-Kuan, 2020. "Habits as adaptations: An experimental study," Games and Economic Behavior, Elsevier, vol. 122(C), pages 391-406.
    16. André, Kévin, 2013. "Applying the Capability Approach to the French Education System: An Assessment of the "Pourquoi pas moi ?"," ESSEC Working Papers WP1316, ESSEC Research Center, ESSEC Business School.
    17. repec:hal:journl:hal-00880246 is not listed on IDEAS
    18. Ruiz-Frau, A. & Krause, T. & Marbà, N., 2018. "The use of sociocultural valuation in sustainable environmental management," Ecosystem Services, Elsevier, vol. 29(PA), pages 158-167.
    19. Szymon Banaszak & Wojciech Szoka, 2018. "Cross Test Comparison in Transformer Windings Frequency Response Analysis," Energies, MDPI, vol. 11(6), pages 1-12, May.
    20. repec:cup:judgdm:v:8:y:2013:i:3:p:278-298 is not listed on IDEAS
    21. Shaub, David, 2020. "Fast and accurate yearly time series forecasting with forecast combinations," International Journal of Forecasting, Elsevier, vol. 36(1), pages 116-120.
    22. Fatih Atalar & Aysel Ersoy & Pawel Rozga, 2022. "Investigation of Effects of Different High Voltage Types on Dielectric Strength of Insulating Liquids," Energies, MDPI, vol. 15(21), pages 1-25, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:14:p:2486-:d:864582. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.