IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v15y2023i3p88-d1076144.html
   My bibliography  Save this article

Machine Learning for Data Center Optimizations: Feature Selection Using Shapley Additive exPlanation (SHAP)

Author

Listed:
  • Yibrah Gebreyesus

    (School of Computer Science, University College of Dublin, D04 V1W8 Dublin, Ireland)

  • Damian Dalton

    (School of Computer Science, University College of Dublin, D04 V1W8 Dublin, Ireland)

  • Sebastian Nixon

    (School of Computer Science, Wolaita Sodo University, Wolaita P.O. Box 138, Ethiopia)

  • Davide De Chiara

    (ENEA-R.C. Portici, 80055 Portici (NA), Italy)

  • Marta Chinnici

    (ENEA-R.C. Casaccia, 00196 Rome, Italy)

Abstract

The need for artificial intelligence (AI) and machine learning (ML) models to optimize data center (DC) operations increases as the volume of operations management data upsurges tremendously. These strategies can assist operators in better understanding their DC operations and help them make informed decisions upfront to maintain service reliability and availability. The strategies include developing models that optimize energy efficiency, identifying inefficient resource utilization and scheduling policies, and predicting outages. In addition to model hyperparameter tuning, feature subset selection (FSS) is critical for identifying relevant features for effectively modeling DC operations to provide insight into the data, optimize model performance, and reduce computational expenses. Hence, this paper introduces the Shapley Additive exPlanation (SHAP) values method, a class of additive feature attribution values for identifying relevant features that is rarely discussed in the literature. We compared its effectiveness with several commonly used, importance-based feature selection methods. The methods were tested on real DC operations data streams obtained from the ENEA CRESCO6 cluster with 20,832 cores. To demonstrate the effectiveness of SHAP compared to other methods, we selected the top ten most important features from each method, retrained the predictive models, and evaluated their performance using the MAE, RMSE, and MPAE evaluation criteria. The results presented in this paper demonstrate that the predictive models trained using features selected with the SHAP-assisted method performed well, with a lower error and a reasonable execution time compared to other methods.

Suggested Citation

  • Yibrah Gebreyesus & Damian Dalton & Sebastian Nixon & Davide De Chiara & Marta Chinnici, 2023. "Machine Learning for Data Center Optimizations: Feature Selection Using Shapley Additive exPlanation (SHAP)," Future Internet, MDPI, vol. 15(3), pages 1-17, February.
  • Handle: RePEc:gam:jftint:v:15:y:2023:i:3:p:88-:d:1076144
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/15/3/88/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/15/3/88/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Mohamed Sameer Hoosain & Babu Sena Paul & Seeram Ramakrishna, 2020. "The Impact of 4IR Digital Technologies and Circular Thinking on the United Nations Sustainable Development Goals," Sustainability, MDPI, vol. 12(23), pages 1-16, December.
    2. Agnieszka Malkowska & Maria Urbaniec & Malgorzata Kosala, 2021. "The impact of digital transformation on European countries: insights from a comparative analysis," Equilibrium. Quarterly Journal of Economics and Economic Policy, Institute of Economic Research, vol. 16(2), pages 325-355, June.
    3. Zhen Yang & Jinhong Du & Yiting Lin & Zhen Du & Li Xia & Qianchuan Zhao & Xiaohong Guan, 2022. "Increasing the energy efficiency of a data center based on machine learning," Journal of Industrial Ecology, Yale University, vol. 26(1), pages 323-335, February.
    4. Anastasiia Grishina & Marta Chinnici & Ah-Lian Kor & Eric Rondeau & Jean-Philippe Georges, 2020. "A Machine Learning Solution for Data Center Thermal Characteristics Analysis," Energies, MDPI, vol. 13(17), pages 1-13, August.
    5. Anders S. G. Andrae & Tomas Edler, 2015. "On Global Electricity Usage of Communication Technology: Trends to 2030," Challenges, MDPI, vol. 6(1), pages 1-41, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Steffen Dalsgaard, 2022. "Can IT Resolve the Climate Crisis? Sketching the Role of an Anthropology of Digital Technology," Sustainability, MDPI, vol. 14(10), pages 1-17, May.
    2. Axenbeck, Janna & Niebel, Thomas, 2021. "Climate Protection Potentials of Digitalized Production Processes: Microeconometric Evidence?," VfS Annual Conference 2021 (Virtual Conference): Climate Economics 242369, Verein für Socialpolitik / German Economic Association.
    3. Damar Novtahaning & Hasnain Ali Shah & Jae-Mo Kang, 2022. "Deep Learning Ensemble-Based Automated and High-Performing Recognition of Coffee Leaf Disease," Agriculture, MDPI, vol. 12(11), pages 1-16, November.
    4. Francisco Olmo-García & Fernando Javier Crecente-Romero & María Teresa Val-Núñez & María Sarabia-Alegría, 2023. "Entrepreneurial activity in an environment of digital transformation: an analysis of relevant factors in the euro area," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-10, December.
    5. Paraschiv Dorel Mihai & ȚIțan Emilia & Manea Daniela Ioana & Ionescu Crina Dana & Mihai Mihaela & Șerban Octavian, 2022. "The change in e-commerce in the context of the Coronavirus pandemic," Management & Marketing, Sciendo, vol. 17(2), pages 220-233, June.
    6. Babasola Osibo & Simisola Adamo, 2023. "Data Centers and Green Energy: Paving the Way for a Sustainable Digital Future," International Journal of Latest Technology in Engineering, Management & Applied Science, International Journal of Latest Technology in Engineering, Management & Applied Science (IJLTEMAS), vol. 12(11), pages 15-30, November.
    7. Mariusz Czupich & Justyna Łapińska & Vojtěch Bartoš, 2022. "Environmental Sustainability Assessment of the European Union’s Capital Cities," IJERPH, MDPI, vol. 19(7), pages 1-18, April.
    8. Anders S. G. Andrae & Mengjun Xia & Jianli Zhang & Xiaoming Tang, 2016. "Practical Eco-Design and Eco-Innovation of Consumer Electronics—the Case of Mobile Phones," Challenges, MDPI, vol. 7(1), pages 1-19, February.
    9. Muhammad Fahad & Arsalan Shahid & Ravi Reddy Manumachu & Alexey Lastovetsky, 2019. "A Comparative Study of Methods for Measurement of Energy of Computing," Energies, MDPI, vol. 12(11), pages 1-42, June.
    10. Tilman Santarius & Johanna Pohl & Steffen Lange, 2020. "Digitalization and the Decoupling Debate: Can ICT Help to Reduce Environmental Impacts While the Economy Keeps Growing?," Sustainability, MDPI, vol. 12(18), pages 1-20, September.
    11. John Martinovic & Markus Hähnel & Guntram Scheithauer & Waltenegus Dargie, 2022. "An introduction to stochastic bin packing-based server consolidation with conflicts," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(2), pages 296-331, July.
    12. Balcerzak, Adam P. & Zinecker, Marek & Skalický, Roman & Rogalska, Elżbieta & Doubravský, Karel, 2023. "Technology-oriented start-ups and valuation: A novel approach based on specific contract terms," Technological Forecasting and Social Change, Elsevier, vol. 197(C).
    13. Anders S. G. Andrae & Mikko Samuli Vaija, 2017. "Precision of a Streamlined Life Cycle Assessment Approach Used in Eco-Rating of Mobile Phones," Challenges, MDPI, vol. 8(2), pages 1-24, August.
    14. Salil Bharany & Sandeep Sharma & Osamah Ibrahim Khalaf & Ghaida Muttashar Abdulsahib & Abeer S. Al Humaimeedy & Theyazn H. H. Aldhyani & Mashael Maashi & Hasan Alkahtani, 2022. "A Systematic Survey on Energy-Efficient Techniques in Sustainable Cloud Computing," Sustainability, MDPI, vol. 14(10), pages 1-89, May.
    15. Sovacool, Benjamin K. & Martiskainen, Mari & Furszyfer Del Rio, Dylan D., 2021. "Knowledge, energy sustainability, and vulnerability in the demographics of smart home technology diffusion," Energy Policy, Elsevier, vol. 153(C).
    16. Fernando Almeida & José Morais & José Duarte Santos, 2022. "A Bibliometric Analysis of the Scientific Outcomes of European Projects on the Digital Transformation of SMEs," Publications, MDPI, vol. 10(4), pages 1-18, September.
    17. Henryk Dzwigol & Aleksy Kwilinski & Oleksii Lyulyov & Tetyana Pimonenko, 2024. "Digitalization and Energy in Attaining Sustainable Development: Impact on Energy Consumption, Energy Structure, and Energy Intensity," Energies, MDPI, vol. 17(5), pages 1-17, March.
    18. Christiano Coelho & Pedro Gesteira, 2022. "Measuring the effect of health on cross-country income variability," Economics Bulletin, AccessEcon, vol. 42(3), pages 1547-1560.
    19. Kosuke Sasakura & Takeshi Aoki & Masayoshi Komatsu & Takeshi Watanabe, 2020. "A Temperature-Risk and Energy-Saving Evaluation Model for Supporting Energy-Saving Measures for Data Center Server Rooms," Energies, MDPI, vol. 13(19), pages 1-22, October.
    20. Xiaoxi Zhang & Machiko Shinozuka & Yuriko Tanaka & Yuko Kanamori & Toshihiko Masui, 2022. "How ICT can contribute to realize a sustainable society in the future: a CGE approach," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 24(4), pages 5614-5640, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:15:y:2023:i:3:p:88-:d:1076144. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.