IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v15y2023i5p174-d1139665.html
   My bibliography  Save this article

Predicting Football Team Performance with Explainable AI: Leveraging SHAP to Identify Key Team-Level Performance Metrics

Author

Listed:
  • Serafeim Moustakidis

    (AIDEAS OÜ, 10117 Tallinn, Estonia)

  • Spyridon Plakias

    (Department of Physical Education and Sport Science, University of Thessaly, 42100 Trikala, Greece)

  • Christos Kokkotis

    (Department of Physical Education and Sport Science, Democritus University of Thrace, 69100 Komotini, Greece)

  • Themistoklis Tsatalas

    (Department of Physical Education and Sport Science, University of Thessaly, 42100 Trikala, Greece)

  • Dimitrios Tsaopoulos

    (Institute for Bio-Economy and Agri-Technology, Center for Research and Technology Hellas, 38333 Volos, Greece)

Abstract

Understanding the performance indicators that contribute to the final score of a football match is crucial for directing the training process towards specific goals. This paper presents a pipeline for identifying key team-level performance variables in football using explainable ML techniques. The input data includes various team-specific features such as ball possession and pass behaviors, with the target output being the average scoring performance of each team over a season. The pipeline includes data preprocessing, sequential forward feature selection, model training, prediction, and explainability using SHapley Additive exPlanations (SHAP). Results show that 14 variables have the greatest contribution to the outcome of a match, with 12 having a positive effect and 2 having a negative effect. The study also identified the importance of certain performance indicators, such as shots, chances, passing, and ball possession, to the final score. This pipeline provides valuable insights for coaches and sports analysts to understand which aspects of a team’s performance need improvement and enable targeted interventions to improve performance. The use of explainable ML techniques allows for a deeper understanding of the factors contributing to the predicted average team score performance.

Suggested Citation

  • Serafeim Moustakidis & Spyridon Plakias & Christos Kokkotis & Themistoklis Tsatalas & Dimitrios Tsaopoulos, 2023. "Predicting Football Team Performance with Explainable AI: Leveraging SHAP to Identify Key Team-Level Performance Metrics," Future Internet, MDPI, vol. 15(5), pages 1-18, May.
  • Handle: RePEc:gam:jftint:v:15:y:2023:i:5:p:174-:d:1139665
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/15/5/174/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/15/5/174/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Stan Lipovetsky & Michael Conklin, 2001. "Analysis of regression in game theory approach," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 17(4), pages 319-330, October.
    2. Changjing Zhou & Shaoliang Zhang & Alberto Lorenzo Calvo & Yixiong Cui, 2018. "Chinese soccer association super league, 2012–2017: key performance indicators in balance games," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 18(4), pages 645-656, July.
    3. Miguel-Ángel Gómez & Michalis Mitrotasios & Vasilis Armatas & Carlos Lago-Peñas, 2018. "Analysis of playing styles according to team quality and match location in Greek professional soccer," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 18(6), pages 986-997, November.
    4. Carlos Lago-Peñas & Miguel Gómez-Ruano & Gai Yang, 2017. "Styles of play in professional soccer: an approach of the Chinese Soccer Super League," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 17(6), pages 1073-1084, November.
    5. Gunal Bilek & Efehan Ulas, 2019. "Predicting match outcome according to the quality of opponent in the English premier league using situational variables and team performance indicators," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 19(6), pages 930-941, November.
    6. Ali Cakmak & Ali Uzun & Emrullah Delibas, 2018. "Computational Modeling Of Pass Effectiveness In Soccer," Advances in Complex Systems (ACS), World Scientific Publishing Co. Pte. Ltd., vol. 21(03n04), pages 1-28, May.
    7. Kerys Harrop & Alan Nevill, 2014. "Performance indicators that predict success in an English professional League One soccer team," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 14(3), pages 907-920, December.
    8. Luca Pappalardo & Paolo Cintia, 2018. "Quantifying The Relation Between Performance And Success In Soccer," Advances in Complex Systems (ACS), World Scientific Publishing Co. Pte. Ltd., vol. 21(03n04), pages 1-30, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Li, Yuesen & Ma, Runqing & Gonçalves, Bruno & Gong, Bingnan & Cui, Yixiong & Shen, Yanfei, 2020. "Data-driven team ranking and match performance analysis in Chinese Football Super League," Chaos, Solitons & Fractals, Elsevier, vol. 141(C).
    2. Gong, Bingnan & Zhou, Changjing & Gómez, Miguel-Ángel & Buldú, J.M., 2023. "Identifiability of Chinese football teams: A complex networks approach," Chaos, Solitons & Fractals, Elsevier, vol. 166(C).
    3. Alejandro Sabarit & Rafael E. Reigal & Juan P. Morillo-Baro & Rocío Juárez-Ruiz de Mier & Auxiliadora Franquelo & Antonio Hernández-Mendo & Coral Falcó & Verónica Morales-Sánchez, 2020. "Cognitive Functioning, Physical Fitness, and Game Performance in a Sample of Adolescent Soccer Players," Sustainability, MDPI, vol. 12(13), pages 1-12, June.
    4. Claudio A. Casal & José L. Losada & Daniel Barreira & Rubén Maneiro, 2021. "Multivariate Exploratory Comparative Analysis of LaLiga Teams: Principal Component Analysis," IJERPH, MDPI, vol. 18(6), pages 1-18, March.
    5. Laura M S de Jong & Paul B Gastin & Maia Angelova & Lyndell Bruce & Dan B Dwyer, 2020. "Technical determinants of success in professional women’s soccer: A wider range of variables reveals new insights," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-12, October.
    6. Julen Castellano & Miguel Pic, 2019. "Identification and Preference of Game Styles in LaLiga Associated with Match Outcomes," IJERPH, MDPI, vol. 16(24), pages 1-13, December.
    7. Galli, L. & Galvan, G. & Levato, T. & Liti, C. & Piccialli, V. & Sciandrone, M., 2021. "Football: Discovering elapsing-time bias in the science of success," Chaos, Solitons & Fractals, Elsevier, vol. 152(C).
    8. Pera, Rebecca & Viglia, Giampaolo & Furlan, Roberto, 2016. "Who Am I? How Compelling Self-storytelling Builds Digital Personal Reputation," Journal of Interactive Marketing, Elsevier, vol. 35(C), pages 44-55.
    9. Stan Lipovetsky, 2021. "Predictor Analysis in Group Decision Making," Stats, MDPI, vol. 4(1), pages 1-14, February.
    10. Hugh Chen & Scott M. Lundberg & Su-In Lee, 2022. "Explaining a series of models by propagating Shapley values," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    11. Emrah Arbak, 2017. "Identifying the provisioning policies of Belgian banks," Working Paper Research 326, National Bank of Belgium.
    12. Viglia, Giampaolo & Abrate, Graziano, 2017. "When distinction does not pay off - Investigating the determinants of European agritourism prices," Journal of Business Research, Elsevier, vol. 80(C), pages 45-52.
    13. Xingwei Hu, 2020. "A theory of dichotomous valuation with applications to variable selection," Econometric Reviews, Taylor & Francis Journals, vol. 39(10), pages 1075-1099, November.
    14. Dmitry Sharapov & Paul Kattuman & Diego Rodriguez & F. Javier Velazquez, 2021. "Using the SHAPLEY value approach to variance decomposition in strategy research: Diversification, internationalization, and corporate group effects on affiliate profitability," Strategic Management Journal, Wiley Blackwell, vol. 42(3), pages 608-623, March.
    15. Xingwei Hu, 2018. "A Theory of Dichotomous Valuation with Applications to Variable Selection," Papers 1808.00131, arXiv.org, revised Mar 2020.
    16. Filotto, Umberto & Caratelli, Massimo & Fornezza, Fabrizio, 2021. "Shaping the digital transformation of the retail banking industry. Empirical evidence from Italy," European Management Journal, Elsevier, vol. 39(3), pages 366-375.
    17. Nimai Parmar & Nic James & Mike Hughes & Huw Jones & Gary Hearne, 2017. "Team performance indicators that predict match outcome and points difference in professional rugby league," International Journal of Performance Analysis in Sport, Taylor & Francis Journals, vol. 17(6), pages 1044-1056, November.
    18. Elena Pokryshevskaya & Evgeny Antipov, 2013. "Importance-performance analysis for internet stores: a system based on publicly available panel data," HSE Working papers WP BRP 08/MAN/2013, National Research University Higher School of Economics.
    19. Pelin Ayranci & Phung Lai & Nhathai Phan & Han Hu & Alexander Kolinowski & David Newman & Deijing Dou, 2022. "OnML: an ontology-based approach for interpretable machine learning," Journal of Combinatorial Optimization, Springer, vol. 44(1), pages 770-793, August.
    20. Changjing Zhou & William G. Hopkins & Wanli Mao & Alberto L. Calvo & Hongyou Liu, 2019. "Match Performance of Soccer Teams in the Chinese Super League—Effects of Situational and Environmental Factors," IJERPH, MDPI, vol. 16(21), pages 1-13, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:15:y:2023:i:5:p:174-:d:1139665. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.