
Forecasting football match results in national league competitions using score-driven time series models

Author

Listed:
  • Koopman, Siem Jan
  • Lit, Rutger

Abstract

We develop a new dynamic multivariate model for the analysis and forecasting of football match results in national league competitions. The proposed dynamic model is based on the score of the predictive observation mass function for a high-dimensional panel of weekly match results. Our main interest is in forecasting whether the match result is a win, a loss or a draw for each team. The dynamic model for delivering such forecasts can be based on three different dependent variables: the pairwise count of the number of goals, the difference between the numbers of goals, or the category of the match result (win, loss, draw). The different dependent variables require different distributional assumptions. Furthermore, different dynamic model specifications can be considered for generating the forecasts. We investigate empirically which dependent variable and which dynamic model specification yield the best forecasting results. We validate the precision of the resulting forecasts and the success of the forecasts in a betting simulation in an extensive forecasting study for match results from six large European football competitions. Finally, we conclude that the dynamic model for pairwise counts delivers the most precise forecasts while the dynamic model for the difference between counts is most successful for betting, but that both outperform benchmark and other competing models.
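
As a rough illustration of the ideas in the abstract, the sketch below shows how win/draw/loss probabilities can be derived from a pair of goal counts, and how a score-driven recursion updates a time-varying intensity. It is a minimal sketch under simplifying assumptions, not the authors' specification: it assumes two independent Poisson counts (the listed keywords point to a bivariate Poisson, Skellam and ordered probit instead), a single log-intensity per team, and a unit-scaled score. The function names and the parameters omega, beta and alpha are hypothetical and chosen only for this example.

```python
import math


def poisson_pmf(k, lam):
    """Probability of observing k goals under a Poisson(lam) distribution."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def outcome_probs(lam_home, lam_away, max_goals=25):
    """Return (P(home win), P(draw), P(away win)).

    Assumption: home and away goals are independent Poisson counts.
    The sums are truncated at max_goals and renormalised.
    """
    home = [poisson_pmf(k, lam_home) for k in range(max_goals + 1)]
    away = [poisson_pmf(k, lam_away) for k in range(max_goals + 1)]
    win = draw = loss = 0.0
    for i, p in enumerate(home):
        for j, q in enumerate(away):
            if i > j:
                win += p * q
            elif i == j:
                draw += p * q
            else:
                loss += p * q
    total = win + draw + loss
    return win / total, draw / total, loss / total


def score_update(log_lam, goals, omega, beta, alpha):
    """One step of a generic score-driven recursion for a Poisson log-intensity.

    The score of the Poisson log-likelihood with respect to the log-intensity
    is goals - exp(log_lam). Unit scaling of the score is an assumption here.
    """
    score = goals - math.exp(log_lam)
    return omega + beta * log_lam + alpha * score


if __name__ == "__main__":
    # Hypothetical intensities: the home side is expected to score more.
    p_win, p_draw, p_loss = outcome_probs(1.6, 1.1)
    print(f"home win {p_win:.3f}, draw {p_draw:.3f}, away win {p_loss:.3f}")

    # Scoring more goals than expected raises the next-period log-intensity.
    print(score_update(log_lam=0.0, goals=3, omega=0.0, beta=1.0, alpha=0.1))
```

The difference between two independent Poisson counts follows a Skellam distribution, so the same three probabilities could equally be read off the distribution of the goal difference; the nested sum above simply makes that aggregation explicit.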

Suggested Citation

  • Koopman, Siem Jan & Lit, Rutger, 2019. "Forecasting football match results in national league competitions using score-driven time series models," International Journal of Forecasting, Elsevier, vol. 35(2), pages 797-809.
  • Handle: RePEc:eee:intfor:v:35:y:2019:i:2:p:797-809
    DOI: 10.1016/j.ijforecast.2018.10.011

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0169207018302048
    Download Restriction: Full text for ScienceDirect subscribers only

    References listed on IDEAS

    1. F. Blasques & S. J. Koopman & A. Lucas, 2015. "Information-theoretic optimality of observation-driven time series models for continuous responses," Biometrika, Biometrika Trust, vol. 102(2), pages 325-343.
    2. Gourieroux, Christian & Monfort, Alain & Trognon, Alain, 1984. "Pseudo Maximum Likelihood Methods: Applications to Poisson Models," Econometrica, Econometric Society, vol. 52(3), pages 701-720, May.
    3. Ioannis Asimakopoulos & John Goddard, 2004. "Forecasting football results and the efficiency of fixed-odds betting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 23(1), pages 51-66.
    4. Siem Jan Koopman & André Lucas & Marcel Scharth, 2016. "Predicting Time-Varying Parameters with Parameter-Driven and Observation-Driven Models," The Review of Economics and Statistics, MIT Press, vol. 98(1), pages 97-110, March.
    5. Siem Jan Koopman & Rutger Lit & André Lucas, 2017. "Intraday Stochastic Volatility in Discrete Price Changes: The Dynamic Skellam Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(520), pages 1490-1503, October.
    6. Diebold, Francis X & Mariano, Roberto S, 2002. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(1), pages 134-144, January.
    7. Giovanni Angelini & Luca De Angelis, 2017. "PARX model for football match predictions," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 36(7), pages 795-807, November.
    8. M. J. Maher, 1982. "Modelling association football scores," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 36(3), pages 109-118, September.
    9. Manuela Cattelan & Cristiano Varin & David Firth, 2013. "Dynamic Bradley–Terry modelling of sports tournaments," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 62(1), pages 135-150, January.
    10. Baboota, Rahul & Kaur, Harleen, 2019. "Predictive analysis and modelling football results using machine learning approach for English Premier League," International Journal of Forecasting, Elsevier, vol. 35(2), pages 741-755.
    11. Boshnakov, Georgi & Kharrat, Tarak & McHale, Ian G., 2017. "A bivariate Weibull count model for forecasting association football scores," International Journal of Forecasting, Elsevier, vol. 33(2), pages 458-466.
    12. Felix Famoye, 2010. "On the bivariate negative binomial regression model," Journal of Applied Statistics, Taylor & Francis Journals, vol. 37(6), pages 969-981.
    13. Drew Creal & Siem Jan Koopman & André Lucas, 2013. "Generalized Autoregressive Score Models With Applications," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 28(5), pages 777-795, August.
    14. Constantinou, Anthony Costa & Fenton, Norman Elliott, 2012. "Solving the Problem of Inadequate Scoring Rules for Assessing Probabilistic Football Forecast Models," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 8(1), pages 1-14, March.
    15. Goddard, John, 2005. "Regression models for forecasting goals and match results in association football," International Journal of Forecasting, Elsevier, vol. 21(2), pages 331-340.
    16. Hvattum, Lars Magnus & Arntzen, Halvard, 2010. "Using ELO ratings for match result prediction in association football," International Journal of Forecasting, Elsevier, vol. 26(3), pages 460-470, July.

    Citations

    Cited by:

    1. Wheatcroft, Edward, 2021. "Evaluating probabilistic forecasts of football matches: the case against the ranked probability score," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(4), pages 273-287, December.
    2. Wheatcroft, Edward, 2021. "Evaluating probabilistic forecasts of football matches: the case against the ranked probability score," LSE Research Online Documents on Economics 111494, London School of Economics and Political Science, LSE Library.
    3. Harvey, A., 2021. "Score-driven time series models," Cambridge Working Papers in Economics 2133, Faculty of Economics, University of Cambridge.
    4. Francisco Blasques & Vladimír Holý & Petra Tomanová, 2018. "Zero-Inflated Autoregressive Conditional Duration Model for Discrete Trade Durations with Excessive Zeros," Papers 1812.07318, arXiv.org, revised May 2024.
    5. Vladimír Holý & Jan Zouhar, 2022. "Modelling time‐varying rankings with autoregressive and score‐driven dynamics," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1427-1450, November.
    6. Raffaele Mattera, 2023. "Forecasting binary outcomes in soccer," Annals of Operations Research, Springer, vol. 325(1), pages 115-134, June.
    7. da Costa, Igor Barbosa & Marinho, Leandro Balby & Pires, Carlos Eduardo Santos, 2022. "Forecasting football results and exploiting betting markets: The case of “both teams to score”," International Journal of Forecasting, Elsevier, vol. 38(3), pages 895-909.
    8. Butler, David & Butler, Robert & Eakins, John, 2021. "Expert performance and crowd wisdom: Evidence from English Premier League predictions," European Journal of Operational Research, Elsevier, vol. 288(1), pages 170-182.
    9. Marc Garnica-Caparrós & Daniel Memmert & Fabian Wunderlich, 2022. "Artificial data in sports forecasting: a simulation framework for analysing predictive models in sports," Information Systems and e-Business Management, Springer, vol. 20(3), pages 551-580, September.
    10. Wunderlich, Fabian & Memmert, Daniel, 2020. "Are betting returns a useful measure of accuracy in (sports) forecasting?," International Journal of Forecasting, Elsevier, vol. 36(2), pages 713-722.
    11. Vladimír Holý, 2022. "An Intraday GARCH Model for Discrete Price Changes and Irregularly Spaced Observations," Papers 2211.12376, arXiv.org, revised May 2024.
    12. Lasek, Jan & Gagolewski, Marek, 2021. "Interpretable sports team rating models based on the gradient descent algorithm," International Journal of Forecasting, Elsevier, vol. 37(3), pages 1061-1071.
    13. Robert C. Smit & Francesco Ravazzolo & Luca Rossini, 2020. "Dynamic Bayesian forecasting of English Premier League match results with the Skellam distribution," BEMPS - Bozen Economics & Management Paper Series BEMPS72, Faculty of Economics and Management at the Free University of Bozen.
    14. P. Gorgi & S. J. Koopman & R. Lit, 2023. "Estimation of final standings in football competitions with a premature ending: the case of COVID-19," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(1), pages 233-250, March.
    15. Giovanni Angelini & Giuseppe Cavaliere & Enzo D'Innocenzo & Luca De Angelis, 2022. "Time-Varying Poisson Autoregression," Papers 2207.11003, arXiv.org.
    16. Kung, Ko-Lun & Liu, I-Chien & Wang, Chou-Wen, 2021. "Modeling and pricing longevity derivatives using Skellam distribution," Insurance: Mathematics and Economics, Elsevier, vol. 99(C), pages 341-354.
    17. Andrea Guizzardi & Luca Vincenzo Ballestra & Enzo D’Innocenzo, 2024. "Reverse engineering the last-minute on-line pricing practices: an application to hotels," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 33(3), pages 943-971, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lasek, Jan & Gagolewski, Marek, 2021. "Interpretable sports team rating models based on the gradient descent algorithm," International Journal of Forecasting, Elsevier, vol. 37(3), pages 1061-1071.
    2. da Costa, Igor Barbosa & Marinho, Leandro Balby & Pires, Carlos Eduardo Santos, 2022. "Forecasting football results and exploiting betting markets: The case of “both teams to score”," International Journal of Forecasting, Elsevier, vol. 38(3), pages 895-909.
    3. Szczecinski, Leszek, 2022. "G-Elo: generalization of the Elo algorithm by modeling the discretized margin of victory," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 18(1), pages 1-14, March.
    4. Gross, Johannes & Rebeggiani, Luca, 2018. "Chance or Ability? The Efficiency of the Football Betting Market Revisited," MPRA Paper 87230, University Library of Munich, Germany.
    5. Raffaele Mattera, 2023. "Forecasting binary outcomes in soccer," Annals of Operations Research, Springer, vol. 325(1), pages 115-134, June.
    6. Baboota, Rahul & Kaur, Harleen, 2019. "Predictive analysis and modelling football results using machine learning approach for English Premier League," International Journal of Forecasting, Elsevier, vol. 35(2), pages 741-755.
    7. Nguyen, Hoang & Javed, Farrukh, 2023. "Dynamic relationship between Stock and Bond returns: A GAS MIDAS copula approach," Journal of Empirical Finance, Elsevier, vol. 73(C), pages 272-292.
    8. Siem Jan Koopman & Rutger Lit & André Lucas & Anne Opschoor, 2018. "Dynamic discrete copula models for high‐frequency stock price changes," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(7), pages 966-985, November.
    9. Virbickaitė, Audronė & Nguyen, Hoang & Tran, Minh-Ngoc, 2023. "Bayesian predictive distributions of oil returns using mixed data sampling volatility models," Resources Policy, Elsevier, vol. 86(PA).
    10. Wunderlich, Fabian & Memmert, Daniel, 2020. "Are betting returns a useful measure of accuracy in (sports) forecasting?," International Journal of Forecasting, Elsevier, vol. 36(2), pages 713-722.
    11. Wheatcroft, Edward, 2020. "A profitable model for predicting the over/under market in football," LSE Research Online Documents on Economics 103712, London School of Economics and Political Science, LSE Library.
    12. Tobias Eckernkemper & Bastian Gribisch, 2021. "Intraday conditional value at risk: A periodic mixed‐frequency generalized autoregressive score approach," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 40(5), pages 883-910, August.
    13. Wheatcroft, Edward, 2020. "A profitable model for predicting the over/under market in football," International Journal of Forecasting, Elsevier, vol. 36(3), pages 916-932.
    14. Blasques, Francisco & van Brummelen, Janneke & Koopman, Siem Jan & Lucas, André, 2022. "Maximum likelihood estimation for score-driven models," Journal of Econometrics, Elsevier, vol. 227(2), pages 325-346.
    15. J. James Reade & Carl Singleton & Alasdair Brown, 2021. "Evaluating strange forecasts: The curious case of football match scorelines," Scottish Journal of Political Economy, Scottish Economic Society, vol. 68(2), pages 261-285, May.
    16. Andrei Shynkevich, 2022. "Informational efficiency of football transfer market," Economics Bulletin, AccessEcon, vol. 42(2), pages 1032-1039.
    17. Hassanniakalager, Arman & Sermpinis, Georgios & Stasinakis, Charalampos & Verousis, Thanos, 2020. "A conditional fuzzy inference approach in forecasting," European Journal of Operational Research, Elsevier, vol. 283(1), pages 196-216.
    18. Andreas Heuer & Oliver Rubner, 2014. "Optimizing the Prediction Process: From Statistical Concepts to the Case Study of Soccer," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-9, September.
    19. Gorgi, Paolo & Koopman, Siem Jan & Li, Mengheng, 2019. "Forecasting economic time series using score-driven dynamic models with mixed-data sampling," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1735-1747.
    20. Marc Garnica-Caparrós & Daniel Memmert & Fabian Wunderlich, 2022. "Artificial data in sports forecasting: a simulation framework for analysing predictive models in sports," Information Systems and e-Business Management, Springer, vol. 20(3), pages 551-580, September.

    More about this item

    Keywords

    Bivariate Poisson; Ordered probit; Skellam; Probabilistic loss function

    JEL classification:

    • C32 - Mathematical and Quantitative Methods - Multiple or Simultaneous Equation Models; Multiple Variables - Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models; Diffusion Processes; State Space Models
