IDEAS home Printed from https://ideas.repec.org/p/tin/wpaper/20180009.html
   My bibliography  Save this paper

The analysis and forecasting of ATP tennis matches using a high-dimensional dynamic model

Author

Listed:
  • P. Gorgi

    (VU Amsterdam, The Netherlands)

  • Siem Jan (S.J.) Koopman

    (VU Amsterdam, The Netherlands; CREATES Aarhus University, Denmark)

  • R. Lit

    (VU Amsterdam, The Netherlands)

Abstract

We propose a basic high-dimensional dynamic model for tennis match results with time varying player-specific abilities for different court surface types. Our statistical model can be treated in a likelihood-based analysis and is capable of handling high-dimensional datasets while the number of parameters remains small. In particular, we analyze 17 years of tennis matches for a panel of over 500 players, which leads to more than 2000 dynamic strength levels. We find that time varying player-specific abilities for different court surfaces are of key importance for analyzing tennis matches. We further consider several other extensions including player-specific explanatory variables and the accountance of specific configurations for Grand Slam tournaments. The estimation results can be used to construct rankings of players for different court surface types. We finally show that our proposed model can also be effective in forecasting. We provide evidence that our model significantly outperforms existing models in the forecasting of tennis match results.

Suggested Citation

  • P. Gorgi & Siem Jan (S.J.) Koopman & R. Lit, 2018. "The analysis and forecasting of ATP tennis matches using a high-dimensional dynamic model," Tinbergen Institute Discussion Papers 18-009/III, Tinbergen Institute.
  • Handle: RePEc:tin:wpaper:20180009
    as

    Download full text from publisher

    File URL: https://papers.tinbergen.nl/18009.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. McHale, Ian & Morton, Alex, 2011. "A Bradley-Terry type model for forecasting tennis match results," International Journal of Forecasting, Elsevier, vol. 27(2), pages 619-630, April.
    2. Baker, Rose D. & McHale, Ian G., 2014. "A dynamic paired comparisons model: Who is the greatest tennis player?," European Journal of Operational Research, Elsevier, vol. 236(2), pages 677-684.
    3. Harvey,Andrew C., 2013. "Dynamic Models for Volatility and Heavy Tails," Cambridge Books, Cambridge University Press, number 9781107630024, September.
    4. Mark E. Glickman, 1999. "Parameter Estimation in Large Dynamic Paired Comparison Experiments," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 48(3), pages 377-394.
    5. Andrew Harvey & Alessandra Luati, 2014. "Filtering With Heavy Tails," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1112-1122, September.
    6. Geweke, John & Amisano, Gianni, 2011. "Optimal prediction pools," Journal of Econometrics, Elsevier, vol. 164(1), pages 130-141, September.
    7. McHale, Ian & Morton, Alex, 2011. "A Bradley-Terry type model for forecasting tennis match results," International Journal of Forecasting, Elsevier, vol. 27(2), pages 619-630.
    8. Drew Creal & Siem Jan Koopman & André Lucas, 2013. "Generalized Autoregressive Score Models With Applications," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 28(5), pages 777-795, August.
    9. Baker, Rose D. & McHale, Ian G., 2017. "An empirical Bayes model for time-varying paired comparisons ratings: Who is the greatest women’s tennis player?," European Journal of Operational Research, Elsevier, vol. 258(1), pages 328-333.
    10. F. Blasques & S. J. Koopman & A. Lucas, 2015. "Information-theoretic optimality of observation-driven time series models for continuous responses," Biometrika, Biometrika Trust, vol. 102(2), pages 325-343.
    11. Diebold, Francis X & Mariano, Roberto S, 2002. "Comparing Predictive Accuracy," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(1), pages 134-144, January.
    12. Irons David J. & Buckley Stephen & Paulden Tim, 2014. "Developing an improved tennis ranking system," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 10(2), pages 109-118, June.
    13. De Lira Salvatierra, Irving & Patton, Andrew J., 2015. "Dynamic copula models and high frequency data," Journal of Empirical Finance, Elsevier, vol. 30(C), pages 120-135.
    14. Boulier, Bryan L. & Stekler, H. O., 1999. "Are sports seedings good predictors?: an evaluation," International Journal of Forecasting, Elsevier, vol. 15(1), pages 83-91, February.
    15. Blasques, Francisco & Koopman, Siem Jan & Łasak, Katarzyna & Lucas, André, 2016. "In-sample confidence bands and out-of-sample forecast bands for time-varying parameters in observation-driven models," International Journal of Forecasting, Elsevier, vol. 32(3), pages 875-887.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Silvia Montagna & Vanessa Orani & Raffaele Argiento, 2021. "Bayesian isotonic logistic regression via constrained splines: an application to estimating the serve advantage in professional tennis," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(2), pages 573-604, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Angelini, Giovanni & Candila, Vincenzo & De Angelis, Luca, 2022. "Weighted Elo rating for tennis match predictions," European Journal of Operational Research, Elsevier, vol. 297(1), pages 120-132.
    2. Blasques, F. & Gorgi, P. & Koopman, S.J., 2019. "Accelerating score-driven time series models," Journal of Econometrics, Elsevier, vol. 212(2), pages 359-376.
    3. Francisco (F.) Blasques & Paolo Gorgi & Siem Jan (S.J.) Koopman, 2017. "Accelerating GARCH and Score-Driven Models: Optimality, Estimation and Forecasting," Tinbergen Institute Discussion Papers 17-059/III, Tinbergen Institute.
    4. Siem Jan Koopman & Rutger Lit & André Lucas & Anne Opschoor, 2018. "Dynamic discrete copula models for high‐frequency stock price changes," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(7), pages 966-985, November.
    5. Gorgi, Paolo & Koopman, Siem Jan & Li, Mengheng, 2019. "Forecasting economic time series using score-driven dynamic models with mixed-data sampling," International Journal of Forecasting, Elsevier, vol. 35(4), pages 1735-1747.
    6. Siem Jan Koopman & Rutger Lit & André Lucas, 2015. "Intraday Stock Price Dependence using Dynamic Discrete Copula Distributions," Tinbergen Institute Discussion Papers 15-037/III/DSF90, Tinbergen Institute.
    7. Mauro Bernardi & Leopoldo Catania, 2015. "Switching-GAS Copula Models With Application to Systemic Risk," Papers 1504.03733, arXiv.org, revised Jan 2016.
    8. Andre Lucas & Anne Opschoor, 2016. "Fractional Integration and Fat Tails for Realized Covariance Kernels and Returns," Tinbergen Institute Discussion Papers 16-069/IV, Tinbergen Institute, revised 07 Jul 2017.
    9. Collingwood, James A.P. & Wright, Michael & Brooks, Roger J, 2022. "Evaluating the effectiveness of different player rating systems in predicting the results of professional snooker matches," European Journal of Operational Research, Elsevier, vol. 296(3), pages 1025-1035.
    10. Marco Bazzi & Francisco Blasques & Siem Jan Koopman & Andre Lucas, 2017. "Time-Varying Transition Probabilities for Markov Regime Switching Models," Journal of Time Series Analysis, Wiley Blackwell, vol. 38(3), pages 458-478, May.
    11. Francisco (F.) Blasques & Andre (A.) Lucas & Andries van Vlodrop, 2017. "Finite Sample Optimality of Score-Driven Volatility Models," Tinbergen Institute Discussion Papers 17-111/III, Tinbergen Institute.
    12. Blasques, Francisco & Lucas, André & van Vlodrop, Andries C., 2021. "Finite Sample Optimality of Score-Driven Volatility Models: Some Monte Carlo Evidence," Econometrics and Statistics, Elsevier, vol. 19(C), pages 47-57.
    13. Blasques, Francisco & van Brummelen, Janneke & Koopman, Siem Jan & Lucas, André, 2022. "Maximum likelihood estimation for score-driven models," Journal of Econometrics, Elsevier, vol. 227(2), pages 325-346.
    14. Kovalchik, Stephanie, 2020. "Extension of the Elo rating system to margin of victory," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1329-1341.
    15. Giovanni Angelini & Paolo Gorgi, 2018. "DSGE Models with Observation-Driven Time-Varying parameters," Tinbergen Institute Discussion Papers 18-030/III, Tinbergen Institute.
    16. Alberto Arcagni & Vincenzo Candila & Rosanna Grassi, 2023. "A new model for predicting the winner in tennis based on the eigenvector centrality," Annals of Operations Research, Springer, vol. 325(1), pages 615-632, June.
    17. Anne Opschoor & André Lucas, 2019. "Time-varying tail behavior for realized kernels," Tinbergen Institute Discussion Papers 19-051/IV, Tinbergen Institute.
    18. Catania, Leopoldo & Proietti, Tommaso, 2020. "Forecasting volatility with time-varying leverage and volatility of volatility effects," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1301-1317.
    19. Bernardi, Mauro & Catania, Leopoldo, 2018. "Portfolio optimisation under flexible dynamic dependence modelling," Journal of Empirical Finance, Elsevier, vol. 48(C), pages 1-18.
    20. Caballero, Diego & Lucas, André & Schwaab, Bernd & Zhang, Xin, 2020. "Risk endogeneity at the lender/investor-of-last-resort," Journal of Monetary Economics, Elsevier, vol. 116(C), pages 283-297.

    More about this item

    Keywords

    Sports statistics; Score-driven time series models; Rankings; Forecasting.;
    All these keywords.

    JEL classification:

    • C32 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models; Diffusion Processes; State Space Models
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tin:wpaper:20180009. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tinbergen Office +31 (0)10-4088900 (email available below). General contact details of provider: https://edirc.repec.org/data/tinbenl.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.