IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2306.01740.html
   My bibliography  Save this paper

Not feeling the buzz: Correction study of mispricing and inefficiency in online sportsbooks

Author

Listed:
  • Lawrence Clegg
  • John Cartlidge

Abstract

We present a replication and correction of a recent article (Ramirez, P., Reade, J.J., Singleton, C., Betting on a buzz: Mispricing and inefficiency in online sportsbooks, International Journal of Forecasting, 39:3, 2023, pp. 1413-1423, doi: 10.1016/j.ijforecast.2022.07.011). RRS measure profile page views on Wikipedia to generate a "buzz factor" metric for tennis players and show that it can be used to form a profitable gambling strategy by predicting bookmaker mispricing. Here, we use the same dataset as RRS to reproduce their results exactly, thus confirming the robustness of their mispricing claim. However, we discover that the published betting results are significantly affected by a single bet (the "Hercog" bet), which returns substantial outlier profits based on erroneously long odds. When this data quality issue is resolved, the majority of reported profits disappear and only one strategy, which bets on "competitive" matches, remains significantly profitable in the original out-of-sample period. While one profitable strategy offers weaker support than the original study, it still provides an indication that market inefficiencies may exist, as originally claimed by RRS. As an extension, we continue backtesting after 2020 on a cleaned dataset. Results show that (a) the "competitive" strategy generates no further profits, potentially suggesting markets have become more efficient, and (b) model coefficients estimated over this more recent period are no longer reliable predictors of bookmaker mispricing. We present this work as a case study demonstrating the importance of replication studies in sports forecasting, and the necessity to clean data. We open-source release comprehensive datasets and code.

Suggested Citation

  • Lawrence Clegg & John Cartlidge, 2023. "Not feeling the buzz: Correction study of mispricing and inefficiency in online sportsbooks," Papers 2306.01740, arXiv.org, revised Jul 2024.
  • Handle: RePEc:arx:papers:2306.01740
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2306.01740
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hyndman, Rob J., 2010. "Encouraging replication and reproducible research," International Journal of Forecasting, Elsevier, vol. 26(1), pages 2-3, January.
    2. Boylan, John E. & Goodwin, Paul & Mohammadipour, Maryam & Syntetos, Aris A., 2015. "Reproducibility in forecasting research," International Journal of Forecasting, Elsevier, vol. 31(1), pages 79-90.
    3. Boshnakov, Georgi & Kharrat, Tarak & McHale, Ian G., 2017. "A bivariate Weibull count model for forecasting association football scores," International Journal of Forecasting, Elsevier, vol. 33(2), pages 458-466.
    4. Wheatcroft, Edward, 2020. "A profitable model for predicting the over/under market in football," LSE Research Online Documents on Economics 103712, London School of Economics and Political Science, LSE Library.
    5. Ramirez, Philip & Reade, J. James & Singleton, Carl, 2023. "Betting on a buzz: Mispricing and inefficiency in online sportsbooks," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1413-1423.
    6. Angelini, Giovanni & Candila, Vincenzo & De Angelis, Luca, 2022. "Weighted Elo rating for tennis match predictions," European Journal of Operational Research, Elsevier, vol. 297(1), pages 120-132.
    7. Wheatcroft, Edward, 2020. "A profitable model for predicting the over/under market in football," International Journal of Forecasting, Elsevier, vol. 36(3), pages 916-932.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rostami-Tabar, Bahman & Ali, Mohammad M. & Hong, Tao & Hyndman, Rob J. & Porter, Michael D. & Syntetos, Aris, 2022. "Forecasting for social good," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1245-1257.
    2. Holmes, Benjamin & McHale, Ian G., 2024. "Forecasting football match results using a player rating based model," International Journal of Forecasting, Elsevier, vol. 40(1), pages 302-312.
    3. Di Fonzo, Tommaso & Girolimetto, Daniele, 2024. "Forecast combination-based forecast reconciliation: Insights and extensions," International Journal of Forecasting, Elsevier, vol. 40(2), pages 490-514.
    4. Marc Garnica-Caparrós & Daniel Memmert & Fabian Wunderlich, 2022. "Artificial data in sports forecasting: a simulation framework for analysing predictive models in sports," Information Systems and e-Business Management, Springer, vol. 20(3), pages 551-580, September.
    5. Raffaele Mattera, 2023. "Forecasting binary outcomes in soccer," Annals of Operations Research, Springer, vol. 325(1), pages 115-134, June.
    6. Holmes, Benjamin & McHale, Ian G. & Żychaluk, Kamila, 2023. "A Markov chain model for forecasting results of mixed martial arts contests," International Journal of Forecasting, Elsevier, vol. 39(2), pages 623-640.
    7. Baboota, Rahul & Kaur, Harleen, 2019. "Predictive analysis and modelling football results using machine learning approach for English Premier League," International Journal of Forecasting, Elsevier, vol. 35(2), pages 741-755.
    8. Banerjee, Nilabhra & Morton, Alec & Akartunalı, Kerem, 2020. "Passenger demand forecasting in scheduled transportation," European Journal of Operational Research, Elsevier, vol. 286(3), pages 797-810.
    9. Singleton, Carl & Reade, J. James & Brown, Alasdair, 2020. "Going with your gut: The (In)accuracy of forecast revisions in a football score prediction game," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 89(C).
    10. Hubáček, Ondřej & Šír, Gustav, 2023. "Beating the market with a bad predictive model," International Journal of Forecasting, Elsevier, vol. 39(2), pages 691-719.
    11. Groll Andreas & Kneib Thomas & Mayr Andreas & Schauberger Gunther, 2018. "On the dependency of soccer scores – a sparse bivariate Poisson model for the UEFA European football championship 2016," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 14(2), pages 65-79, June.
    12. He, Xue-Zhong & Treich, Nicolas, 2017. "Prediction market prices under risk aversion and heterogeneous beliefs," Journal of Mathematical Economics, Elsevier, vol. 70(C), pages 105-114.
    13. da Costa, Igor Barbosa & Marinho, Leandro Balby & Pires, Carlos Eduardo Santos, 2022. "Forecasting football results and exploiting betting markets: The case of “both teams to score”," International Journal of Forecasting, Elsevier, vol. 38(3), pages 895-909.
    14. Butler, David & Butler, Robert & Eakins, John, 2021. "Expert performance and crowd wisdom: Evidence from English Premier League predictions," European Journal of Operational Research, Elsevier, vol. 288(1), pages 170-182.
    15. Wunderlich, Fabian & Memmert, Daniel, 2020. "Are betting returns a useful measure of accuracy in (sports) forecasting?," International Journal of Forecasting, Elsevier, vol. 36(2), pages 713-722.
    16. Ramirez, Philip & Reade, J. James & Singleton, Carl, 2023. "Betting on a buzz: Mispricing and inefficiency in online sportsbooks," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1413-1423.
    17. Hyndman, Rob J., 2020. "A brief history of forecasting competitions," International Journal of Forecasting, Elsevier, vol. 36(1), pages 7-14.
    18. Kharrat, Tarak & McHale, Ian G. & Peña, Javier López, 2020. "Plus–minus player ratings for soccer," European Journal of Operational Research, Elsevier, vol. 283(2), pages 726-736.
    19. Wheatcroft, Edward, 2020. "A profitable model for predicting the over/under market in football," LSE Research Online Documents on Economics 103712, London School of Economics and Political Science, LSE Library.
    20. Szczecinski Leszek, 2022. "G-Elo: generalization of the Elo algorithm by modeling the discretized margin of victory," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 18(1), pages 1-14, March.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2306.01740. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.