IDEAS home Printed from https://ideas.repec.org/a/bpj/jqsprt/v11y2015i1p29-37n2.html
   My bibliography  Save this article

Nearest-neighbor matchup effects: accounting for team matchups for predicting March Madness

Author

Listed:
  • Hoegh Andrew

    (Virginia Tech – Department of Statistics, Blacksburg, VA, USA)

  • Carzolio Marcos

    (Virginia Tech – Department of Statistics, Blacksburg, VA, USA)

  • Crandell Ian

    (Virginia Tech – Department of Statistics, Blacksburg, VA, USA)

  • Hu Xinran

    (Virginia Tech – Department of Statistics, Blacksburg, VA, USA)

  • Roberts Lucas

    (Virginia Tech – Department of Statistics, Blacksburg, VA, USA)

  • Song Yuhyun

    (Virginia Tech – Department of Statistics, Blacksburg, VA, USA)

  • Leman Scotland C.

    (Virginia Tech – Department of Statistics, Blacksburg, VA, USA)

Abstract

Recently, the surge of predictive analytics competitions has improved sports predictions by fostering data-driven inference and steering clear of human bias. This article details methods developed for Kaggle’s March Machine Learning Mania competition for the 2014 NCAA tournament. A submission to the competition consists of outcome probabilities for each potential matchup. Most predictive models are based entirely on measures of overall team strength, resulting in the unintended “transitive property.” These models are therefore unable to capture specific matchup tendencies. We introduce our novel nearest-neighbor matchup effects framework, which presents a flexible way to account for team characteristics above and beyond team strength that may influence game outcomes. In particular we develop a general framework that couples a model predicting a point spread with a clustering procedure that borrows strength from games similar to a current matchup. This results in a model capable of issuing predictions controlling for team strength and that capture specific matchup characteristics.

Suggested Citation

  • Hoegh Andrew & Carzolio Marcos & Crandell Ian & Hu Xinran & Roberts Lucas & Song Yuhyun & Leman Scotland C., 2015. "Nearest-neighbor matchup effects: accounting for team matchups for predicting March Madness," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 11(1), pages 29-37, March.
  • Handle: RePEc:bpj:jqsprt:v:11:y:2015:i:1:p:29-37:n:2
    DOI: 10.1515/jqas-2014-0054
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/jqas-2014-0054
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/jqas-2014-0054?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. West Brady T, 2006. "A Simple and Flexible Rating Method for Predicting Success in the NCAA Basketball Tournament," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 2(3), pages 1-16, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Daniel C. Hickman & Andrew G. Meyer, 2017. "Does Athletic Success Influence Persistence At Higher Education Institutions? New Evidence Using Panel Data," Contemporary Economic Policy, Western Economic Association International, vol. 35(4), pages 658-676, October.
    2. Ludden Ian G. & Khatibi Arash & King Douglas M. & Jacobson Sheldon H., 2020. "Models for generating NCAA men’s basketball tournament bracket pools," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 16(1), pages 1-15, March.
    3. repec:ebl:ecbull:v:4:y:2007:i:34:p:1-7 is not listed on IDEAS
    4. Karl Andrew T., 2012. "The Sensitivity of College Football Rankings to Several Modeling Choices," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 8(3), pages 1-44, October.
    5. West Brady T & Lamsal Madhur, 2008. "A New Application of Linear Modeling in the Prediction of College Football Bowl Outcomes and the Development of Team Ratings," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 4(3), pages 1-21, July.
    6. Daniel F. Stone & Jeremy Arkes, 2018. "March Madness? Underreaction To Hot And Cold Hands In Ncaa Basketball," Economic Inquiry, Western Economic Association International, vol. 56(3), pages 1724-1747, July.
    7. Gupta Ajay Andrew, 2015. "A new approach to bracket prediction in the NCAA Men’s Basketball Tournament based on a dual-proportion likelihood," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 11(1), pages 53-67, March.
    8. Coleman Jay & Lynch Allen K, 2009. "NCAA Tournament Games: The Real Nitty-Gritty," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 5(3), pages 1-27, July.
    9. Cassey Lee, 2007. "A Cheap Ticket to the Dance: Systematic Bias in College Basketball's Ratings Percentage Index," Economics Bulletin, AccessEcon, vol. 4(34), pages 1-7.
    10. B. Jay Coleman & J. Michael DuMond & Allen K. Lynch, 2010. "Evidence of bias in NCAA tournament selection and seeding," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 31(7), pages 431-452.
    11. Song, Kai & Shi, Jian, 2020. "A gamma process based in-play prediction model for National Basketball Association games," European Journal of Operational Research, Elsevier, vol. 283(2), pages 706-713.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:jqsprt:v:11:y:2015:i:1:p:29-37:n:2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.