IDEAS home Printed from https://ideas.repec.org/a/bpj/jqsprt/v13y2017i2p79-93n1.html
   My bibliography  Save this article

Identifying NCAA tournament upsets using Balance Optimization Subset Selection

Author

Listed:
  • Dutta Shouvik
  • Jacobson Sheldon H.

    (University of Illinois – Computer Science, Urbana, IL, USA)

  • Sauppe Jason J.

    (University of Wisconsin–La Crosse, Computer Science, La Crosse, WI, USA)

Abstract

The NCAA basketball tournament attracts over 60 million people who fill out a bracket to try to predict the outcome of every tournament game correctly. Predictions are often made on the basis of instinct, statistics, or a combination of the two. This paper proposes a technique to select round-of-64 upsets in the tournament using a Balance Optimization Subset Selection model. The model determines which games feature match-ups that are statistically most similar to the match-ups in historical upsets. The technique is then applied to the tournament in each of the 13 years from 2003 to 2015 in order to select two games as potential upsets each year. Of the 26 selected games, 10 (38.4%) were actual upsets, which is more than twice as many as the expected number of correct selections when using a weighted random selection method.

Suggested Citation

  • Dutta Shouvik & Jacobson Sheldon H. & Sauppe Jason J., 2017. "Identifying NCAA tournament upsets using Balance Optimization Subset Selection," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 13(2), pages 79-93, June.
  • Handle: RePEc:bpj:jqsprt:v:13:y:2017:i:2:p:79-93:n:1
    DOI: 10.1515/jqas-2016-0062
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/jqas-2016-0062
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/jqas-2016-0062?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Alexander G. Nikolaev & Sheldon H. Jacobson & Wendy K. Tam Cho & Jason J. Sauppe & Edward C. Sewell, 2013. "Balance Optimization Subset Selection (BOSS): An Alternative Approach for Causal Inference with Observational Data," Operations Research, INFORMS, vol. 61(2), pages 398-412, April.
    2. Lopez Michael J. & Matthews Gregory J., 2015. "Building an NCAA men’s basketball predictive model and quantifying its success," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 11(1), pages 5-12, March.
    3. Jason J. Sauppe & Sheldon H. Jacobson & Edward C. Sewell, 2014. "Complexity and Approximation Results for the Balance Optimization Subset Selection Model for Causal Inference in Observational Studies," INFORMS Journal on Computing, INFORMS, vol. 26(3), pages 547-566, August.
    4. Alexis Diamond & Jasjeet S. Sekhon, 2013. "Genetic Matching for Estimating Causal Effects: A General Multivariate Matching Method for Achieving Balance in Observational Studies," The Review of Economics and Statistics, MIT Press, vol. 95(3), pages 932-945, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jason J. Sauppe & Sheldon H. Jacobson, 2017. "The role of covariate balance in observational studies," Naval Research Logistics (NRL), John Wiley & Sons, vol. 64(4), pages 323-344, June.
    2. Cousineau, Martin & Verter, Vedat & Murphy, Susan A. & Pineau, Joelle, 2023. "Estimating causal effects with optimization-based methods: A review and empirical comparison," European Journal of Operational Research, Elsevier, vol. 304(2), pages 367-380.
    3. Md Saiful Islam & Md Sarowar Morshed & Md. Noor-E-Alam, 2022. "A Computational Framework for Solving Nonlinear Binary Optimization Problems in Robust Causal Inference," INFORMS Journal on Computing, INFORMS, vol. 34(6), pages 3023-3041, November.
    4. Md Saiful Islam & Md Sarowar Morshed & Gary J Young & Md Noor-E-Alam, 2019. "Robust policy evaluation from large-scale observational studies," PLOS ONE, Public Library of Science, vol. 14(10), pages 1-19, October.
    5. Shouvik Dutta & Jason Sauppe & Sheldon Jacobson, 2016. "Targeted Marketing Using Balance Optimization Subset Selection," Annals of Data Science, Springer, vol. 3(4), pages 423-444, December.
    6. Yu, Haiyan & Yang, Ching-Chi & Yu, Ping, 2023. "Constrained optimization for stratified treatment rules in reducing hospital readmission rates of diabetic patients," European Journal of Operational Research, Elsevier, vol. 308(3), pages 1355-1364.
    7. Jason J. Sauppe & Sheldon H. Jacobson & Edward C. Sewell, 2014. "Complexity and Approximation Results for the Balance Optimization Subset Selection Model for Causal Inference in Observational Studies," INFORMS Journal on Computing, INFORMS, vol. 26(3), pages 547-566, August.
    8. Hochbaum, Dorit S. & Rao, Xu & Sauppe, Jason, 2022. "Network flow methods for the minimum covariate imbalance problem," European Journal of Operational Research, Elsevier, vol. 300(3), pages 827-836.
    9. Hee Youn Kwon & Jason J. Sauppe & Sheldon H. Jacobson, 2019. "Treatment Effect Decomposition and Bootstrap Hypothesis Testing in Observational Studies," Annals of Data Science, Springer, vol. 6(3), pages 491-511, September.
    10. Martin Cousineau & Vedat Verter & Susan A. Murphy & Joelle Pineau, 2022. "Estimating causal effects with optimization-based methods: A review and empirical comparison," Papers 2203.00097, arXiv.org.
    11. Waltl, Sofie R., 2018. "Estimating quantile-specific rental yields for residential housing in Sydney," Regional Science and Urban Economics, Elsevier, vol. 68(C), pages 204-225.
    12. Harsh Parikh & Carlos Varjao & Louise Xu & Eric Tchetgen Tchetgen, 2022. "Validating Causal Inference Methods," Papers 2202.04208, arXiv.org, revised Jul 2022.
    13. Martin Huber, 2019. "An introduction to flexible methods for policy evaluation," Papers 1910.00641, arXiv.org.
    14. Ignacio Lozano-Espitia & Juan Camilo Restrepo Salazar, 2016. "El papel de la infraestructura rural en el desarrollo agrícola en Colombia," Coyuntura Económica, Fedesarrollo, vol. 46(1), pages 107-147, June.
    15. Adeola Oyenubi & Martin Wittenberg, 2021. "Does the choice of balance-measure matter under genetic matching?," Empirical Economics, Springer, vol. 61(1), pages 489-502, July.
    16. Tymon Słoczyński, 2015. "The Oaxaca–Blinder Unexplained Component as a Treatment Effects Estimator," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 77(4), pages 588-604, August.
    17. Ludden Ian G. & Jacobson Sheldon H. & Khatibi Arash & King Douglas M., 2020. "Models for generating NCAA men’s basketball tournament bracket pools," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 16(1), pages 1-15, March.
    18. Jaime Torres, Mónica M. & Carlsson, Fredrik, 2018. "Direct and spillover effects of a social information campaign on residential water-savings," Journal of Environmental Economics and Management, Elsevier, vol. 92(C), pages 222-243.
    19. Pécastaing, Nicolas & Salavarriga, Juan, 2022. "The potential impact of fishing in peruvian marine protected areas (MPAs) on artisanal fishery poverty during El Niño events," Ecological Economics, Elsevier, vol. 202(C).
    20. Gabriele Spilker & Tobias Böhmelt, 2013. "The impact of preferential trade agreements on governmental repression revisited," The Review of International Organizations, Springer, vol. 8(3), pages 343-361, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:jqsprt:v:13:y:2017:i:2:p:79-93:n:1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.