IDEAS home Printed from https://ideas.repec.org/a/inm/orited/v25y2024i1p55-80.html
   My bibliography  Save this article

An Interactive Spreadsheet Model for Teaching Classification Using Logistic Regression

Author

Listed:
  • Vahid Roshanaei

    (Operations Management & Statistics, Rotman School of Management, University of Toronto, Toronto, Ontario M5S 1A1, Canada)

  • Bahman Naderi

    (Department of Mechanical, Automotive, and Material Engineering, University of Windsor, Windsor, Ontario N9B 3P4, Canada)

  • Opher Baron

    (Operations Management & Statistics, Rotman School of Management, University of Toronto, Toronto, Ontario M5S 1A1, Canada)

  • Dmitry Krass

    (Operations Management & Statistics, Rotman School of Management, University of Toronto, Toronto, Ontario M5S 1A1, Canada)

Abstract

We present an interactive spreadsheet that supports teaching essential concepts in classification using the logistic regression (LoR) model for binary classification. The interactive spreadsheet demonstrates the capabilities of LoR by integrating computation with visualization. Students will reinforce concepts like probabilities, maximum likelihood estimation (MLE), and the use of likelihoods to optimize parameters for the LoR. We then discuss using LoR for classifications while adjusting its decision boundary (DB), demonstrating how to convert assigned likelihoods into classification using the DB; impact classification outcome by varying DBs; designate predictions as true positive, true negative, false positive, or false negative; and determine the classification accuracy. We use a variety of performance measures, including sensitivity, specificity, precision, negative predictive value, F 1 and F 2 scores, the receiver operating characteristics curve, and lift/decile charts. These measures are dynamically adjusted when the DB changes. We also reiterate the usage of these measures in the context of crossvalidation and imbalanced data sets. We provide a case study that implements LoR and an option for teaching the details behind MLE. We discuss the pedagogical aspects of this spreadsheet based on a survey of the 2022 student cohort in the Master of Management Analytics Program at the Rotman School of Management.

Suggested Citation

  • Vahid Roshanaei & Bahman Naderi & Opher Baron & Dmitry Krass, 2024. "An Interactive Spreadsheet Model for Teaching Classification Using Logistic Regression," INFORMS Transactions on Education, INFORMS, vol. 25(1), pages 55-80, September.
  • Handle: RePEc:inm:orited:v:25:y:2024:i:1:p:55-80
    DOI: 10.1287/ited.2022.0022
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/ited.2022.0022
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ited.2022.0022?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Guangxin Jiang & L. Jeff Hong & Barry L. Nelson, 2020. "Online Risk Monitoring Using Offline Simulation," INFORMS Journal on Computing, INFORMS, vol. 32(2), pages 356-375, April.
    2. Eric Huggins & Matt Bailey & Ivan Guardiola, 2020. "Case Article—Converting NFL Point Spreads into Probabilities: A Case Study for Teaching Business Analytics," INFORMS Transactions on Education, INFORMS, vol. 21(1), pages 57-60, September.
    3. Donghyun Kim & Namyong Kim & Junoh Cho & Hayong Shin, 2019. "Optimizing the Multistage University Admission Decision Process," Service Science, INFORMS, vol. 49(6), pages 422-429, November.
    4. Teodora Dan & Patrice Marcotte, 2019. "Competitive Facility Location with Selfish Users and Queues," Operations Research, INFORMS, vol. 67(2), pages 479-497, March.
    5. David Kopcso & Dessislava Pachamanova, 2018. "Case Article—Business Value in Integrating Predictive and Prescriptive Analytics Models," INFORMS Transactions on Education, INFORMS, vol. 19(1), pages 36-42, September.
    6. Michael Brusco, 2022. "Logistic Regression via Excel Spreadsheets: Mechanics, Model Selection, and Relative Predictor Importance," INFORMS Transactions on Education, INFORMS, vol. 23(1), pages 1-11, September.
    7. Matthew Liberatore & Robert Nydick & Constantine Daskalakis & Elisabeth Kunkel & James Cocroft & Ronald Myers, 2009. "Helping Men Decide About Scheduling a Prostate Cancer Screening Exam," Interfaces, INFORMS, vol. 39(3), pages 209-217, June.
    8. Justin J. Boutilier & Timothy C. Y. Chan, 2023. "Introducing and Integrating Machine Learning in an Operations Research Curriculum: An Application-Driven Course," INFORMS Transactions on Education, INFORMS, vol. 23(2), pages 64-83, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jalili Marand, Ata & Hoseinpour, Pooya, 2024. "A congested facility location problem with strategic customers," European Journal of Operational Research, Elsevier, vol. 318(2), pages 442-456.
    2. Dessislava Pachamanova & Vera Tilson & Keely Dwyer-Matzky, 2022. "Case Article—Machine Learning, Ethics, and Change Management: A Data-Driven Approach to Improving Hospital Observation Unit Operations," INFORMS Transactions on Education, INFORMS, vol. 22(3), pages 178-187, May.
    3. Matthew J. Drake, 2024. "Case Article—Creating a Brick Empire Through Data Visualization and Analytics," INFORMS Transactions on Education, INFORMS, vol. 24(3), pages 271-277, May.
    4. Zhang, Wenwei & Xu, Min & Wang, Shuaian, 2023. "Joint location and pricing optimization of self-service in urban logistics considering customers’ choice behavior," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 174(C).
    5. Kobkoon Janngam & Suthep Suantai & Yeol Je Cho & Attapol Kaewkhao & Rattanakorn Wattanataweekul, 2023. "A Novel Inertial Viscosity Algorithm for Bilevel Optimization Problems Applied to Classification Problems," Mathematics, MDPI, vol. 11(14), pages 1-15, July.
    6. Fadda, Edoardo & Manerba, Daniele & Cabodi, Gianpiero & Camurati, Paolo Enrico & Tadei, Roberto, 2021. "Comparative analysis of models and performance indicators for optimal service facility location," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 145(C).
    7. Ralf Krohn & Sven Müller & Knut Haase, 2021. "Preventive healthcare facility location planning with quality-conscious clients," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 43(1), pages 59-87, March.
    8. Iacocca, Kathleen & Mahar, Stephen & Daniel Wright, P., 2022. "Strategic horizontal integration for drug cost reduction in the pharmaceutical supply chain," Omega, Elsevier, vol. 108(C).
    9. Jack P. C. Kleijnen & Wim C. M. van Beers, 2022. "Statistical Tests for Cross-Validation of Kriging Models," INFORMS Journal on Computing, INFORMS, vol. 34(1), pages 607-621, January.
    10. Sneha Dhyani Bhatt & Sachin Jayaswal & Ankur Sinha & Navneet Vidyarthi, 2021. "Alternate second order conic program reformulations for hub location under stochastic demand and congestion," Annals of Operations Research, Springer, vol. 304(1), pages 481-527, September.
    11. Bhatt, Sneha Dhyani & Sinha, Ankur & Jayaswal, Sachin, 2024. "The capacitated r-hub interdiction problem with congestion: Models and solution approaches," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 185(C).
    12. Yun Hui Lin & Qingyun Tian & Yanlu Zhao, 2024. "Unified Framework for Choice-Based Facility Location Problem," INFORMS Journal on Computing, INFORMS, vol. 36(6), pages 1436-1458, December.
    13. Guangxin Jiang & L. Jeff Hong & Haihui Shen, 2024. "Real-Time Derivative Pricing and Hedging with Consistent Metamodels," INFORMS Journal on Computing, INFORMS, vol. 36(5), pages 1168-1189, September.
    14. Marianov, Vladimir & Eiselt, H.A., 2024. "Fifty Years of Location Theory - A Selective Review," European Journal of Operational Research, Elsevier, vol. 318(3), pages 701-718.
    15. Michael Brusco, 2022. "Logistic Regression via Excel Spreadsheets: Mechanics, Model Selection, and Relative Predictor Importance," INFORMS Transactions on Education, INFORMS, vol. 23(1), pages 1-11, September.
    16. Xin Yun & Yanyi Ye & Hao Liu & Yi Li & Kin-Keung Lai, 2023. "Stylized Model of Lévy Process in Risk Estimation," Mathematics, MDPI, vol. 11(6), pages 1-14, March.
    17. Han, Shuihua & Chen, Linlin & Su, Zhaopei & Gupta, Shivam & Sivarajah, Uthayasankar, 2024. "Identifying a good business location using prescriptive analytics: Restaurant location recommendation based on spatial data mining," Journal of Business Research, Elsevier, vol. 179(C).
    18. Decui Liang & Fangshun Li & Xinyi Chen, 2024. "Failure mode and effect analysis by exploiting text mining and multi-view group consensus for the defect detection of electric vehicles in social media data," Annals of Operations Research, Springer, vol. 340(1), pages 289-324, September.
    19. Thomas C. Sharkey & Steve Bublak & Lisa Disselkamp & Brittney Shkil, 2020. "Workforce Scheduling for Airport Immigration on the Island of Tropical Paradise," INFORMS Transactions on Education, INFORMS, vol. 20(2), pages 85-89, January.
    20. Klein, Michael G. & Verter, Vedat & Moses, Brian G., 2020. "Designing a rural network of dialysis facilities," European Journal of Operational Research, Elsevier, vol. 282(3), pages 1088-1100.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orited:v:25:y:2024:i:1:p:55-80. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.