IDEAS home Printed from https://ideas.repec.org/a/inm/orisre/v18y2007i1p4-22.html
   My bibliography  Save this article

Decision-Centric Active Learning of Binary-Outcome Models

Author

Listed:
  • Maytal Saar-Tsechansky

    (Red McCombs School of Business, University of Texas at Austin, Austin, Texas 78712)

  • Foster Provost

    (Leonard N. Stern School of Business, New York University, 44 West Fourth Street, New York, New York 10012)

Abstract

It can be expensive to acquire the data required for businesses to employ data-driven predictive modeling---for example, to model consumer preferences to optimize targeting. Prior research has introduced “active-learning” policies for identifying data that are particularly useful for model induction, with the goal of decreasing the statistical error for a given acquisition cost ( error-centric approaches). However, predictive models are used as part of a decision-making process, and costly improvements in model accuracy do not always result in better decisions. This paper introduces a new approach for active data acquisition that specifically targets decision making. The new decision-centric approach departs from traditional active learning by placing emphasis on acquisitions that are more likely to affect decision making. We describe two different types of decision-centric techniques. Next, using direct-marketing data, we compare various data-acquisition techniques. We demonstrate that strategies for reducing statistical error can be wasteful in a decision-making context, and show that one decision-centric technique in particular can improve targeting decisions significantly. We also show that this method is robust in the face of decreasing quality of utility estimations, eventually converging to uniform random sampling, and that it can be extended to situations where different data acquisitions have different costs. The results suggest that businesses should consider modifying their strategies for acquiring information through normal business transactions. For example, a firm such as Amazon.com that models consumer preferences for customized marketing may accelerate learning by proactively offering recommendations---not merely to induce immediate sales, but for improving recommendations in the future.

Suggested Citation

  • Maytal Saar-Tsechansky & Foster Provost, 2007. "Decision-Centric Active Learning of Binary-Outcome Models," Information Systems Research, INFORMS, vol. 18(1), pages 4-22, March.
  • Handle: RePEc:inm:orisre:v:18:y:2007:i:1:p:4-22
    DOI: 10.1287/isre.1070.0111
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/isre.1070.0111
    Download Restriction: no

    File URL: https://libkey.io/10.1287/isre.1070.0111?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Kevin F. McCardle, 1985. "Information Acquisition and the Adoption of New Technology," Management Science, INFORMS, vol. 31(11), pages 1372-1389, November.
    2. Patricia M. West & Patrick L. Brockett & Linda L. Golden, 1997. "A Comparative Analysis of Neural Networks and Statistical Methods for Predicting Consumer Choice," Marketing Science, INFORMS, vol. 16(4), pages 370-391.
    3. Heckman, James, 2013. "Sample selection bias as a specification error," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 31(3), pages 129-137.
    4. Wendy W. Moe & Peter S. Fader, 2004. "Dynamic Conversion Behavior at E-Commerce Sites," Management Science, INFORMS, vol. 50(3), pages 326-335, March.
    5. Kornish, Laura J., 2006. "Technology choice and timing with positive network effects," European Journal of Operational Research, Elsevier, vol. 173(1), pages 268-282, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mi, Yunlong & Wang, Zongrun & Quan, Pei & Shi, Yong, 2024. "A semi-supervised concept-cognitive computing system for dynamic classification decision making with limited feedback information," European Journal of Operational Research, Elsevier, vol. 315(3), pages 1123-1138.
    2. Yingfei Wang & Inbal Yahav & Balaji Padmanabhan, 2024. "Smart Testing with Vaccination: A Bandit Algorithm for Active Sampling for Managing COVID-19," Information Systems Research, INFORMS, vol. 35(1), pages 120-144, March.
    3. Alain Bensoussan & Radha Mookerjee & Vijay Mookerjee & Wei T. Yue, 2009. "Maintaining Diagnostic Knowledge-Based Systems: A Control-Theoretic Approach," Management Science, INFORMS, vol. 55(2), pages 294-310, February.
    4. Aurélie Lemmens & Sunil Gupta, 2020. "Managing Churn to Maximize Profits," Marketing Science, INFORMS, vol. 39(5), pages 956-973, September.
    5. Xuan Bi & Mochen Yang & Gediminas Adomavicius, 2024. "Consumer Acquisition for Recommender Systems: A Theoretical Framework and Empirical Evaluations," Information Systems Research, INFORMS, vol. 35(1), pages 339-362, March.
    6. Carlos Fernández-Loría & Foster Provost, 2022. "Causal Decision Making and Causal Effect Estimation Are Not the Same…and Why It Matters," INFORMS Joural on Data Science, INFORMS, vol. 1(1), pages 4-16, April.
    7. Stefan Lessmann & Stefan Voß, 2010. "Customer-Centric Decision Support," Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK, Springer;Gesellschaft für Informatik e.V. (GI), vol. 2(2), pages 79-93, April.
    8. Jing Wang & Panagiotis G. Ipeirotis & Foster Provost, 2017. "Cost-Effective Quality Assurance in Crowd Labeling," Information Systems Research, INFORMS, vol. 28(1), pages 137-158, March.
    9. Bei Yan & Feng Mai & Chaojiang Wu & Rui Chen & Xiaolin Li, 2024. "A Computational Framework for Understanding Firm Communication During Disasters," Information Systems Research, INFORMS, vol. 35(2), pages 590-608, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Prabuddha De & Yu (Jeffrey) Hu & Mohammad S. Rahman, 2010. "Technology Usage and Online Sales: An Empirical Study," Management Science, INFORMS, vol. 56(11), pages 1930-1945, November.
    2. Brozynski, Max T. & Leibowicz, Benjamin D., 2020. "Markov models of policy support for technology transitions," European Journal of Operational Research, Elsevier, vol. 286(3), pages 1052-1069.
    3. Lauren E. Cipriano & Thomas A. Weber, 2018. "Population-level intervention and information collection in dynamic healthcare policy," Health Care Management Science, Springer, vol. 21(4), pages 604-631, December.
    4. Dapeng Cui & David Curry, 2005. "Prediction in Marketing Using the Support Vector Machine," Marketing Science, INFORMS, vol. 24(4), pages 595-615, January.
    5. Laura J. Kornish & Ralph L. Keeney, 2008. "Repeated Commit-or-Defer Decisions with a Deadline: The Influenza Vaccine Composition," Operations Research, INFORMS, vol. 56(3), pages 527-541, June.
    6. Darima Fotheringham & Michael A. Wiles, 2023. "The effect of implementing chatbot customer service on stock returns: an event study analysis," Journal of the Academy of Marketing Science, Springer, vol. 51(4), pages 802-822, July.
    7. Song, Wei-Ling & Uzmanoglu, Cihan, 2016. "TARP announcement, bank health, and borrowers’ credit risk," Journal of Financial Stability, Elsevier, vol. 22(C), pages 22-32.
    8. Raymundo M. Campos-Vázquez, 2013. "Efectos de los ingresos no reportados en el nivel y tendencia de la pobreza laboral en México," Ensayos Revista de Economia, Universidad Autonoma de Nuevo Leon, Facultad de Economia, vol. 0(2), pages 23-54, November.
    9. Stephen Brown & William Goetzmann & Bing Liang & Christopher Schwarz, 2008. "Mandatory Disclosure and Operational Risk: Evidence from Hedge Fund Registration," Journal of Finance, American Finance Association, vol. 63(6), pages 2785-2815, December.
    10. Paul W. Miller & Barry R. Chiswick, 2002. "Immigrant earnings: Language skills, linguistic concentrations and the business cycle," Journal of Population Economics, Springer;European Society for Population Economics, vol. 15(1), pages 31-57.
    11. Chul‐Woo Kwon & Peter F. Orazem & Daniel M. Otto, 2006. "Off‐farm labor supply responses to permanent and transitory farm income," Agricultural Economics, International Association of Agricultural Economists, vol. 34(1), pages 59-67, January.
    12. Jonathan Gruber & Aaron Yelowitz, 1999. "Public Health Insurance and Private Savings," Journal of Political Economy, University of Chicago Press, vol. 107(6), pages 1249-1274, December.
    13. Jean-Louis Arcand & Linguère M'Baye, 2013. "Braving the waves: the role of time and risk preferences in illegal migration from Senegal," CERDI Working papers halshs-00855937, HAL.
    14. Sandra Müllbacher & Wolfgang Nagl, 2017. "Labour supply in Austria: an assessment of recent developments and the effects of a tax reform," Empirica, Springer;Austrian Institute for Economic Research;Austrian Economic Association, vol. 44(3), pages 465-486, August.
    15. Campbell, Randall C. & Nagel, Gregory L., 2016. "Private information and limitations of Heckman's estimator in banking and corporate finance research," Journal of Empirical Finance, Elsevier, vol. 37(C), pages 186-195.
    16. Leye Li & Louise Yi Lu & Dongyue Wang, 2022. "External labour market competitions and stock price crash risk: evidence from exposures to competitor CEOs’ award‐winning events," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 62(S1), pages 1421-1460, April.
    17. Jože P. Damijan & Mark Knell, 2005. "How Important Is Trade and Foreign Ownership in Closing the Technology Gap? Evidence from Estonia and Slovenia," Review of World Economics (Weltwirtschaftliches Archiv), Springer;Institut für Weltwirtschaft (Kiel Institute for the World Economy), vol. 141(2), pages 271-295, July.
    18. Calcagno, R. & Renneboog, L.D.R., 2004. "Capital Structure and Managerial Compensation : The Effects of Renumeration Seniority," Discussion Paper 2004-120, Tilburg University, Center for Economic Research.
    19. Nakashima, Kiyotaka & Ogawa, Toshiaki, 2020. "The Impacts of Strengthening Regulatory Surveillance on Bank Behavior: A Dynamic Analysis from Incomplete to Complete Enforcement of Capital Regulation in Microprudential Policy," MPRA Paper 99938, University Library of Munich, Germany.
    20. Sarah Bridges & David Lawson, 2008. "Health and Labour Market Participation in Uganda," WIDER Working Paper Series DP2008-07, World Institute for Development Economic Research (UNU-WIDER).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orisre:v:18:y:2007:i:1:p:4-22. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.