IDEAS home Printed from https://ideas.repec.org/a/inm/ormnsc/v52y2006i5p697-712.html
   My bibliography  Save this article

Selectively Acquiring Customer Information: A New Data Acquisition Problem and an Active Learning-Based Solution

Author

Listed:
  • Zhiqiang Zheng

    (A. Gary Anderson Graduate School of Management, University of California, Riverside, 18 Anderson Hall, Riverside, California 92521)

  • Balaji Padmanabhan

    (The Wharton School, University of Pennsylvania, 3730 Walnut Street, Philadelphia, Pennsylvania 19104)

Abstract

This paper presents a new information acquisition problem motivated by business applications where customer data has to be acquired with a specific modeling objective in mind. In the last two decades, there has been substantial work in two different fields--optimal experimental design and machine learning--that has addressed the issue of acquiring data in a selective manner with a specific objective in mind. We show that the problem presented here is different from the classic model-based data acquisition problems considered thus far in the literature in both fields. Building on work in optimal experimental design and in machine learning, we develop a new active learning technique for the information acquisition problem presented in this paper. We demonstrate that the proposed method performs well based on results from applying this method across 20 Web usage and machine learning data sets.

Suggested Citation

  • Zhiqiang Zheng & Balaji Padmanabhan, 2006. "Selectively Acquiring Customer Information: A New Data Acquisition Problem and an Active Learning-Based Solution," Management Science, INFORMS, vol. 52(5), pages 697-712, May.
  • Handle: RePEc:inm:ormnsc:v:52:y:2006:i:5:p:697-712
    DOI: 10.1287/mnsc.1050.0488
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mnsc.1050.0488
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mnsc.1050.0488?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Rosenberger, William F. & Hu, Mingxiu, 2002. "On the use of generalized linear models following a sequential design," Statistics & Probability Letters, Elsevier, vol. 56(2), pages 155-161, January.
    2. Young-Hoon Park & Peter S. Fader, 2004. "Modeling Browsing Behavior at Multiple Websites," Marketing Science, INFORMS, vol. 23(3), pages 280-303, May.
    3. Kallberg, Jarl G. & Udell, Gregory F., 2003. "The value of private sector business credit information sharing: The US case," Journal of Banking & Finance, Elsevier, vol. 27(3), pages 449-469, March.
    4. J. C. Moore & H. R. Rao & A. Whinston & K. Nam & T. S. Raghu, 1997. "Information Acquisition Policies for Resource Allocation Among Multiple Agents," Information Systems Research, INFORMS, vol. 8(2), pages 151-170, June.
    5. Fiona Steele & James Brown & Ray Chambers, 2002. "A controlled donor imputation system for a one‐number census," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 165(3), pages 495-522, October.
    6. Vijay S. Mookerjee & Michael V. Mannino, 1997. "Redesigning Case Retrieval to Reduce Information Acquisition Costs," Information Systems Research, INFORMS, vol. 8(1), pages 51-68, March.
    7. T. M. F. Smith, 1993. "Populations and Selection: Limitations of Statistics," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 156(2), pages 145-166, March.
    8. Vijay S. Mookerjee & Brian L. Dos Santos, 1993. "Inductive Expert System Design: Maximizing System Value," Information Systems Research, INFORMS, vol. 4(2), pages 111-140, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Maytal Saar-Tsechansky & Prem Melville & Foster Provost, 2009. "Active Feature-Value Acquisition," Management Science, INFORMS, vol. 55(4), pages 664-684, April.
    2. Zhiqiang (Eric) Zheng & Peter Fader & Balaji Padmanabhan, 2012. "From Business Intelligence to Competitive Intelligence: Inferring Competitive Measures Using Augmented Site-Centric Data," Information Systems Research, INFORMS, vol. 23(3-part-1), pages 698-720, September.
    3. Meghana Deodhar & Joydeep Ghosh & Maytal Saar-Tsechansky & Vineet Keshari, 2017. "Active Learning with Multiple Localized Regression Models," INFORMS Journal on Computing, INFORMS, vol. 29(3), pages 503-522, August.
    4. Jing Wang & Panagiotis G. Ipeirotis & Foster Provost, 2017. "Cost-Effective Quality Assurance in Crowd Labeling," Information Systems Research, INFORMS, vol. 28(1), pages 137-158, March.
    5. Rajkumar Venkatesan & Alexander Bleier & Werner Reinartz & Nalini Ravishanker, 2019. "Improving customer profit predictions with customer mindset metrics through multiple overimputation," Journal of the Academy of Marketing Science, Springer, vol. 47(5), pages 771-794, September.
    6. Yingfei Wang & Inbal Yahav & Balaji Padmanabhan, 2024. "Smart Testing with Vaccination: A Bandit Algorithm for Active Sampling for Managing COVID-19," Information Systems Research, INFORMS, vol. 35(1), pages 120-144, March.
    7. Alain Bensoussan & Radha Mookerjee & Vijay Mookerjee & Wei T. Yue, 2009. "Maintaining Diagnostic Knowledge-Based Systems: A Control-Theoretic Approach," Management Science, INFORMS, vol. 55(2), pages 294-310, February.
    8. R Fildes & K Nikolopoulos & S F Crone & A A Syntetos, 2008. "Forecasting and operational research: a review," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 59(9), pages 1150-1172, September.
    9. Ransome Epie Bawack & Samuel Fosso Wamba & Kevin Daniel André Carillo & Shahriar Akter, 2022. "Artificial intelligence in E-Commerce: a bibliometric study and literature review," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(1), pages 297-338, March.
    10. H-V Seow, 2010. "Question selection responding to information on customers from heterogeneous populations to select offers that maximize expected profit," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(3), pages 443-454, March.
    11. Kaiquan Xu & Stephen Shaoyi Liao & Raymond Y. K. Lau & J. Leon Zhao, 2014. "Effective Active Learning Strategies for the Use of Large-Margin Classifiers in Semantic Annotation: An Optimal Parameter Discovery Perspective," INFORMS Journal on Computing, INFORMS, vol. 26(3), pages 461-483, August.
    12. Nigel Melville & Michael McQuaid, 2012. "Research Note ---Generating Shareable Statistical Databases for Business Value: Multiple Imputation with Multimodal Perturbation," Information Systems Research, INFORMS, vol. 23(2), pages 559-574, June.
    13. Wolfgang Ketter & Karsten Schroer & Konstantina Valogianni, 2023. "Information Systems Research for Smart Sustainable Mobility: A Framework and Call for Action," Information Systems Research, INFORMS, vol. 34(3), pages 1045-1065, September.
    14. Zhiyuan Wang & Zhiqiang (Eric) Zheng & Wei Jiang & Shaojie Tang, 2021. "Blockchain‐Enabled Data Sharing in Supply Chains: Model, Operationalization, and Tutorial," Production and Operations Management, Production and Operations Management Society, vol. 30(7), pages 1965-1985, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Voorberg, S. & van Jaarsveld, W. & Eshuis, R. & van Houtum, G.J., 2023. "Information acquisition for service contract quotations made by repair shops," European Journal of Operational Research, Elsevier, vol. 305(3), pages 1166-1177.
    2. Vijay S. Mookerjee & Michael V. Mannino, 2000. "Mean-Risk Trade-Offs in Inductive Expert Systems," Information Systems Research, INFORMS, vol. 11(2), pages 137-158, June.
    3. P Pendharkar, 2009. "Misclassification cost minimizing fitness functions for genetic algorithm-based artificial neural network classifiers," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 60(8), pages 1123-1134, August.
    4. Michael V. Mannino & Vijay S. Mookerjee, 1999. "Optimizing Expert Systems: Heuristics for Efficiently Generating Low-Cost Information Acquisition Strategies," INFORMS Journal on Computing, INFORMS, vol. 11(3), pages 278-291, August.
    5. Zhiqiang (Eric) Zheng & Peter Fader & Balaji Padmanabhan, 2012. "From Business Intelligence to Competitive Intelligence: Inferring Competitive Measures Using Augmented Site-Centric Data," Information Systems Research, INFORMS, vol. 23(3-part-1), pages 698-720, September.
    6. Parag Pendharkar & Sudhir Nanda, 2006. "A misclassification cost‐minimizing evolutionary–neural classification approach," Naval Research Logistics (NRL), John Wiley & Sons, vol. 53(5), pages 432-447, August.
    7. Fabbri, Daniela & Menichini, Anna Maria C., 2016. "The commitment problem of secured lending," Journal of Financial Economics, Elsevier, vol. 120(3), pages 561-584.
    8. Michalski, Tomasz & Ors, Evren, 2012. "(Interstate) Banking and (interstate) trade: Does real integration follow financial integration?," Journal of Financial Economics, Elsevier, vol. 104(1), pages 89-117.
    9. Baah Aye Kusi & Elikplimi Komla Agbloyor & Vera Ogeh Fiador & Kofi Achampong Osei, 2016. "Does Information Sharing Promote or Detract from Bank Returns: Evidence from Ghana," African Development Review, African Development Bank, vol. 28(3), pages 332-343, September.
    10. Christa Hainz, 2011. "Measuring Information Sharing in Credit Markets," ifo DICE Report, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, vol. 9(1), pages 21-27, 05.
    11. Samuel Fosu & Albert Danso & Henry Agyei-Boapeah & Collins G. Ntim & Emmanuel Adegbite, 2020. "Credit information sharing and loan default in developing countries: the moderating effect of banking market concentration and national governance quality," Review of Quantitative Finance and Accounting, Springer, vol. 55(1), pages 55-103, July.
    12. Ongena, Steven & Popov, Alexander & Udell, Gregory F., 2013. "“When the cat's away the mice will play”: Does regulation at home affect bank risk-taking abroad?," Journal of Financial Economics, Elsevier, vol. 108(3), pages 727-750.
    13. Barry L. Bayus, 2013. "Crowdsourcing New Product Ideas over Time: An Analysis of the Dell IdeaStorm Community," Management Science, INFORMS, vol. 59(1), pages 226-244, June.
    14. Beck, Thorsten & De Jonghe, Olivier & Schepens, Glenn, 2013. "Bank competition and stability: Cross-country heterogeneity," Journal of Financial Intermediation, Elsevier, vol. 22(2), pages 218-244.
    15. Martin Brown & Christian Zehnder, 2007. "Credit Reporting, Relationship Banking, and Loan Repayment," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 39(8), pages 1883-1918, December.
    16. Geoffroy Enjolras & Philippe Madiès, 2019. "The determinants of loan acceptance: a case study of French farms," Economics Bulletin, AccessEcon, vol. 39(1), pages 358-371.
    17. Berger, Allen N. & Cerqueiro, Geraldo & Penas, María F., 2011. "Does debtor protection really protect debtors? Evidence from the small business credit market," Journal of Banking & Finance, Elsevier, vol. 35(7), pages 1843-1857, July.
    18. Alberto Bennardo & Marco Pagano & Salvatore Piccolo, 2015. "Multiple Bank Lending, Creditor Rights, and Information Sharing," Review of Finance, European Finance Association, vol. 19(2), pages 519-570.
    19. Kim, Chul & Jun, Duk Bin & Park, Sungho, 2018. "Capturing flexible correlations in multiple-discrete choice outcomes using copulas," International Journal of Research in Marketing, Elsevier, vol. 35(1), pages 34-59.
    20. Tevfik Aktekin & Dev K. Dutta & Jeffrey E. Sohl, 2018. "Entrepreneurial firms and financial attractiveness for securing debt capital: a Bayesian analysis," Venture Capital, Taylor & Francis Journals, vol. 20(1), pages 27-50, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:52:y:2006:i:5:p:697-712. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.