IDEAS home Printed from https://ideas.repec.org/a/inm/ormnsc/v70y2024i8p5115-5130.html
   My bibliography  Save this article

Ensemble Experiments to Optimize Interventions Along the Customer Journey: A Reinforcement Learning Approach

Author

Listed:
  • Yicheng Song

    (Carlson School of Management, University of Minnesota, Minneapolis, Minnesota 55455)

  • Tianshu Sun

    (Center for Digital Transformation, Cheung Kong Graduate School of Business, Beijing 100006, China; Marshall School of Business, University of Southern California, Los Angeles, California 90089)

Abstract

Firms adopt randomized experiments to evaluate various interventions (e.g., website design, creative content, and pricing). However, most randomized experiments are designed to identify the impact of one specific intervention. The literature on randomized experiments lacks a holistic approach to optimize a sequence of interventions along the customer journey. Specifically, locally optimal interventions unveiled by randomized experiments might be globally suboptimal when considering their interdependence as well as the long-term rewards. Fortunately, the accumulation of a large number of historical experiments creates exogenous interventions at different stages along the customer journey and provides a new opportunity. This study integrates multiple experiments within the reinforcement learning (RL) framework to tackle the questions that cannot be answered by stand-alone randomized experiments. How can we learn optimal policy with a sequence of interventions along the customer journey based on an ensemble of historical experiments? Additionally, how can we learn from multiple historical experiments to guide future intervention trials? We propose a Bayesian recurrent Q -network model that leverages the exogenous interventions from multiple experiments to learn their effectiveness at different stages of the customer journey and optimize them for long-term rewards. Beyond optimization within the existing interventions, the Bayesian model also estimates the distribution of rewards, which can guide subject allocation in the design of future experiments to optimally balance exploration and exploitation. In summary, the proposed model creates a two-way complementarity between RL and randomized experiments, and thus, it provides a holistic approach to learning and optimizing interventions along the customer journey.

Suggested Citation

  • Yicheng Song & Tianshu Sun, 2024. "Ensemble Experiments to Optimize Interventions Along the Customer Journey: A Reinforcement Learning Approach," Management Science, INFORMS, vol. 70(8), pages 5115-5130, August.
  • Handle: RePEc:inm:ormnsc:v:70:y:2024:i:8:p:5115-5130
    DOI: 10.1287/mnsc.2023.4914
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mnsc.2023.4914
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mnsc.2023.4914?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:70:y:2024:i:8:p:5115-5130. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.