Author
Listed:
- Yonatan Gur
(Stanford University, Stanford, California 94305)
- Ahmadreza Momeni
(Stanford University, Stanford, California 94305)
Abstract
Problem definition: Sequential experiments that are deployed in a broad range of practices are characterized by an exploration-exploitation trade-off that is well understood when in each time period feedback is received only on the action that was selected in that period. However, in many practical settings, additional information may become available between decision epochs. We study the performance that one may achieve when leveraging such auxiliary information and the design of algorithms that effectively do so without prior knowledge of the information arrival process. Methodology/results: Our formulation considers a broad class of distributions that are informative about rewards from actions and allows auxiliary observations from these distributions to arrive according to an arbitrary and a priori unknown process. When it is known how to map auxiliary observations to reward estimates, we characterize the best achievable performance as a function of the information arrival process. In terms of achieving optimal performance, we establish that upper confidence bound and Thompson sampling algorithms possess natural robustness with respect to the information arrival process, which uncovers a novel property of these popular algorithms. When the mappings connecting auxiliary observations and rewards are a priori unknown, we characterize a necessary and sufficient condition under which auxiliary information allows performance improvement and devise an adaptive policy (termed 2UCBs) that guarantees near optimality. We use a data set from a large media site to analyze the value that may be captured by leveraging auxiliary observations in the design of content recommendations. Managerial implications: Our study highlights the importance of utilizing auxiliary information in the design of sequential experiments and characterizes how salient features of the auxiliary information stream impact performance. Our study also emphasizes the risk in processing auxiliary information using nonadaptive approaches that are predicated on correct interpretation of this information, as opposed to deploying flexible, adaptive methods.
Suggested Citation
Yonatan Gur & Ahmadreza Momeni, 2022.
"Adaptive Sequential Experiments with Unknown Information Arrival Processes,"
Manufacturing & Service Operations Management, INFORMS, vol. 24(5), pages 2666-2684, September.
Handle:
RePEc:inm:ormsom:v:24:y:2022:i:5:p:2666-2684
DOI: 10.1287/msom.2022.1116
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormsom:v:24:y:2022:i:5:p:2666-2684. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.