Author
Listed:
- Mohammad Zhalechian
(Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, Michigan 48109)
- Esmaeil Keyvanshokooh
(Department of Information and Operations Management, Mayes Business School, Texas A&M University, College Station, Texas 77845)
- Cong Shi
(Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, Michigan 48109)
- Mark P. Van Oyen
(Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, Michigan 48109)
Abstract
Joint online learning and resource allocation is a fundamental problem inherent in many applications. In a general setting, heterogeneous customers arrive sequentially, each of which can be allocated to a resource in an online fashion. Customers stochastically consume the resources, allocations yield stochastic rewards, and the system receives feedback outcomes with delay. We introduce a generic framework that judiciously synergizes online learning with a broad class of online resource allocation mechanisms, where the sequence of customer contexts is adversarial, and the customer reward and the resource consumption are stochastic and unknown. First, we propose an online algorithm for a general resource allocation problem, called personalized resource allocation while learning with delay, which strikes a three-way balance between exploration, exploitation, and hedging against adversarial arrival sequence. We provide a performance guarantee for this online algorithm in terms of Bayesian regret. Next, we develop our second online algorithm for an advance scheduling problem, called personalized advance scheduling while learning with delay (PAS-LD), and evaluate its theoretical performance. The PAS-LD algorithm has a more delicate structure and offers multiday scheduling while accounting for the no-show behavior of customers. We demonstrate the practicality and efficacy of our PAS-LD algorithm using clinical data from a partner health system. Our results show that the proposed algorithm provides promising results compared with several benchmark policies.
Suggested Citation
Mohammad Zhalechian & Esmaeil Keyvanshokooh & Cong Shi & Mark P. Van Oyen, 2022.
"Online Resource Allocation with Personalized Learning,"
Operations Research, INFORMS, vol. 70(4), pages 2138-2161, July.
Handle:
RePEc:inm:oropre:v:70:y:2022:i:4:p:2138-2161
DOI: 10.1287/opre.2022.2294
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:oropre:v:70:y:2022:i:4:p:2138-2161. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.