IDEAS home Printed from https://ideas.repec.org/a/inm/ormnsc/v71y2025i2p1779-1802.html
   My bibliography  Save this article

Self-Adapting Network Relaxations for Weakly Coupled Markov Decision Processes

Author

Listed:
  • Selvaprabu Nadarajah

    (Information and Decision Sciences, University of Illinois Chicago, Chicago, Illinois 60607)

  • Andre A. Cire

    (Department of Management, University of Toronto Scarborough & Rotman School of Management, Toronto, Ontario M5S 1A1, Canada)

Abstract

High-dimensional weakly coupled Markov decision processes (WDPs) arise in dynamic decision making and reinforcement learning, decomposing into smaller Markov decision processes (MDPs) when linking constraints are relaxed. The Lagrangian relaxation of WDPs (LAG) exploits this property to compute policies and (optimistic) bounds efficiently; however, dualizing linking constraints averages away combinatorial information. We introduce feasibility network relaxations (FNRs), a new class of linear programming relaxations that exactly represents the linking constraints. We develop a procedure to obtain the unique minimally sized relaxation, which we refer to as self-adapting FNR, as its size automatically adjusts to the structure of the linking constraints. Our analysis informs model selection: (i) the self-adapting FNR provides (weakly) stronger bounds than LAG, is polynomially sized when linking constraints admit a tractable network representation, and can even be smaller than LAG, and (ii) self-adapting FNR provides bounds and policies that match the approximate linear programming (ALP) approach but is substantially smaller in size than the ALP formulation and a recent alternative Lagrangian that is equivalent to ALP. We perform numerical experiments on constrained dynamic assortment and preemptive maintenance applications. Our results show that self-adapting FNR significantly improves upon LAG in terms of policy performance and/or bounds, while being an order of magnitude faster than an alternative Lagrangian and ALP, which are unsolvable in several instances.

Suggested Citation

  • Selvaprabu Nadarajah & Andre A. Cire, 2025. "Self-Adapting Network Relaxations for Weakly Coupled Markov Decision Processes," Management Science, INFORMS, vol. 71(2), pages 1779-1802, February.
  • Handle: RePEc:inm:ormnsc:v:71:y:2025:i:2:p:1779-1802
    DOI: 10.1287/mnsc.2022.01108
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mnsc.2022.01108
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mnsc.2022.01108?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:71:y:2025:i:2:p:1779-1802. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.