IDEAS home Printed from https://ideas.repec.org/a/inm/oropre/v42y1994i1p175-183.html
   My bibliography  Save this article

Mean-Variance Tradeoffs in an Undiscounted MDP

Author

Listed:
  • Matthew J. Sobel

    (State University of New York at Stony Brook, Stony Brook, New York)

Abstract

A stationary policy and an initial state in an MDP (Markov decision process) induce a stationary probability distribution of the reward. The problem analyzed here is generating the Pareto optima in the sense of high mean and low variance of the stationary distribution. In the unichain case, Pareto optima can be computed either with policy improvement or with a linear program having the same number of variables and one more constraint than the formulation for gain-rate optimization. The same linear program suffices in the multichain case if the ergodic class is an element of choice.

Suggested Citation

  • Matthew J. Sobel, 1994. "Mean-Variance Tradeoffs in an Undiscounted MDP," Operations Research, INFORMS, vol. 42(1), pages 175-183, February.
  • Handle: RePEc:inm:oropre:v:42:y:1994:i:1:p:175-183
    DOI: 10.1287/opre.42.1.175
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/opre.42.1.175
    Download Restriction: no

    File URL: https://libkey.io/10.1287/opre.42.1.175?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. René Caldentey & Martin B. Haugh, 2009. "Supply Contracts with Financial Hedging," Operations Research, INFORMS, vol. 57(1), pages 47-65, February.
    2. Jun Fei & Eugene Feinberg, 2013. "Variance minimization for constrained discounted continuous-time MDPs with exponentially distributed stopping times," Annals of Operations Research, Springer, vol. 208(1), pages 433-450, September.
    3. Li Xia, 2020. "Risk‐Sensitive Markov Decision Processes with Combined Metrics of Mean and Variance," Production and Operations Management, Production and Operations Management Society, vol. 29(12), pages 2808-2827, December.
    4. Chunling Luo & Chin Hon Tan, 2020. "Almost Stochastic Dominance for Most Risk-Averse Decision Makers," Decision Analysis, INFORMS, vol. 17(2), pages 169-184, June.
    5. Alessandro Arlotto & Noah Gans & J. Michael Steele, 2014. "Markov Decision Problems Where Means Bound Variances," Operations Research, INFORMS, vol. 62(4), pages 864-875, August.
    6. Ma, Shuai & Ma, Xiaoteng & Xia, Li, 2023. "A unified algorithm framework for mean-variance optimization in discounted Markov decision processes," European Journal of Operational Research, Elsevier, vol. 311(3), pages 1057-1067.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:oropre:v:42:y:1994:i:1:p:175-183. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.