IDEAS home Printed from https://ideas.repec.org/a/spr/mathme/v62y2005i3p375-386.html
   My bibliography  Save this article

Zero-sum constrained stochastic games with independent state processes

Author

Listed:
  • Eitan Altman
  • Konstantin Avrachenkov
  • Richard Marquez
  • Gregory Miller

Abstract

We consider a zero-sum stochastic game with side constraints for both players with a special structure. There are two independent controlled Markov chains, one for each player. The transition probabilities of the chain associated with a player as well as the related side constraints depend only on the actions of the corresponding player; the side constraints also depend on the player’s controlled chain. The global cost that player 1 wishes to minimize and that player 2 wishes to maximize, depend however on the actions and Markov chains of both players. We obtain a linear programming formulations that allows to compute the value and saddle point policies for this problem. We illustrate the theoretical results through a zero-sum stochastic game in wireless networks in which each player has power constraints Copyright Springer-Verlag 2005

Suggested Citation

  • Eitan Altman & Konstantin Avrachenkov & Richard Marquez & Gregory Miller, 2005. "Zero-sum constrained stochastic games with independent state processes," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 62(3), pages 375-386, December.
  • Handle: RePEc:spr:mathme:v:62:y:2005:i:3:p:375-386
    DOI: 10.1007/s00186-005-0034-4
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s00186-005-0034-4
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s00186-005-0034-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gomez-Ramirez, E. & Najim, K. & Poznyak, A. S., 2003. "Saddle-point calculation for constrained finite Markov chains," Journal of Economic Dynamics and Control, Elsevier, vol. 27(10), pages 1833-1853, August.
    2. A. Hordijk & L. C. M. Kallenberg, 1984. "Constrained Undiscounted Stochastic Dynamic Programming," Mathematics of Operations Research, INFORMS, vol. 9(2), pages 276-289, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Flesch, J. & Schoenmakers, G.M. & Vrieze, K., 2008. "Stochastic games on a product state space: the periodic case," Research Memorandum 016, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
    2. János Flesch & Gijs Schoenmakers & Koos Vrieze, 2008. "Stochastic Games on a Product State Space," Mathematics of Operations Research, INFORMS, vol. 33(2), pages 403-420, May.
    3. Compte, Olivier & Postlewaite, Andrew, 2015. "Plausible cooperation," Games and Economic Behavior, Elsevier, vol. 91(C), pages 45-59.
    4. János Flesch & Gijs Schoenmakers & Koos Vrieze, 2009. "Stochastic games on a product state space: the periodic case," International Journal of Game Theory, Springer;Game Theory Society, vol. 38(2), pages 263-289, June.
    5. Flesch, J. & Schoenmakers, G.M. & Vrieze, K., 2007. "Stochastic games on a product state space," Research Memorandum 010, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
    6. Olivier Compte & Andrew Postlewaite, 2007. "Effecting Cooperation," PIER Working Paper Archive 09-019, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 29 May 2009.
    7. Andrew Postlewaite & Olivier Compte, 2008. "Repeated Relationships with Limits on Information Processing," PIER Working Paper Archive 08-026, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lodewijk Kallenberg, 2013. "Derman’s book as inspiration: some results on LP for MDPs," Annals of Operations Research, Springer, vol. 208(1), pages 63-94, September.
    2. Dijk, N.M. van, 1989. "Truncation of Markov decision problems with a queueing network overflow control application," Serie Research Memoranda 0065, VU University Amsterdam, Faculty of Economics, Business Administration and Econometrics.
    3. Vladimir Ejov & Jerzy A. Filar & Michael Haythorpe & Giang T. Nguyen, 2009. "Refined MDP-Based Branch-and-Fix Algorithm for the Hamiltonian Cycle Problem," Mathematics of Operations Research, INFORMS, vol. 34(3), pages 758-768, August.
    4. Nielsen, Lars Relund & Kristensen, Anders Ringgaard, 2006. "Finding the K best policies in a finite-horizon Markov decision process," European Journal of Operational Research, Elsevier, vol. 175(2), pages 1164-1179, December.
    5. Vladimir Ejov & Jerzy A. Filar & Minh-Tuan Nguyen, 2004. "Hamiltonian Cycles and Singularly Perturbed Markov Chains," Mathematics of Operations Research, INFORMS, vol. 29(1), pages 114-131, February.
    6. Dmitry Krass & O. J. Vrieze, 2002. "Achieving Target State-Action Frequencies in Multichain Average-Reward Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 27(3), pages 545-566, August.
    7. Vivek S. Borkar & Vladimir Gaitsgory, 2019. "Linear Programming Formulation of Long-Run Average Optimal Control Problem," Journal of Optimization Theory and Applications, Springer, vol. 181(1), pages 101-125, April.
    8. Arie Leizarowitz, 2003. "An Algorithm to Identify and Compute Average Optimal Policies in Multichain Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 28(3), pages 553-586, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:mathme:v:62:y:2005:i:3:p:375-386. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.