Approachability in Stackelberg Stochastic Games with Vector Costs
Author
Abstract
Suggested Citation
DOI: 10.1007/s13235-016-0198-y
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
References listed on IDEAS
- Shie Mannor & Nahum Shimkin, 2003. "The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 28(2), pages 327-345, May.
- Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2006.
"Stochastic Approximations and Differential Inclusions, Part II: Applications,"
Mathematics of Operations Research, INFORMS, vol. 31(4), pages 673-695, November.
- Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2005. "Stochastic Approximations and Differential Inclusions; Part II: Applications," Working Papers hal-00242974, HAL.
- Milman, Emanuel, 2006. "Approachable sets of vector payoffs in stochastic games," Games and Economic Behavior, Elsevier, vol. 56(1), pages 135-147, July.
- Huizhen Yu & Dimitri P. Bertsekas, 2013. "On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems," Mathematics of Operations Research, INFORMS, vol. 38(2), pages 209-227, May.
- Michel Benaim & Josef Hofbauer & Sylvain Sorin, 2005. "Stochastic Approximations and Differential Inclusions II: Applications," Levine's Bibliography 784828000000000098, UCLA Department of Economics.
- Eyal Even-Dar & Sham. M. Kakade & Yishay Mansour, 2009. "Online Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 34(3), pages 726-736, August.
- Jia Yuan Yu & Shie Mannor & Nahum Shimkin, 2009. "Markov Decision Processes with Arbitrary Reward Processes," Mathematics of Operations Research, INFORMS, vol. 34(3), pages 737-757, August.
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Soham R. Phade & Venkat Anantharam, 2023. "Learning in Games with Cumulative Prospect Theoretic Preferences," Dynamic Games and Applications, Springer, vol. 13(1), pages 265-306, March.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Mathieu Faure & Gregory Roth, 2010. "Stochastic Approximations of Set-Valued Dynamical Systems: Convergence with Positive Probability to an Attractor," Mathematics of Operations Research, INFORMS, vol. 35(3), pages 624-640, August.
- Akimoto, Youhei & Auger, Anne & Hansen, Nikolaus, 2022. "An ODE method to prove the geometric convergence of adaptive stochastic algorithms," Stochastic Processes and their Applications, Elsevier, vol. 145(C), pages 269-307.
- Shiau Hong Lim & Huan Xu & Shie Mannor, 2016. "Reinforcement Learning in Robust Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1325-1353, November.
- Andrey Bernstein & Shie Mannor & Nahum Shimkin, 2014. "Opportunistic Approachability and Generalized No-Regret Problems," Mathematics of Operations Research, INFORMS, vol. 39(4), pages 1057-1083, November.
- Saeed Hadikhanloo & Rida Laraki & Panayotis Mertikopoulos & Sylvain Sorin, 2022. "Learning in nonatomic games, part Ⅰ: Finite action spaces and population games," Post-Print hal-03767995, HAL.
- Andriy Zapechelnyuk, 2009. "Limit Behavior of No-regret Dynamics," Discussion Papers 21, Kyiv School of Economics.
- Sylvain Sorin, 2023. "Continuous Time Learning Algorithms in Optimization and Game Theory," Dynamic Games and Applications, Springer, vol. 13(1), pages 3-24, March.
- Viossat, Yannick & Zapechelnyuk, Andriy, 2013.
"No-regret dynamics and fictitious play,"
Journal of Economic Theory, Elsevier, vol. 148(2), pages 825-842.
- Yannick Viossat & Andriy Zapechelnyuk, 2013. "No-regret Dynamics and Fictitious Play," Post-Print hal-00713871, HAL.
- Michel Benaïm & Mathieu Faure, 2013.
"Consistency of Vanishingly Smooth Fictitious Play,"
Mathematics of Operations Research, INFORMS, vol. 38(3), pages 437-450, August.
- Michel Benaïm & Mathieu Faure, 2013. "Consistency of Vanishingly Smooth Fictitious Play," Post-Print hal-01498243, HAL.
- Michel Benaim & Olivier Raimond, 2007. "Simulated Annealing, Vertex-Reinforced Random Walks and Learning in Games," Levine's Bibliography 122247000000001702, UCLA Department of Economics.
- Josef Hofbauer & Sylvain Sorin & Yannick Viossat, 2009.
"Time Average Replicator and Best-Reply Dynamics,"
Mathematics of Operations Research, INFORMS, vol. 34(2), pages 263-269, May.
- Josef Hofbauer & Sylvain Sorin & Yannick Viossat, 2009. "Time Average Replicator and Best Reply Dynamics," Post-Print hal-00360767, HAL.
- Jason M. Altschuler & Kunal Talwar, 2021. "Online Learning over a Finite Action Set with Limited Switching," Mathematics of Operations Research, INFORMS, vol. 46(1), pages 179-203, February.
- Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2012. "Perturbations of Set-Valued Dynamical Systems, with Applications to Game Theory," Dynamic Games and Applications, Springer, vol. 2(2), pages 195-205, June.
- Bervoets, Sebastian & Faure, Mathieu, 2020.
"Convergence in games with continua of equilibria,"
Journal of Mathematical Economics, Elsevier, vol. 90(C), pages 25-30.
- Sebastian Bervoets & Mathieu Faure, 2020. "Convergence in games with continua of equilibria," Post-Print hal-02964989, HAL.
- Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009.
"Learning in games with unstable equilibria,"
Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
- Ed Hopkins & Josef Hofbauer & Michel Benaim, 2005. "Learning in Games with Unstable Equilibria," Edinburgh School of Economics Discussion Paper Series 135, Edinburgh School of Economics, University of Edinburgh.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2006. "Learning in Games with Unstable Equilibria," Levine's Bibliography 321307000000000547, UCLA Department of Economics.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2005. "Learning in Games with Unstable Equilibria," Levine's Bibliography 784828000000000609, UCLA Department of Economics.
- Fournier, Gaëtan & Kuperwasser, Eden & Munk, Orin & Solan, Eilon & Weinbaum, Avishay, 2021.
"Approachability with constraints,"
European Journal of Operational Research, Elsevier, vol. 292(2), pages 687-695.
- Gaëtan Fournier & Eden Kuperwasser & Orin Munk & Eilon Solan & Avishay Weinbaum, 2021. "Approachability with constraints," Post-Print hal-03138536, HAL.
- Rad Niazadeh & Negin Golrezaei & Joshua Wang & Fransisca Susan & Ashwinkumar Badanidiyuru, 2023. "Online Learning via Offline Greedy Algorithms: Applications in Market Design and Optimization," Management Science, INFORMS, vol. 69(7), pages 3797-3817, July.
- Cason, Timothy N. & Friedman, Daniel & Hopkins, Ed, 2010.
"Testing the TASP: An experimental investigation of learning in games with unstable equilibria,"
Journal of Economic Theory, Elsevier, vol. 145(6), pages 2309-2331, November.
- Timothy N. Cason & Daniel Friedman & Ed Hopkins, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Edinburgh School of Economics Discussion Paper Series 188, Edinburgh School of Economics, University of Edinburgh.
- Cason, Timothy N. & Friedman, Daniel & Hopkins, Ed H, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Santa Cruz Department of Economics, Working Paper Series qt8kp6c049, Department of Economics, UC Santa Cruz.
- Timothy N. Cason & Daniel Friedman & Ed Hopkins, 2010. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Purdue University Economics Working Papers 1233, Purdue University, Department of Economics.
- Cason, Timothy N. & Friedman, Daniel UC & Hopkins, Ed, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," SIRE Discussion Papers 2009-15, Scottish Institute for Research in Economics (SIRE).
- Eunji Lim, 2011. "On the Convergence Rate for Stochastic Approximation in the Nonsmooth Setting," Mathematics of Operations Research, INFORMS, vol. 36(3), pages 527-537, August.
- Mannor, Shie & Shimkin, Nahum, 2008. "Regret minimization in repeated matrix games with variable stage duration," Games and Economic Behavior, Elsevier, vol. 63(1), pages 227-258, May.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:dyngam:v:7:y:2017:i:3:d:10.1007_s13235-016-0198-y. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.