On the robustness of learning in games with stochastically perturbed payoff observations

Author

Bravo, Mario
Mertikopoulos, Panayotis

Abstract

Motivated by the scarcity of accurate payoff feedback in practical applications of game theory, we examine a class of learning dynamics where players adjust their choices based on past payoff observations that are subject to noise and random disturbances. First, in the single-player case (corresponding to an agent trying to adapt to an arbitrarily changing environment), we show that the stochastic dynamics under study lead to no regret almost surely, irrespective of the noise level in the player's observations. In the multi-player case, we find that dominated strategies become extinct and we show that strict Nash equilibria are stochastically stable and attracting; conversely, if a state is stable or attracting with positive probability, then it is a Nash equilibrium. Finally, we provide an averaging principle for 2-player games, and we show that in zero-sum games with an interior equilibrium, time averages converge to Nash equilibrium for any noise level.
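The single-player no-regret claim can be illustrated numerically. The sketch below is a discrete-time analogue only, assuming exponential weights with a 1/sqrt(t) step size and zero-mean Gaussian observation noise; it is not the stochastic dynamics analyzed in the paper. The function name `noisy_exponential_weights` and its parameters are hypothetical, introduced purely for illustration.

```python
import math
import random


def noisy_exponential_weights(payoff_fn, n_actions, horizon, noise_std=1.0, seed=0):
    """Illustrative sketch (an assumption, not the paper's model).

    payoff_fn(t) returns the true payoff vector at step t; the learner only
    sees each payoff corrupted by zero-mean Gaussian noise.
    Returns the final mixed strategy and the average regret, measured
    against the true (noise-free) payoffs.
    """
    rng = random.Random(seed)
    scores = [0.0] * n_actions        # cumulative noisy payoff observations
    cum_payoffs = [0.0] * n_actions   # cumulative true payoff of each action
    realized = 0.0                    # cumulative true expected payoff earned
    strategy = [1.0 / n_actions] * n_actions

    for t in range(1, horizon + 1):
        eta = 1.0 / math.sqrt(t)      # decreasing step size (assumed)
        top = max(scores)             # subtract the max for numerical stability
        weights = [math.exp(eta * (s - top)) for s in scores]
        total = sum(weights)
        strategy = [w / total for w in weights]

        payoffs = payoff_fn(t)
        realized += sum(x * v for x, v in zip(strategy, payoffs))
        for a in range(n_actions):
            cum_payoffs[a] += payoffs[a]
            scores[a] += payoffs[a] + rng.gauss(0.0, noise_std)

    avg_regret = (max(cum_payoffs) - realized) / horizon
    return strategy, avg_regret


if __name__ == "__main__":
    # Hypothetical environment: three actions with fixed true payoffs.
    strategy, avg_regret = noisy_exponential_weights(
        lambda t: [0.2, 0.8, 0.5], n_actions=3, horizon=5000
    )
    print("final strategy:", strategy)
    print("average regret:", avg_regret)
```

Regret here is computed from the true payoffs, while the learner updates only on noisy ones, so the sketch shows one way in which observation noise need not prevent average regret from becoming small. Passing a time-dependent `payoff_fn` gives a changing environment.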

Suggested Citation

Bravo, Mario & Mertikopoulos, Panayotis, 2017. "On the robustness of learning in games with stochastically perturbed payoff observations," Games and Economic Behavior, Elsevier, vol. 103(C), pages 41-66.

Handle: RePEc:eee:gamebe:v:103:y:2017:i:c:p:41-66
DOI: 10.1016/j.geb.2016.06.004

References listed on IDEAS

Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
Cominetti, Roberto & Melo, Emerson & Sorin, Sylvain, 2010. "A payoff-based learning procedure and its application to traffic games," Games and Economic Behavior, Elsevier, vol. 70(1), pages 71-83, September.
Ed Hopkins, 2002. "Two Competing Models of How People Learn in Games," Econometrica, Econometric Society, vol. 70(6), pages 2141-2166, November.
- Ed Hopkins, 1999. "Two Competing Models of How People Learn in Games," Edinburgh School of Economics Discussion Paper Series 42, Edinburgh School of Economics, University of Edinburgh, revised Dec 2000.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," NajEcon Working Paper Reviews 625018000000000226, www.najecon.org.
- Ed Hopkins, 2000. "Two Competing Models of How People Learn in Games," Edinburgh School of Economics Discussion Paper Series 51, Edinburgh School of Economics, University of Edinburgh, revised Dec 2000.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," Levine's Working Paper Archive 625018000000000226, David K. Levine.
Borgers, Tilman & Sarin, Rajiv, 1997. "Learning Through Reinforcement and Replicator Dynamics," Journal of Economic Theory, Elsevier, vol. 77(1), pages 1-14, November.
- Tilman Börgers & Rajiv Sarin, "undated". "Learning Through Reinforcement and Replicator Dynamics," ELSE working papers 051, ESRC Centre on Economics Learning and Social Evolution.
- T. Borgers & R. Sarin, 2010. "Learning Through Reinforcement and Replicator Dynamics," Levine's Working Paper Archive 380, David K. Levine.
Anna Nagurney & Ding Zhang, 1997. "Projected Dynamical Systems in the Formulation, Stability Analysis, and Computation of Fixed-Demand Traffic Network Equilibria," Transportation Science, INFORMS, vol. 31(2), pages 147-158, May.
Samuelson, Larry & Zhang, Jianbo, 1992. "Evolutionary stability in asymmetric games," Journal of Economic Theory, Elsevier, vol. 57(2), pages 363-391, August.
Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.
Oyarzun, Carlos & Ruf, Johannes, 2014. "Convergence in models with bounded expected relative hazard rates," Journal of Economic Theory, Elsevier, vol. 154(C), pages 229-244.
Rustichini, Aldo, 1999. "Optimal Properties of Stimulus-Response Learning Models," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 244-273, October.
Michel Benaim & Josef Hofbauer & Sylvain Sorin, 2005. "Stochastic Approximations and Differential Inclusions II: Applications," Levine's Bibliography 784828000000000098, UCLA Department of Economics.
Friedman, Daniel, 1991. "Evolutionary Games in Economics," Econometrica, Econometric Society, vol. 59(3), pages 637-666, May.
Panayotis Mertikopoulos & William H. Sandholm, 2016. "Learning in Games via Reinforcement and Regularization," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1297-1324, November.
Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2005. "Stochastic Approximations and Differential Inclusions; Part II: Applications," Working Papers hal-00242974, HAL.
Cabrales, Antonio, 2000. "Stochastic Replicator Dynamics," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 41(2), pages 451-481, May.
- Antonio Cabrales, 1993. "Stochastic replicator dynamics," Economics Working Papers 54, Department of Economics and Business, Universitat Pompeu Fabra.
- A. Cabrales, 2010. "Stochastic Replicator Dynamics," Levine's Working Paper Archive 489, David K. Levine.
Lahkar, Ratul & Sandholm, William H., 2008. "The projection dynamic and the geometry of population games," Games and Economic Behavior, Elsevier, vol. 64(2), pages 565-590, November.
Michel Benaïm & Mathieu Faure, 2013. "Consistency of Vanishingly Smooth Fictitious Play," Mathematics of Operations Research, INFORMS, vol. 38(3), pages 437-450, August.
- Michel Benaïm & Mathieu Faure, 2013. "Consistency of Vanishingly Smooth Fictitious Play," Post-Print hal-01498243, HAL.
Nachbar, J H, 1990. "'Evolutionary' Selection Dynamics in Games: Convergence and Limit Properties," International Journal of Game Theory, Springer;Game Theory Society, vol. 19(1), pages 59-89.

Citations

Cited by:

Mertikopoulos, Panayotis & Sandholm, William H., 2018. "Riemannian game dynamics," Journal of Economic Theory, Elsevier, vol. 177(C), pages 315-364.
Li, Li & Xu, Zichun & Wang, Hui, 2020. "Stochastically perturbed payoff observations in an evolutionary game," Economics Letters, Elsevier, vol. 192(C).
Saeed Hadikhanloo & Rida Laraki & Panayotis Mertikopoulos & Sylvain Sorin, 2022. "Learning in nonatomic games, part Ⅰ: Finite action spaces and population games," Post-Print hal-03767995, HAL.
Masiliūnas, Aidas, 2023. "Learning in rent-seeking contests with payoff risk and foregone payoff information," Games and Economic Behavior, Elsevier, vol. 140(C), pages 50-72.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Panayotis Mertikopoulos & William H. Sandholm, 2016. "Learning in Games via Reinforcement and Regularization," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1297-1324, November.
Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
Sandholm, William H., 2015. "Population Games and Deterministic Evolutionary Dynamics," Handbook of Game Theory with Economic Applications, Elsevier.
Mertikopoulos, Panayotis & Sandholm, William H., 2018. "Riemannian game dynamics," Journal of Economic Theory, Elsevier, vol. 177(C), pages 315-364.
Saeed Hadikhanloo & Rida Laraki & Panayotis Mertikopoulos & Sylvain Sorin, 2022. "Learning in nonatomic games, part Ⅰ: Finite action spaces and population games," Post-Print hal-03767995, HAL.
Mertikopoulos, Panayotis & Sandholm, William H., 2024. "Nested replicator dynamics, nested logit choice, and similarity-based learning," Journal of Economic Theory, Elsevier, vol. 220(C).
Fabrizio Germano, 2007. "Stochastic Evolution of Rules for Playing Finite Normal Form Games," Theory and Decision, Springer, vol. 62(4), pages 311-333, May.
Pierre Coucheney & Bruno Gaujal & Panayotis Mertikopoulos, 2015. "Penalty-Regulated Dynamics and Robust Learning Procedures in Games," Mathematics of Operations Research, INFORMS, vol. 40(3), pages 611-633, March.
Reinoud Joosten, 2009. "Paul Samuelson's critique and equilibrium concepts in evolutionary game theory," Papers on Economics and Evolution 2009-16, Philipps University Marburg, Department of Geography.
Philippe Jehiel & Aviman Satpathy, 2024. "Learning to be Indifferent in Complex Decisions: A Coarse Payoff-Assessment Model," Papers 2412.09321, arXiv.org, revised Dec 2024.
Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009. "Learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
- Ed Hopkins & Josef Hofbauer & Michel Benaim, 2005. "Learning in Games with Unstable Equilibria," Edinburgh School of Economics Discussion Paper Series 135, Edinburgh School of Economics, University of Edinburgh.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2006. "Learning in Games with Unstable Equilibria," Levine's Bibliography 321307000000000547, UCLA Department of Economics.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2005. "Learning in Games with Unstable Equilibria," Levine's Bibliography 784828000000000609, UCLA Department of Economics.
Tsakas, Elias & Voorneveld, Mark, 2009. "The target projection dynamic," Games and Economic Behavior, Elsevier, vol. 67(2), pages 708-719, November.
- Tsakas, Elias & Voorneveld, Mark, 2007. "The target projection dynamic," SSE/EFI Working Paper Series in Economics and Finance 670, Stockholm School of Economics, revised 13 Aug 2007.
Demichelis, Stefano & Ritzberger, Klaus, 2003. "From evolutionary to strategic stability," Journal of Economic Theory, Elsevier, vol. 113(1), pages 51-75, November.
- DEMICHELIS, Stefano & RITZBERGER, Klaus, 2000. "From evolutionary to strategic stability," LIDAM Discussion Papers CORE 2000059, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
Cason, Timothy N. & Friedman, Daniel & Hopkins, Ed, 2010. "Testing the TASP: An experimental investigation of learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 145(6), pages 2309-2331, November.
- Timothy N. Cason & Daniel Friedman & Ed Hopkins, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Edinburgh School of Economics Discussion Paper Series 188, Edinburgh School of Economics, University of Edinburgh.
- Cason, Timothy N. & Friedman, Daniel & Hopkins, Ed H, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Santa Cruz Department of Economics, Working Paper Series qt8kp6c049, Department of Economics, UC Santa Cruz.
- Timothy N. Cason & Daniel Friedman & Ed Hopkins, 2010. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Purdue University Economics Working Papers 1233, Purdue University, Department of Economics.
- Cason, Timothy N. & Friedman, Daniel UC & Hopkins, Ed, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," SIRE Discussion Papers 2009-15, Scottish Institute for Research in Economics (SIRE).
Funai, Naoki, 2022. "Reinforcement learning with foregone payoff information in normal form games," Journal of Economic Behavior & Organization, Elsevier, vol. 200(C), pages 638-660.
Beggs, A.W., 2005. "On the convergence of reinforcement learning," Journal of Economic Theory, Elsevier, vol. 122(1), pages 1-36, May.
- Alan Beggs, 2002. "On the Convergence of Reinforcement Learning," Economics Series Working Papers 96, University of Oxford, Department of Economics.
Bernergård, Axel & Mohlin, Erik, 2019. "Evolutionary selection against iteratively weakly dominated strategies," Games and Economic Behavior, Elsevier, vol. 117(C), pages 82-97.
- Bernergård, Axel & Mohlin, Erik, 2017. "Evolutionary Selection against Iteratively Weakly Dominated Strategies," Working Papers 2017:18, Lund University, Department of Economics, revised 12 Nov 2018.
Sylvain Sorin, 2023. "Continuous Time Learning Algorithms in Optimization and Game Theory," Dynamic Games and Applications, Springer, vol. 13(1), pages 3-24, March.
Reinoud Joosten & Berend Roorda, 2008. "Generalized projection dynamics in evolutionary game theory," Papers on Economics and Evolution 2008-11, Philipps University Marburg, Department of Geography.

More about this item

Keywords

Dominated strategies; Learning; Nash equilibrium; Regret minimization; Regularization; Robustness; Stochastic game dynamics; Stochastic stability

JEL classification:

C61 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Optimization Techniques; Programming Models; Dynamic Analysis
C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games
C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
