The dynamics of generalized reinforcement learning
Author
Abstract
Suggested Citation
DOI: 10.1016/j.jet.2014.01.002
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
References listed on IDEAS
- Karandikar, Rajeeva & Mookherjee, Dilip & Ray, Debraj & Vega-Redondo, Fernando, 1998.
"Evolving Aspirations and Cooperation,"
Journal of Economic Theory, Elsevier, vol. 80(2), pages 292-331, June.
- Debraj Ray & Dilip Mookherjee & Fernando Vega Redondo & Rajeeva L. Karandikar, 1996. "Evolving aspirations and cooperation," Working Papers. Serie AD 1996-06, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
- Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
- Fudenberg, Drew & Takahashi, Satoru, 2011.
"Heterogeneous beliefs and local information in stochastic fictitious play,"
Games and Economic Behavior, Elsevier, vol. 71(1), pages 100-120, January.
- Drew Fudenberg & Satoru Takahashi, 2008. "Heterogeneous Beliefs and Local Information in Stochastic Fictitious Play," Levine's Working Paper Archive 122247000000001695, David K. Levine.
- Takahashi, Satoru & Fudenberg, Drew, 2011. "Heterogeneous beliefs and local information in stochastic fictitious play," Scholarly Articles 27755310, Harvard University Department of Economics.
- Borgers, Tilman & Sarin, Rajiv, 2000.
"Naive Reinforcement Learning with Endogenous Aspirations,"
International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 41(4), pages 921-950, November.
- Tilman Börgers & Rajiv Sarin, "undated". "Naive Reinforcement Learning With Endogenous Aspiration," ELSE working papers 037, ESRC Centre on Economics Learning and Social Evolution.
- T. Borgers & R. Sarin, 2010. "Naïve Reinforcement Learning With Endogenous Aspirations," Levine's Working Paper Archive 381, David K. Levine.
- Gaunersdorfer Andrea & Hofbauer Josef, 1995.
"Fictitious Play, Shapley Polygons, and the Replicator Equation,"
Games and Economic Behavior, Elsevier, vol. 11(2), pages 279-303, November.
- A. Gaunersdorfer & J. Hofbauer, 2010. "Fictitious Play, Shapley Polygons and the Replicator Equation," Levine's Working Paper Archive 438, David K. Levine.
- Borgers, Tilman & Sarin, Rajiv, 1997.
"Learning Through Reinforcement and Replicator Dynamics,"
Journal of Economic Theory, Elsevier, vol. 77(1), pages 1-14, November.
- Tilman Börgers & Rajiv Sarin, "undated". "Learning Through Reinforcement and Replicator Dynamics," ELSE working papers 051, ESRC Centre on Economics Learning and Social Evolution.
- T. Borgers & R. Sarin, 2010. "Learning Through Reinforcement and Replicator Dynamics," Levine's Working Paper Archive 380, David K. Levine.
- Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
- Lahkar, Ratul & Seymour, Robert M., 2013. "Reinforcement learning in population games," Games and Economic Behavior, Elsevier, vol. 80(C), pages 10-38.
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Schauf, Andrew & Oh, Poong, 2021. "Adaptation strategies and collective dynamics of extraction in networked commons of bistable resources," SocArXiv wmtqk, Center for Open Science.
- Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
- Lahkar, Ratul, 2017. "Equilibrium selection in the stag hunt game under generalized reinforcement learning," Journal of Economic Behavior & Organization, Elsevier, vol. 138(C), pages 63-68.
- Funai, Naoki, 2022. "Reinforcement learning with foregone payoff information in normal form games," Journal of Economic Behavior & Organization, Elsevier, vol. 200(C), pages 638-660.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Mengel, Friederike, 2012.
"Learning across games,"
Games and Economic Behavior, Elsevier, vol. 74(2), pages 601-619.
- Friederike Mengel, 2007. "Learning Across Games," Working Papers. Serie AD 2007-05, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
- Schuster, Stephan, 2012. "Applications in Agent-Based Computational Economics," MPRA Paper 47201, University Library of Munich, Germany.
- Izquierdo, Luis R. & Izquierdo, Segismundo S. & Gotts, Nicholas M. & Polhill, J. Gary, 2007. "Transient and asymptotic dynamics of reinforcement learning in games," Games and Economic Behavior, Elsevier, vol. 61(2), pages 259-276, November.
- Ed Hopkins, 2002.
"Two Competing Models of How People Learn in Games,"
Econometrica, Econometric Society, vol. 70(6), pages 2141-2166, November.
- Ed Hopkins, 2000. "Two Competing Models of How People Learn in Games," Edinburgh School of Economics Discussion Paper Series 51, Edinburgh School of Economics, University of Edinburgh.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," NajEcon Working Paper Reviews 625018000000000226, www.najecon.org.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," Levine's Working Paper Archive 625018000000000226, David K. Levine.
- Oyarzun, Carlos & Sarin, Rajiv, 2013.
"Learning and risk aversion,"
Journal of Economic Theory, Elsevier, vol. 148(1), pages 196-225.
- Carlos Oyarzun & Rajiv Sarin, 2005. "Learning and Risk Aversion," Levine's Bibliography 784828000000000482, UCLA Department of Economics.
- Carlos Oyarzun & Rajiv Sarin, 2012. "Learning and Risk Aversion," Levine's Working Paper Archive 786969000000000572, David K. Levine.
- Tilman Börgers & Antonio J. Morales & Rajiv Sarin, 2004.
"Expedient and Monotone Learning Rules,"
Econometrica, Econometric Society, vol. 72(2), pages 383-405, March.
- Tilman Börgers & Rajiv Sarin & Antonio J. Morales, 2001. "Expedient and Monotone Learning Rules," Economic Working Papers at Centro de Estudios Andaluces E2001/06, Centro de Estudios Andaluces.
- Tilman Borgers & Antonio Morales & Rajiv Sarin, 2010. "Expedient and Monotone Learning Rules," Levine's Working Paper Archive 625018000000000099, David K. Levine.
- Schuster, Stephan, 2010. "Network Formation with Adaptive Agents," MPRA Paper 27388, University Library of Munich, Germany.
- Laslier, Jean-Francois & Topol, Richard & Walliser, Bernard, 2001.
"A Behavioral Learning Process in Games,"
Games and Economic Behavior, Elsevier, vol. 37(2), pages 340-366, November.
- Laslier, J.-F. & Topol, R. & Walliser, B., 1999. "A Behavioral Learning Process in Games," Papers 99-03, Paris X - Nanterre, U.F.R. de Sc. Ec. Gest. Maths Infor..
- J.-F. Laslier & R. Topol & B. Walliser, 1999. "A behavioral learning process in games," THEMA Working Papers 99-03, THEMA (THéorie Economique, Modélisation et Applications), Université de Cergy-Pontoise.
- Segismundo S. Izquierdo & Luis R. Izquierdo & Nicholas M. Gotts, 2008. "Reinforcement Learning Dynamics in Social Dilemmas," Journal of Artificial Societies and Social Simulation, Journal of Artificial Societies and Social Simulation, vol. 11(2), pages 1-1.
- Lahkar, Ratul & Seymour, Robert M., 2013. "Reinforcement learning in population games," Games and Economic Behavior, Elsevier, vol. 80(C), pages 10-38.
- Sarin, Rajiv & Vahid, Farshid, 2001.
"Predicting How People Play Games: A Simple Dynamic Model of Choice,"
Games and Economic Behavior, Elsevier, vol. 34(1), pages 104-122, January.
- Sarin, R. & Vahid, F., 1999. "Predicting how People Play Games: a Simple Dynamic Model of Choice," Monash Econometrics and Business Statistics Working Papers 12/99, Monash University, Department of Econometrics and Business Statistics.
- Dixon, Huw D. & Sbriglia, Patrizia & Somma, Ernesto, 2006. "Learning to collude: An experiment in convergence and equilibrium selection in oligopoly," Research in Economics, Elsevier, vol. 60(3), pages 155-167, September.
- Duffy, John, 2006.
"Agent-Based Models and Human Subject Experiments,"
Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011,
Elsevier.
- John Duffy, 2004. "Agent-Based Models and Human Subject Experiments," Computational Economics 0412001, University Library of Munich, Germany.
- Droste, Edward & Kosfeld, Michael & Voorneveld, Mark, 2003. "Best-reply matching in games," Mathematical Social Sciences, Elsevier, vol. 46(3), pages 291-309, December.
- Napel, Stefan, 2003. "Aspiration adaptation in the ultimatum minigame," Games and Economic Behavior, Elsevier, vol. 43(1), pages 86-106, April.
- Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
- DeJong, D.V. & Blume, A. & Neumann, G., 1998.
"Learning in Sender-Receiver Games,"
Other publications TiSEM
4a8b4f46-f30b-4ad2-bb0c-1, Tilburg University, School of Economics and Management.
- Blume, A. & DeJong, D.V. & Neumann, G.R. & Savin, N.E., 1998. "Learning in Sender-Receiver Games," Working Papers 98-02, University of Iowa, Department of Economics.
- DeJong, D.V. & Blume, A. & Neumann, G., 1998. "Learning in Sender-Receiver Games," Discussion Paper 1998-28, Tilburg University, Center for Economic Research.
- Andreas Blume & Douglas V. DeJong & George R. Neumann & Nathan E. Savin, 1998. "Learning in Sender-Receiver Games," CIG Working Papers FS IV 98-13, Wissenschaftszentrum Berlin (WZB), Research Unit: Competition and Innovation (CIG).
- Jean-François Laslier & Bernard Walliser, 2015.
"Stubborn learning,"
Theory and Decision, Springer, vol. 79(1), pages 51-93, July.
- Jean-François Laslier & Bernard Walliser, 2011. "Stubborn Learning," Working Papers hal-00609501, HAL.
- Jean-François Laslier & Bernard Walliser, 2011. "Stubborn Learning," PSE Working Papers hal-00609501, HAL.
- Jean-François Laslier & Bernard Walliser, 2015. "Stubborn learning," Post-Print halshs-01310229, HAL.
- Jean-François Laslier & Bernard Walliser, 2015. "Stubborn learning," PSE-Ecole d'économie de Paris (Postprint) halshs-01310229, HAL.
- Ponti, Giovanni, 2000.
"Continuous-time evolutionary dynamics: theory and practice,"
Research in Economics, Elsevier, vol. 54(2), pages 187-214, June.
- Giovanni Ponti, 1999. "- Continuous-Time Evolutionary Dynamics: Theory And Practice," Working Papers. Serie AD 1999-31, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
- Franke, Reiner, 2003. "Reinforcement learning in the El Farol model," Journal of Economic Behavior & Organization, Elsevier, vol. 51(3), pages 367-388, July.
More about this item
Keywords
Reinforcement learning; Negative reinforcement; Replicator dynamic;All these keywords.
JEL classification:
- C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games
- C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jetheo:v:151:y:2014:i:c:p:584-595. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/622869 .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.