Learning within a Markovian Environment
Author
Abstract
Suggested Citation
Download full text from publisher
References listed on IDEAS
- Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
- Glenn Ellison & Drew Fudenberg, 1995.
"Word-of-Mouth Communication and Social Learning,"
The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 110(1), pages 93-125.
- Fudenberg, Drew & Ellison, Glenn, 1995. "Word-of-Mouth Communication and Social Learning," Scholarly Articles 3196300, Harvard University Department of Economics.
- A. Banerjee & Drew Fudenberg, 2010. "Word-of-Mouth Communication and Social Learning," Levine's Working Paper Archive 425, David K. Levine.
- John G. Cross, 1973. "A Stochastic Learning Model of Economic Behavior," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 87(2), pages 239-266.
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Rivas, Javier, 2013.
"Probability matching and reinforcement learning,"
Journal of Mathematical Economics, Elsevier, vol. 49(1), pages 17-21.
- Javier Rivas, 2011. "Probability Matching and Reinforcement Learning," Discussion Papers in Economics 11/20, Division of Economics, School of Business, University of Leicester.
- Yves Ortiz & Martin schüle, 2011. "Limited Rationality and Strategic Interaction: A Probabilistic Multi-Agent Model," Working Papers 11.08, Swiss National Bank, Study Center Gerzensee.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Oyarzun, Carlos & Ruf, Johannes, 2014. "Convergence in models with bounded expected relative hazard rates," Journal of Economic Theory, Elsevier, vol. 154(C), pages 229-244.
- Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
- Osili, Una Okonkwo & Paulson, Anna, 2014. "Crises and confidence: Systemic banking crises and depositor behavior," Journal of Financial Economics, Elsevier, vol. 111(3), pages 646-660.
- Ponti, Giovanni, 2000.
"Continuous-time evolutionary dynamics: theory and practice,"
Research in Economics, Elsevier, vol. 54(2), pages 187-214, June.
- Giovanni Ponti, 1999. "- Continuous-Time Evolutionary Dynamics: Theory And Practice," Working Papers. Serie AD 1999-31, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
- Apesteguia, Jose & Huck, Steffen & Oechssler, Jorg, 2007.
"Imitation--theory and experimental evidence,"
Journal of Economic Theory, Elsevier, vol. 136(1), pages 217-235, September.
- Jose Apesteguia & Steffen Huck & Jorg Oechssler, 2003. "Imitation - Theory and Experimental Evidence," Experimental 0309001, University Library of Munich, Germany.
- Apestgeguia, Jose & Huck, Steffen & Oechssler, Jörg, 2005. "Imitation - Theory and Experimental Evidence," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 54, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Apesteguia, José & Huck, Steffen & Oechssler, Jörg, 2003. "Imitation - Theory and Experimental Evidence," Bonn Econ Discussion Papers 20/2003, University of Bonn, Bonn Graduate School of Economics (BGSE).
- Apesteguia, Jose & Huck, Steffen & Oechssler, Joerg, 2003. "Imitation - Theory and Experimental Evidence," University of California at Santa Barbara, Economics Working Paper Series qt3h0887tj, Department of Economics, UC Santa Barbara.
- Jose Alpesteguia & Steffen Huck & Jörg Oechssler, 2003. "Imitation - Theory and Experimental Evidence," CESifo Working Paper Series 1049, CESifo.
- José Apesteguía & Steffen Huck & Jorg Oechssler, 2003. "Imitation-Theory and Experimental Evidence-," Documentos de Trabajo - Lan Gaiak Departamento de Economía - Universidad Pública de Navarra 0306, Departamento de Economía - Universidad Pública de Navarra.
- Jose Apesteguia & Steffen Huck & Jorg Oechssler, 2004. "Imitation - Theory and Experimental Evidence," Levine's Bibliography 122247000000000132, UCLA Department of Economics.
- Hopkins, Ed, 2007.
"Adaptive learning models of consumer behavior,"
Journal of Economic Behavior & Organization, Elsevier, vol. 64(3-4), pages 348-368.
- Ed Hopkins, 2004. "Adaptive Learning Models of Consumer Behaviour," Edinburgh School of Economics Discussion Paper Series 121, Edinburgh School of Economics, University of Edinburgh.
- Ed Hopkins, 2006. "Adaptive Learning Models of Consumer Behaviour," Levine's Bibliography 122247000000000658, UCLA Department of Economics.
- Ed Hopkins, 2010. "Adaptive Learning Models of Consumer Behaviour," Levine's Working Paper Archive 506439000000000346, David K. Levine.
- Oyarzun, Carlos & Sarin, Rajiv, 2013.
"Learning and risk aversion,"
Journal of Economic Theory, Elsevier, vol. 148(1), pages 196-225.
- Carlos Oyarzun & Rajiv Sarin, 2005. "Learning and Risk Aversion," Levine's Bibliography 784828000000000482, UCLA Department of Economics.
- Carlos Oyarzun & Rajiv Sarin, 2012. "Learning and Risk Aversion," Levine's Working Paper Archive 786969000000000572, David K. Levine.
- Jiayang Li & Zhaoran Wang & Yu Marco Nie, 2023. "Wardrop Equilibrium Can Be Boundedly Rational: A New Behavioral Theory of Route Choice," Papers 2304.02500, arXiv.org, revised Feb 2024.
- repec:awi:wpaper:0419 is not listed on IDEAS
- Bernergård, Axel & Mohlin, Erik, 2019.
"Evolutionary selection against iteratively weakly dominated strategies,"
Games and Economic Behavior, Elsevier, vol. 117(C), pages 82-97.
- Bernergård, Axel & Mohlin, Erik, 2017. "Evolutionary Selection against Iteratively Weakly Dominated Strategies," Working Papers 2017:18, Lund University, Department of Economics, revised 12 Nov 2018.
- Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
- Innocenti, Stefania & Cowan, Robin, 2019.
"Self-efficacy beliefs and imitation: A two-armed bandit experiment,"
European Economic Review, Elsevier, vol. 113(C), pages 156-172.
- Stefania Innocenti & Robin Cowan, 2019. "Self-efficacy beliefs and imitation : A two-armed bandit experiment," Post-Print hal-03213711, HAL.
- Shu-Heng Chen & Yi-Lin Hsieh, 2011. "Reinforcement Learning in Experimental Asset Markets," Eastern Economic Journal, Palgrave Macmillan;Eastern Economic Association, vol. 37(1), pages 109-133.
- Aloys Prinz, 2019. "Learning (Not) to Evade Taxes," Games, MDPI, vol. 10(4), pages 1-18, September.
- Tilman Börgers & Antonio J. Morales & Rajiv Sarin, 2004.
"Expedient and Monotone Learning Rules,"
Econometrica, Econometric Society, vol. 72(2), pages 383-405, March.
- Tilman Börgers & Rajiv Sarin & Antonio J. Morales, 2001. "Expedient and Monotone Learning Rules," Economic Working Papers at Centro de Estudios Andaluces E2001/06, Centro de Estudios Andaluces.
- Tilman Borgers & Antonio Morales & Rajiv Sarin, 2010. "Expedient and Monotone Learning Rules," Levine's Working Paper Archive 625018000000000099, David K. Levine.
- Tassos Patokos, 2014. "Introducing Disappointment Dynamics and Comparing Behaviors in Evolutionary Games: Some Simulation Results," Games, MDPI, vol. 5(1), pages 1-25, January.
- Laslier, Jean-Francois & Topol, Richard & Walliser, Bernard, 2001.
"A Behavioral Learning Process in Games,"
Games and Economic Behavior, Elsevier, vol. 37(2), pages 340-366, November.
- Laslier, J.-F. & Topol, R. & Walliser, B., 1999. "A Behavioral Learning Process in Games," Papers 99-03, Paris X - Nanterre, U.F.R. de Sc. Ec. Gest. Maths Infor..
- J.-F. Laslier & R. Topol & B. Walliser, 1999. "A behavioral learning process in games," THEMA Working Papers 99-03, THEMA (THéorie Economique, Modélisation et Applications), Université de Cergy-Pontoise.
- Segismundo S. Izquierdo & Luis R. Izquierdo & Nicholas M. Gotts, 2008. "Reinforcement Learning Dynamics in Social Dilemmas," Journal of Artificial Societies and Social Simulation, Journal of Artificial Societies and Social Simulation, vol. 11(2), pages 1-1.
- Atanasios Mitropoulos, 2001. "Learning Under Little Information: An Experiment on Mutual Fate Control," Game Theory and Information 0110003, University Library of Munich, Germany.
- Jaspersen, Johannes G. & Montibeller, Gilberto, 2020. "On the learning patterns and adaptive behavior of terrorist organizations," European Journal of Operational Research, Elsevier, vol. 282(1), pages 221-234.
- Atanasios Mitropoulos, 2001. "On the Measurement of the Predictive Success of Learning Theories in Repeated Games," Experimental 0110001, University Library of Munich, Germany.
More about this item
Keywords
Adaptive Learning; Markov Chains; Non-stationarity; Reinforcement Learning;All these keywords.
JEL classification:
- C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
NEP fields
This paper has been announced in the following NEP Reports:- NEP-CBA-2008-02-16 (Central Banking)
- NEP-CBE-2008-02-16 (Cognitive and Behavioural Economics)
- NEP-EVO-2008-02-16 (Evolutionary Economics)
- NEP-GTH-2008-02-16 (Game Theory)
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eui:euiwps:eco2008/13. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Cécile Brière (email available below). General contact details of provider: https://edirc.repec.org/data/deiueit.html .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.