This file is part of IDEAS, which uses RePEc data


Learning within a Markovian Environment

Author Info
Javier Rivas

Abstract

We investigate learning in a setting where, each period, a population has to choose between two actions and the payoff of each action is unknown to the players. The population learns by reinforcement, and the environment is non-stationary: the payoff of each action today is correlated with its payoff in the past. We show that when players observe realized and foregone payoffs, a suboptimal mixed strategy is selected. On the other hand, when players only observe realized payoffs, a unique action, which is optimal if the actions perform differently enough, is selected in the long run. When looking for efficient reinforcement learning rules, we find that it is optimal to disregard the information from foregone payoffs and to learn as if only realized payoffs were observed.
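
The setting in the abstract can be explored with a small simulation. The following is only a minimal sketch under stated assumptions, not the paper's model: it uses a single learner rather than a population, a two-state Markov chain with a hypothetical persistence parameter `stay`, an illustrative payoff table `payoffs`, and a simple linear reinforcement rule with step size `step`. The function name `simulate` and all parameter values are made up for illustration.

```python
import random


def simulate(observe_foregone, periods=10_000, step=0.01, stay=0.9,
             payoffs=None, seed=0):
    """Return the final probability of choosing action 0.

    Assumptions (not taken from the paper): two environment states,
    payoffs in [0, 1], and a linear reinforcement rule with a small step.
    """
    rng = random.Random(seed)
    if payoffs is None:
        # payoffs[state][action]; illustrative values only
        payoffs = [[1.0, 0.0], [0.0, 0.8]]
    state = 0
    p = 0.5  # probability of choosing action 0
    for _ in range(periods):
        action = 0 if rng.random() < p else 1
        pay = payoffs[state]
        if observe_foregone:
            # Both payoffs are seen: reinforce each action by its own payoff.
            p += step * (pay[0] * (1.0 - p) - pay[1] * p)
        elif action == 0:
            # Only the realized payoff is seen: reinforce the chosen action.
            p += step * pay[0] * (1.0 - p)
        else:
            p -= step * pay[1] * p
        # Markovian environment: keep the current state with probability `stay`.
        if rng.random() >= stay:
            state = 1 - state
    return p


if __name__ == "__main__":
    print("realized and foregone payoffs:", simulate(True))
    print("realized payoffs only:", simulate(False))
```

Comparing the two calls for different values of `stay` and `payoffs` gives a rough feel for how the information structure changes what the learner ends up doing; the sketch makes no claim to reproduce the paper's results.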

Download Info
File URL: http://cadmus.iue.it/dspace/bitstream/1814/8084/1/ECO-2008-13.pdf
File Function: main text
Download Restriction: no

Publisher Info
Paper provided by European University Institute in its series Economics Working Papers with number ECO2008/13.

Date of creation: 2008
Handle: RePEc:eui:euiwps:eco2008/13

Contact details of provider:
Postal: Badia Fiesolana, Via dei Roccettini, 9, 50016 San Domenico di Fiesole (FI) Italy
Phone: +39-055-4685.982
Fax: +39-055-4685.902
Web page: http://www.eui.eu/ECO/

For technical questions regarding this item, or to correct its listing, contact: (Marcia Gastaldo).

Related research
Keywords: Adaptive Learning; Markov Chains; Non-stationarity; Reinforcement Learning

JEL classification:
C73 - Mathematical and Quantitative Methods / Game Theory and Bargaining Theory / Stochastic and Dynamic Games; Evolutionary Games

References listed on IDEAS
  1. Vulkan, Nir, 2000. "An Economist's Perspective on Probability Matching," Journal of Economic Surveys, Blackwell Publishing, vol. 14(1), pages 101-18, February.
  2. Cross, John G, 1973. "A Stochastic Learning Model of Economic Behavior," The Quarterly Journal of Economics, MIT Press, vol. 87(2), pages 239-66, May.
  3. Rubinstein, Ariel, 2002. "Irrational diversification in multiple decision problems," European Economic Review, Elsevier, vol. 46(8), pages 1369-1378, September.
  4. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-81, September.
  5. Rustichini, Aldo, 1999. "Optimal Properties of Stimulus-Response Learning Models," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 244-273, October.
  6. Ellison, Glenn & Fudenberg, Drew, 1995. "Word-of-Mouth Communication and Social Learning," The Quarterly Journal of Economics, MIT Press, vol. 110(1), pages 93-125, February.
  7. Samuelson, Larry, 1994. "Stochastic Stability in Games with Alternative Best Replies," Journal of Economic Theory, Elsevier, vol. 64(1), pages 35-65, October.

This page was last updated on 2009-11-12.

