Author
Listed:
- Ebbe Hendon
(Institute of Economics, University of Copenhagen)
- Hans Jørgen Jacobsen
(Institute of Economics, University of Copenhagen)
- Michael Teit Nielsen
(Institute of Economics, University of Copenhagen)
- Birgitte Sloth
(Institute of Economics, University of Copenhagen)
Abstract
We report new results on a slightly generalized version of the classical learning process for games proposed by Brown (1951), and further analyzed by Robinson (1951), Miyasawa (1961) and Shapley (1964). the learning process rests on an exogenous prior distribution over strategies - a mixed strategy combination - and an initial weight, a number w e [0,8[, given to that prior. In step s of the learning process, a vector best reply to the weighted average of the strategy combinations of all previous steps is calculated. The average is then updated by this last best reply, and the learning process keeps track of the updated averages. The learning process can be given several interpretations: A preplay thought process of a one shot game (fictitious play), or a sequence of actual plays of a repeated game. We show that if the average strategy combination of our process converges to a specific point, this point is a Nash equilibrium. If it does not converge, the pure strategies used with positive probability in the limit, i.e. in cluster points, are 'rationalizable', and in fact have the 'best response property' in the sense of Pearce (1984). We show that for all 2 x 2 bimatrix games, for any prior and any initial weight, the average strategy combination converges, and we characterize the solution for these games. It turns out that if the initial weight is large, there is a striking similarity between the convergence point of the learning process and the equilibrium traced by the tracing procedure of Harsanyi (1975). Finally we apply the learning process to (the agent strategic form) extensive form games, assuming a completely mixed prior and a strictly positive initial weight reflecting initial uncertainty. In games of perfect information the process always converges to a sub-game perfect equilibrium. In games of imperfect information, if the process, as well as the beliefs derived from it by Bayes' rule, converges, then the convergence point is a sequential equilibrium.
Suggested Citation
Ebbe Hendon & Hans Jørgen Jacobsen & Michael Teit Nielsen & Birgitte Sloth, 1990.
"A Learning Process for Games,"
Discussion Papers
90-20, University of Copenhagen. Department of Economics.
Handle:
RePEc:kud:kuiedp:9020
Download full text from publisher
To our knowledge, this item is not available for
download. To find whether it is available, there are three
options:
1. Check below whether another version of this item is available online.
2. Check on the provider's
web page
whether it is in fact available.
3. Perform a
search for a similarly titled item that would be
available.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kud:kuiedp:9020. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Thomas Hoffmann (email available below). General contact details of provider: https://edirc.repec.org/data/okokudk.html .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.