IDEAS home Printed from https://ideas.repec.org/p/els/esrcls/023.html
   My bibliography  Save this paper

Learning with Hazy Beliefs

Author

Listed:
  • Dean Foster
  • Peyton Young

Abstract

Players are rational if they always choose best replies given their beliefs. They are good predictors if the difference between their beliefs and the distribution of the others' actual strategies goes to zero over time. Learning is deterministic if beliefs are fully determined by the initial conditions and the observed data. (Bayesian updating is a particular example). If players are rational, good predictors, and learn deterministically, there are many games for which neither beliefs nor actions converge to a Nash equilibrium. We introduce an alternative approach to learning called prospecting in which players are rational and good predictors, but beliefs have a small random component. In any finite game, and from any initial conditions, prospecting players learn to play arbitrarily close to Nash equilibrium with probability one.

Suggested Citation

  • Dean Foster & Peyton Young, "undated". "Learning with Hazy Beliefs," ELSE working papers 023, ESRC Centre on Economics Learning and Social Evolution.
  • Handle: RePEc:els:esrcls:023
    as

    Download full text from publisher

    File URL: ftp://ftp.repec.org/RePEc/els/esrcls/hazy.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Fudenberg Drew & Kreps David M., 1993. "Learning Mixed Equilibria," Games and Economic Behavior, Elsevier, vol. 5(3), pages 320-367, July.
    2. J. Jordan, 2010. "Three Problems in Learning Mixed-Strategy Equilibria," Levine's Working Paper Archive 475, David K. Levine.
    3. Foster, Dean P. & Vohra, Rakesh V., 1997. "Calibrated Learning and Correlated Equilibrium," Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 40-55, October.
    4. Nyarko, Yaw, 1994. "Bayesian Learning Leads to Correlated Equilibria in Normal Form Games," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 4(6), pages 821-841, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Eric Friedman, 1998. "Learnability of a class of Non-atomic Games arising on the Internet," Departmental Working Papers 199824, Rutgers University, Department of Economics.
    2. Foster, Dean P. & Young, H. Peyton, 2003. "Learning, hypothesis testing, and Nash equilibrium," Games and Economic Behavior, Elsevier, vol. 45(1), pages 73-96, October.
    3. Matthew O. Jackson & Ehud Kalai, 1997. "False Reputation in a Society of Players," Discussion Papers 1184R, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
    4. Timothy Salmon, 2004. "Evidence for Learning to Learn Behavior in Normal Form Games," Theory and Decision, Springer, vol. 56(4), pages 367-404, April.
    5. Turdaliev, Nurlan, 2002. "Calibration and Bayesian learning," Games and Economic Behavior, Elsevier, vol. 41(1), pages 103-119, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sandroni, Alvaro & Smorodinsky, Rann, 2004. "Belief-based equilibrium," Games and Economic Behavior, Elsevier, vol. 47(1), pages 157-171, April.
    2. Jackson, Matthew O. & Kalai, Ehud, 1999. "Reputation versus Social Learning," Journal of Economic Theory, Elsevier, vol. 88(1), pages 40-59, September.
    3. Fudenberg, Drew & Levine, David K., 1999. "Conditional Universal Consistency," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 104-130, October.
    4. Kalai, Ehud & Lehrer, Ehud & Smorodinsky, Rann, 1999. "Calibrated Forecasting and Merging," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 151-169, October.
    5. Fudenberg, Drew & Levine, David K., 1995. "Consistency and cautious fictitious play," Journal of Economic Dynamics and Control, Elsevier, vol. 19(5-7), pages 1065-1089.
    6. , & ,, 2015. "Rationalizable partition-confirmed equilibrium," Theoretical Economics, Econometric Society, vol. 10(3), September.
    7. Turdaliev, Nurlan, 2002. "Calibration and Bayesian learning," Games and Economic Behavior, Elsevier, vol. 41(1), pages 103-119, October.
    8. Friedman, James W. & Mezzetti, Claudio, 2001. "Learning in Games by Random Sampling," Journal of Economic Theory, Elsevier, vol. 98(1), pages 55-84, May.
    9. Foster, Dean P. & Young, H. Peyton, 2003. "Learning, hypothesis testing, and Nash equilibrium," Games and Economic Behavior, Elsevier, vol. 45(1), pages 73-96, October.
    10. H. Peyton Young, 2007. "The Possible and the Impossible in Multi-Agent Learning," Economics Series Working Papers 304, University of Oxford, Department of Economics.
    11. Foster, Dean P. & Vohra, Rakesh, 1999. "Regret in the On-Line Decision Problem," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 7-35, October.
    12. Sergiu Hart & Andreu Mas-Colell, 2013. "Stochastic Uncoupled Dynamics And Nash Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 8, pages 165-189, World Scientific Publishing Co. Pte. Ltd..
    13. Tom Johnston & Michael Savery & Alex Scott & Bassel Tarbush, 2023. "Game Connectivity and Adaptive Dynamics," Papers 2309.10609, arXiv.org, revised Nov 2023.
    14. Mathieu Faure & Gregory Roth, 2010. "Stochastic Approximations of Set-Valued Dynamical Systems: Convergence with Positive Probability to an Attractor," Mathematics of Operations Research, INFORMS, vol. 35(3), pages 624-640, August.
    15. Anthonisen, Niels, 1997. "On the Convergence of Beliefs within Populations in Games with Learning," Journal of Economic Theory, Elsevier, vol. 76(1), pages 169-184, September.
    16. Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
    17. Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009. "Learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
    18. Hopkins, Ed, 1999. "Learning, Matching, and Aggregation," Games and Economic Behavior, Elsevier, vol. 26(1), pages 79-110, January.
    19. Sandholm,W.H., 2003. "Excess payoff dynamics, potential dynamics, and stable games," Working papers 5, Wisconsin Madison - Social Systems.
    20. Chiara Scarampi & Richard Fairchild & Luca Fumarco & Alberto Palermo & Neal Hinvest, 2021. "Social Metacognition: A Correlational Device for Strategic Interactions," Working Papers 2111, Tulane University, Department of Economics.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:els:esrcls:023. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: s. malkani (email available below). General contact details of provider: https://edirc.repec.org/data/elucluk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.