Printed from https://ideas.repec.org/p/pra/mprapa/40040.html

Learning Nash Equilibria

Author

Listed:
  • Dai, Darong

Abstract

In this paper, we re-investigate the long-run behavior of an adaptive learning process driven by the stochastic replicator dynamics developed by Fudenberg and Harris (1992). We show that the Nash equilibrium is the robust limit of the adaptive learning process, provided that the learning dynamics can reach it in almost surely finite time. The proof relies on Doob’s martingale theory and the Girsanov theorem.

Suggested Citation

  • Dai, Darong, 2012. "Learning Nash Equilibria," MPRA Paper 40040, University Library of Munich, Germany.
  • Handle: RePEc:pra:mprapa:40040

    Download full text from publisher

    File URL: https://mpra.ub.uni-muenchen.de/40040/1/MPRA_paper_40040.pdf
    File Function: original version
    Download Restriction: no

    References listed on IDEAS

    1. Fudenberg, D. & Harris, C., 1992. "Evolutionary dynamics with aggregate shocks," Journal of Economic Theory, Elsevier, vol. 57(2), pages 420-441, August.
    2. Alan Beggs, 2002. "Stochastic evolution with slow learning," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 19(2), pages 379-405.
    3. Ken Binmore & Larry Samuelson, "undated". "Evolutionary Drift and Equilibrium Selection," ELSE working papers 011, ESRC Centre on Economics Learning and Social Evolution.
    4. Jordan J. S., 1993. "Three Problems in Learning Mixed-Strategy Nash Equilibria," Games and Economic Behavior, Elsevier, vol. 5(3), pages 368-386, July.
    5. Ellison, Glenn & Fudenberg, Drew, 2000. "Learning Purified Mixed Equilibria," Journal of Economic Theory, Elsevier, vol. 90(1), pages 84-115, January.
    6. Canning, David, 1992. "Average behavior in learning models," Journal of Economic Theory, Elsevier, vol. 57(2), pages 442-472, August.
    7. Gaunersdorfer Andrea & Hofbauer Josef, 1995. "Fictitious Play, Shapley Polygons, and the Replicator Equation," Games and Economic Behavior, Elsevier, vol. 11(2), pages 279-303, November.
    8. Borgers, Tilman & Sarin, Rajiv, 1997. "Learning Through Reinforcement and Replicator Dynamics," Journal of Economic Theory, Elsevier, vol. 77(1), pages 1-14, November.
    9. Young, H Peyton, 1993. "The Evolution of Conventions," Econometrica, Econometric Society, vol. 61(1), pages 57-84, January.
    10. Hofbauer, Josef & Hopkins, Ed, 2005. "Learning in perturbed asymmetric games," Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
    11. Ken Binmore & Larry Samuelson, 1999. "Evolutionary Drift and Equilibrium Selection," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 66(2), pages 363-393.
    12. Cabrales, Antonio, 2000. "Stochastic Replicator Dynamics," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 41(2), pages 451-481, May.
    13. Binmore Kenneth G. & Samuelson Larry & Vaughan Richard, 1995. "Musical Chairs: Modeling Noisy Evolution," Games and Economic Behavior, Elsevier, vol. 11(1), pages 1-35, October.
    14. Kaniovski Yuri M. & Young H. Peyton, 1995. "Learning Dynamics in Games with Stochastic Perturbations," Games and Economic Behavior, Elsevier, vol. 11(2), pages 330-363, November.
    15. Ken Binmore & Larry Samuelson, "undated". "Evolutionary Drift And Equilibrium Selection," ELSE working papers 049, ESRC Centre on Economics Learning and Social Evolution.
    16. Gale, John & Binmore, Kenneth G. & Samuelson, Larry, 1995. "Learning to be imperfect: The ultimatum game," Games and Economic Behavior, Elsevier, vol. 8(1), pages 56-90.
    17. Benaim, Michel & Hirsch, Morris W., 1999. "Mixed Equilibria and Dynamical Systems Arising from Fictitious Play in Perturbed Games," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 36-72, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dai, Darong, 2012. "On the Existence and Stability of Pareto Optimal Endogenous Matching with Fairness," MPRA Paper 40560, University Library of Munich, Germany.
    2. Sandholm, William H., 2003. "Evolution and equilibrium under inexact information," Games and Economic Behavior, Elsevier, vol. 44(2), pages 343-378, August.
    3. Ed Hopkins, 2002. "Two Competing Models of How People Learn in Games," Econometrica, Econometric Society, vol. 70(6), pages 2141-2166, November.
    4. Sandholm,W.H., 1999. "Markov evolution with inexact information," Working papers 15, Wisconsin Madison - Social Systems.
    5. Dai, Darong, 2012. "On the existence and stability of Pareto optimal endogenous matching with fairness," MPRA Paper 40457, University Library of Munich, Germany.
    6. Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
    7. Ponti, Giovanni, 2000. "Continuous-time evolutionary dynamics: theory and practice," Research in Economics, Elsevier, vol. 54(2), pages 187-214, June.
    8. Hofbauer, Josef & Hopkins, Ed, 2005. "Learning in perturbed asymmetric games," Games and Economic Behavior, Elsevier, vol. 52(1), pages 133-152, July.
    9. Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
    10. N. Williams, 2002. "Stability and Long Run Equilibrium in Stochastic Fictitious Play," Princeton Economic Theory Working Papers cbeeeb49cc8afc83f125df5a8, David K. Levine.
    11. Uriarte, Jose Ramon, 2007. "A behavioural foundation for models of evolutionary drift," Journal of Economic Behavior & Organization, Elsevier, vol. 63(3), pages 497-513, July.
    12. Uriarte Ayo, José Ramón, 2005. "A Behavioral Foundation for Models of Evolutionary Drift," IKERLANAK 2005-19, Universidad del País Vasco - Departamento de Fundamentos del Análisis Económico I.
    13. Ponti, Giovanni, 2000. "Cycles of Learning in the Centipede Game," Games and Economic Behavior, Elsevier, vol. 30(1), pages 115-141, January.
    14. Dai, Darong, 2012. "On the Existence of Pareto Optimal Endogenous Matching," MPRA Paper 43125, University Library of Munich, Germany.
    15. Simon P. Anderson & Jacob K. Goeree & Charles A. Holt, 1999. "Stochastic Game Theory: Adjustment to Equilibrium Under Noisy Directional Learning," Virginia Economics Online Papers 327, University of Virginia, Department of Economics.
    16. Hofbauer,J. & Sandholm,W.H., 2001. "Evolution and learning in games with randomly disturbed payoffs," Working papers 5, Wisconsin Madison - Social Systems.
    17. Williams, Noah, 2022. "Learning and equilibrium transitions: Stochastic stability in discounted stochastic fictitious play," Journal of Economic Dynamics and Control, Elsevier, vol. 145(C).
    18. Hofbauer, Josef & Sandholm, William H., 2007. "Evolution in games with randomly disturbed payoffs," Journal of Economic Theory, Elsevier, vol. 132(1), pages 47-69, January.
    19. Sandholm, William H., 2015. "Population Games and Deterministic Evolutionary Dynamics," Handbook of Game Theory with Economic Applications, Elsevier.

    More about this item

    Keywords

    Stochastic replicator dynamics; Adaptive learning; Nash equilibria; Global convergence; Robustness;

    JEL classification:

    • C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games
    • C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
