Printed from https://ideas.repec.org/a/aea/aejmic/v14y2022i3p321-52.html

Strategic Teaching and Learning in Games

Author

Listed:
  • Burkhard C. Schipper

Abstract

We show there is no uncoupled learning heuristic leading to Nash equilibrium in all finite games that a player has an incentive to adopt, that would be evolutionary stable, or that could "learn itself." Rather, a player has an incentive to strategically teach a learning opponent to secure at least the Stackelberg leader payoff. This observation holds even when we restrict to generic games, two-player games, potential games, games with strategic complements, or 2 x 2 games, in which learning is known to be "nice." It also applies to uncoupled learning heuristics leading to correlated equilibria, rationalizability, iterated admissibility, or minimal CURB sets.
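The teaching incentive described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the 2 x 2 payoff matrices `A` and `B`, the choice of myopic best reply to the opponent's previous action as the learner's heuristic, and the helper names. In this hypothetical game the row player's action U is strictly dominant, so two best-reply learners settle on (U, L) and the row player earns 2, while a row player who commits to D teaches the learner to play R and earns 3, the Stackelberg leader payoff.

```python
# Hypothetical 2 x 2 game (illustrative, not from the paper).
# Rows: player 1 (0 = U, 1 = D). Columns: player 2 (0 = L, 1 = R).
A = [[2, 4], [1, 3]]  # player 1's payoffs
B = [[1, 0], [0, 2]]  # player 2's payoffs


def best_reply_row(col):
    """Player 1's myopic best reply to a column action."""
    return max(range(2), key=lambda r: A[r][col])


def best_reply_col(row):
    """Player 2's myopic best reply to a row action."""
    return max(range(2), key=lambda c: B[row][c])


def stackelberg_leader_payoff():
    """Player 1's payoff from committing to a row and being best-replied to."""
    return max(A[r][best_reply_col(r)] for r in range(2))


def average_payoff(teacher, periods=1000):
    """Player 1's average payoff against a learner that best-replies
    to player 1's action from the previous period."""
    row, col = 0, 0  # arbitrary initial actions (U, L)
    total = 0
    for _ in range(periods):
        new_row = teacher(col)          # player 1 sees last period's column
        new_col = best_reply_col(row)   # learner sees last period's row
        row, col = new_row, new_col
        total += A[row][col]
    return total / periods


learner_vs_learner = average_payoff(best_reply_row)   # both players learn
teacher_vs_learner = average_payoff(lambda col: 1)    # player 1 always plays D

print(stackelberg_leader_payoff(), learner_vs_learner, teacher_vs_learner)
```

In this sketch the committed player gives up one period of payoff while the learner adjusts and earns the leader payoff in every period after that, so adopting the learning heuristic is not in that player's interest. The paper's result covers a much broader class of heuristics and games than this single example.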

Suggested Citation

  • Burkhard C. Schipper, 2022. "Strategic Teaching and Learning in Games," American Economic Journal: Microeconomics, American Economic Association, vol. 14(3), pages 321-352, August.
  • Handle: RePEc:aea:aejmic:v:14:y:2022:i:3:p:321-52
    DOI: 10.1257/mic.20170139

    Download full text from publisher

    File URL: https://www.aeaweb.org/doi/10.1257/mic.20170139
    Download Restriction: no

    File URL: https://www.aeaweb.org/doi/10.1257/mic.20170139.ds
    Download Restriction: Access to full text is restricted to AEA members and institutional subscribers.


    References listed on IDEAS

    1. Moulin, Herve, 1979. "Dominance Solvable Voting Schemes," Econometrica, Econometric Society, vol. 47(6), pages 1137-1151, November.
    2. Nikolaus Robalino & Arthur Robson, 2016. "The Evolution of Strategic Sophistication," American Economic Review, American Economic Association, vol. 106(4), pages 1046-1072, April.
    3. Milgrom, Paul & Roberts, John, 1990. "Rationalizability, Learning, and Equilibrium in Games with Strategic Complementarities," Econometrica, Econometric Society, vol. 58(6), pages 1255-1277, November.
    4. Basu, Kaushik & Weibull, Jorgen W., 1991. "Strategy subsets closed under rational behavior," Economics Letters, Elsevier, vol. 36(2), pages 141-146, June.
    5. Peter Duersch & Albert Kolb & Jörg Oechssler & Burkhard Schipper, 2010. "Rage against the machines: how subjects play against learning algorithms," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 43(3), pages 407-430, June.
    6. John H. Nachbar, 1997. "Prediction, Optimization, and Learning in Repeated Games," Econometrica, Econometric Society, vol. 65(2), pages 275-310, March.
    7. Burkhard Schipper, 2011. "Strategic Control of Myopic Best Reply in Repeated Games," Working Papers 284, University of California, Davis, Department of Economics.
    8. Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
    9. Dürsch, Peter & Kolb, Albert & Oechssler, Jörg & Schipper, Burkhard, 2005. "Rage against the machines : how subjects learn to play against computers," Papers 05-36, Sonderforschungsbereich 504.
    10. Germano, Fabrizio & Lugosi, Gabor, 2007. "Global Nash convergence of Foster and Young's regret testing," Games and Economic Behavior, Elsevier, vol. 60(1), pages 135-154, July.
    11. Terracol, Antoine & Vaksmann, Jonathan, 2009. "Dumbing down rational players: Learning and teaching in an experimental game," Journal of Economic Behavior & Organization, Elsevier, vol. 70(1-2), pages 54-71, May.
    12. Chong, Juin-Kuan & Camerer, Colin F. & Ho, Teck H., 2006. "A learning-based model of repeated games with incomplete information," Games and Economic Behavior, Elsevier, vol. 55(2), pages 340-371, May.
    13. Foster, Dean P. & Young, H. Peyton, 2006. "Regret testing: learning to play Nash equilibrium without knowing you have an opponent," Theoretical Economics, Econometric Society, vol. 1(3), pages 341-367, September.
    14. Bernheim, B Douglas, 1984. "Rationalizable Strategic Behavior," Econometrica, Econometric Society, vol. 52(4), pages 1007-1028, July.
    15. Yakov Babichenko, 2010. "Uncoupled automata and pure Nash equilibria," International Journal of Game Theory, Springer;Game Theory Society, vol. 39(3), pages 483-502, July.
    16. Aumann, Robert J., 1974. "Subjectivity and correlation in randomized strategies," Journal of Mathematical Economics, Elsevier, vol. 1(1), pages 67-96, March.
    17. Peter Duersch & Jörg Oechssler & Burkhard Schipper, 2014. "When is tit-for-tat unbeatable?," International Journal of Game Theory, Springer;Game Theory Society, vol. 43(1), pages 25-36, February.
    18. Binmore, Kenneth G. & Samuelson, Larry, 1992. "Evolutionary stability in repeated games played by finite automata," Journal of Economic Theory, Elsevier, vol. 57(2), pages 278-305, August.
    19. Kalai, Ehud & Lehrer, Ehud, 1993. "Rational Learning Leads to Nash Equilibrium," Econometrica, Econometric Society, vol. 61(5), pages 1019-1045, September.
    20. Pearce, David G, 1984. "Rationalizable Strategic Behavior and the Problem of Perfection," Econometrica, Econometric Society, vol. 52(4), pages 1029-1050, July.
    21. Foster, Dean P. & Vohra, Rakesh V., 1997. "Calibrated Learning and Correlated Equilibrium," Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 40-55, October.
    22. Sergiu Hart & Andreu Mas-Colell, 2013. "A General Class Of Adaptive Strategies," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 3, pages 47-76, World Scientific Publishing Co. Pte. Ltd.
    23. Jordan, J. S., 1991. "Bayesian learning in normal form games," Games and Economic Behavior, Elsevier, vol. 3(1), pages 60-81, February.
    24. Dean Foster & H Peyton Young, 1999. "On the Impossibility of Predicting the Behavior of Rational Agents," Economics Working Paper Archive 423, The Johns Hopkins University, Department of Economics, revised Jun 2001.
    25. Ellison, Glenn, 1997. "Learning from Personal Experience: One Rational Guy and the Justification of Myopia," Games and Economic Behavior, Elsevier, vol. 19(2), pages 180-210, May.
    26. Lipman, Barton L, 1991. "How to Decide How to Decide How to. . . : Modeling Limited Rationality," Econometrica, Econometric Society, vol. 59(4), pages 1105-1125, July.
    27. Sorin, Sylvain, 1999. "Merging, Reputation, and Repeated Games with Incomplete Information," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 274-308, October.
    28. Fernando Vega-Redondo, 1997. "The Evolution of Walrasian Behavior," Econometrica, Econometric Society, vol. 65(2), pages 375-384, March.
    29. Israeli, Eitan, 1999. "Sowing Doubt Optimally in Two-Person Repeated Games," Games and Economic Behavior, Elsevier, vol. 28(2), pages 203-216, August.
    30. Fudenberg, Drew & Levine, David K., 1999. "Conditional Universal Consistency," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 104-130, October.
    31. Mailath, George J. & Samuelson, Larry, 2006. "Repeated Games and Reputations: Long-Run Relationships," OUP Catalogue, Oxford University Press, number 9780195300796.
    32. John H. Nachbar, 2001. "Bayesian learning in repeated games of incomplete information," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 18(2), pages 303-326.
    33. Foster, Dean P. & Young, H. Peyton, 2003. "Learning, hypothesis testing, and Nash equilibrium," Games and Economic Behavior, Elsevier, vol. 45(1), pages 73-96, October.
    34. Drew Fudenberg & David K. Levine, 2008. "Reputation And Equilibrium Selection In Games With A Patient Player," World Scientific Book Chapters, in: Drew Fudenberg & David K Levine (ed.), A Long-Run Collaboration On Long-Run Games, chapter 7, pages 123-142, World Scientific Publishing Co. Pte. Ltd.
    35. Sergiu Hart & Andreu Mas-Colell, 2013. "A Simple Adaptive Procedure Leading To Correlated Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 2, pages 17-46, World Scientific Publishing Co. Pte. Ltd.
    36. Schipper, Burkhard C., 2009. "Imitators and optimizers in Cournot oligopoly," Journal of Economic Dynamics and Control, Elsevier, vol. 33(12), pages 1981-1990, December.
    37. Sergiu Hart & Andreu Mas-Colell, 2013. "Stochastic Uncoupled Dynamics And Nash Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 8, pages 165-189, World Scientific Publishing Co. Pte. Ltd.
    38. Aumann, Robert J. & Sorin, Sylvain, 1989. "Cooperation and bounded recall," Games and Economic Behavior, Elsevier, vol. 1(1), pages 5-39, March.
    39. Fudenberg, Drew & Maskin, Eric, 1990. "Evolution and Cooperation in Noisy Repeated Games," American Economic Review, American Economic Association, vol. 80(2), pages 274-279, May.
    40. Cripps, Martin W. & Schmidt, Klaus M. & Thomas, Jonathan P., 1996. "Reputation in Perturbed Repeated Games," Journal of Economic Theory, Elsevier, vol. 69(2), pages 387-410, May.
    41. Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, April.
    42. Kim, Yong-Gwan, 1994. "Evolutionarily stable strategies in the repeated prisoner's dilemma," Mathematical Social Sciences, Elsevier, vol. 28(3), pages 167-197, December.
    43. Cripps, Martin W & Thomas, Jonathan P, 1995. "Reputation and Commitment in Two-Person Repeated Games without Discounting," Econometrica, Econometric Society, vol. 63(6), pages 1401-1419, November.
    44. Fabrizio Germano, 2007. "Stochastic Evolution of Rules for Playing Finite Normal Form Games," Theory and Decision, Springer, vol. 62(4), pages 311-333, May.
    45. Martin W. Cripps & Jonathan P. Thomas, 2003. "Some Asymptotic Results in Discounted Repeated Games of One-Sided Incomplete Information," Mathematics of Operations Research, INFORMS, vol. 28(3), pages 433-462, August.
    46. Shalev, Jonathan, 1994. "Nonzero-Sum Two-Person Repeated Games with Incomplete Information and Known-Own Payoffs," Games and Economic Behavior, Elsevier, vol. 7(2), pages 246-259, September.
    47. John H. Nachbar, 2005. "Beliefs in Repeated Games," Econometrica, Econometric Society, vol. 73(2), pages 459-480, March.
    48. Sergiu Hart & Andreu Mas-Colell, 2013. "Uncoupled Dynamics Do Not Lead To Nash Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 7, pages 153-163, World Scientific Publishing Co. Pte. Ltd.
    49. Camerer, Colin F. & Ho, Teck-Hua & Chong, Juin-Kuan, 2002. "Sophisticated Experience-Weighted Attraction Learning and Strategic Teaching in Repeated Games," Journal of Economic Theory, Elsevier, vol. 104(1), pages 137-188, May.
    50. Schipper, Burkhard C, 2011. "Strategic control of myopic best reply in repeated games," MPRA Paper 30219, University Library of Munich, Germany.
    51. Young, H. Peyton, 2009. "Learning by trial and error," Games and Economic Behavior, Elsevier, vol. 65(2), pages 626-643, March.
    52. Young, H. Peyton, 2004. "Strategic Learning and its Limits," OUP Catalogue, Oxford University Press, number 9780199269181.
    53. Kyle Hyndman & Erkut Y. Ozbay & Andrew Schotter & Wolf Ze’ev Ehrblatt, 2012. "Convergence: An Experimental Study Of Teaching And Learning In Repeated Games," Journal of the European Economic Association, European Economic Association, vol. 10(3), pages 573-604, May.

    Citations

    Cited by:

    1. Heller, Yuval & Mohlin, Erik, 2019. "Coevolution of deception and preferences: Darwin and Nash meet Machiavelli," Games and Economic Behavior, Elsevier, vol. 113(C), pages 223-247.
    2. Jindani, Sam, 2022. "Learning efficient equilibria in repeated games," Journal of Economic Theory, Elsevier, vol. 205(C).
    3. Ioannis Kordonis & Alexandros C. Charalampidis & George P. Papavassilopoulos, 2018. "Pretending in Dynamic Games, Alternative Outcomes and Application to Electricity Markets," Dynamic Games and Applications, Springer, vol. 8(4), pages 844-873, December.
    4. Burkhard C. Schipper, 2019. "Dynamic Exploitation of Myopic Best Response," Dynamic Games and Applications, Springer, vol. 9(4), pages 1143-1167, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Burkhard Schipper, 2015. "Strategic teaching and learning in games," Working Papers 151, University of California, Davis, Department of Economics.
    2. Sergiu Hart & Yishay Mansour, 2013. "How Long To Equilibrium? The Communication Complexity Of Uncoupled Equilibrium Procedures," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 10, pages 215-249, World Scientific Publishing Co. Pte. Ltd.
    3. Germano, Fabrizio & Lugosi, Gabor, 2007. "Global Nash convergence of Foster and Young's regret testing," Games and Economic Behavior, Elsevier, vol. 60(1), pages 135-154, July.
    4. Burkhard C. Schipper, 2019. "Dynamic Exploitation of Myopic Best Response," Dynamic Games and Applications, Springer, vol. 9(4), pages 1143-1167, December.
    5. Dean P Foster & Peyton Young, 2006. "Regret Testing Leads to Nash Equilibrium," Levine's Working Paper Archive 784828000000000676, David K. Levine.
    6. Tom Johnston & Michael Savery & Alex Scott & Bassel Tarbush, 2023. "Game Connectivity and Adaptive Dynamics," Papers 2309.10609, arXiv.org, revised Nov 2023.
    7. Chernov, G. & Susin, I., 2019. "Models of learning in games: An overview," Journal of the New Economic Association, New Economic Association, vol. 44(4), pages 77-125.
    8. Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
    9. Vivaldo M. Mendes & Diana A. Mendes & Orlando Gomes, 2008. "Learning to Play Nash in Deterministic Uncoupled Dynamics," Working Papers Series 1 ercwp1808, ISCTE-IUL, Business Research Unit (BRU-IUL).
    10. H. Peyton Young, 2007. "The Possible and the Impossible in Multi-Agent Learning," Economics Series Working Papers 304, University of Oxford, Department of Economics.
    11. Norman, Thomas W.L., 2015. "Learning, hypothesis testing, and rational-expectations equilibrium," Games and Economic Behavior, Elsevier, vol. 90(C), pages 93-105.
    12. Sergiu Hart & Yishay Mansour, 2006. "The Communication Complexity of Uncoupled Nash Equilibrium Procedures," Levine's Bibliography 122247000000001299, UCLA Department of Economics.
    13. Rene Saran & Roberto Serrano, 2012. "Regret Matching with Finite Memory," Dynamic Games and Applications, Springer, vol. 2(1), pages 160-175, March.
    14. Ioannis Kordonis & Alexandros C. Charalampidis & George P. Papavassilopoulos, 2018. "Pretending in Dynamic Games, Alternative Outcomes and Application to Electricity Markets," Dynamic Games and Applications, Springer, vol. 8(4), pages 844-873, December.
    15. Schipper, Burkhard C, 2011. "Strategic control of myopic best reply in repeated games," MPRA Paper 30219, University Library of Munich, Germany.
    16. Jindani, Sam, 2022. "Learning efficient equilibria in repeated games," Journal of Economic Theory, Elsevier, vol. 205(C).
    17. Foster, Dean P. & Hart, Sergiu, 2018. "Smooth calibration, leaky forecasts, finite recall, and Nash dynamics," Games and Economic Behavior, Elsevier, vol. 109(C), pages 271-293.
    18. Babichenko, Yakov, 2012. "Completely uncoupled dynamics and Nash equilibria," Games and Economic Behavior, Elsevier, vol. 76(1), pages 1-14.
    19. Eric Friedman & Scott Shenker & Amy Greenwald, 1998. "Learning in Networks Contexts: Experimental Results from Simulations," Departmental Working Papers 199825, Rutgers University, Department of Economics.
    20. Foster, Dean P. & Young, H. Peyton, 2003. "Learning, hypothesis testing, and Nash equilibrium," Games and Economic Behavior, Elsevier, vol. 45(1), pages 73-96, October.

    More about this item

    JEL classification:

    • C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
