IDEAS home Printed from https://ideas.repec.org/p/cda/wpaper/232.html
   My bibliography  Save this paper

Strategic Teaching and Learning in Games

Author

Listed:
  • Burkhard Schipper

    (Department of Economics, University of California Davis)

Abstract

It is known that there are uncoupled learning heuristics leading to Nash equilibrium in all finite games. Why should players use such learning heuristics and where could they come from? We show that there is no uncoupled learning heuristic leading to Nash equilibrium in all finite games that a player has an incentive to adopt, that would be evolutionary stable or that could "learn itself". Rather, a player has an incentive to strategically teach such a learning opponent in order to secure at least the Stackelberg leader payoff. The impossibility result remains intact when restricted to the classes of generic games, two-player games, potential games, games with strategic complements or 2 x 2 games, in which learning is known to be "nice". More generally, it also applies to uncoupled learning heuristics leading to correlated equilibria, rationalizable outcomes, iterated admissible outcomes, or minimal curb sets. A possibility result restricted to "strategically trivial" games fails if some generic games outside this class are considered as well.

Suggested Citation

  • Burkhard Schipper, 2017. "Strategic Teaching and Learning in Games," Working Papers 232, University of California, Davis, Department of Economics.
  • Handle: RePEc:cda:wpaper:232
    as

    Download full text from publisher

    File URL: https://repec.dss.ucdavis.edu/files/ySkEqXRffwPF9EynqWr1JUdM/17-2.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Moulin, Herve, 1979. "Dominance Solvable Voting Schemes," Econometrica, Econometric Society, vol. 47(6), pages 1137-1151, November.
    2. Peter Duersch & Albert Kolb & Jörg Oechssler & Burkhard Schipper, 2010. "Rage against the machines: how subjects play against learning algorithms," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 43(3), pages 407-430, June.
    3. Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
    4. Germano, Fabrizio & Lugosi, Gabor, 2007. "Global Nash convergence of Foster and Young's regret testing," Games and Economic Behavior, Elsevier, vol. 60(1), pages 135-154, July.
    5. Terracol, Antoine & Vaksmann, Jonathan, 2009. "Dumbing down rational players: Learning and teaching in an experimental game," Journal of Economic Behavior & Organization, Elsevier, vol. 70(1-2), pages 54-71, May.
    6. Peter Duersch & Jörg Oechssler & Burkhard Schipper, 2014. "When is tit-for-tat unbeatable?," International Journal of Game Theory, Springer;Game Theory Society, vol. 43(1), pages 25-36, February.
    7. Schipper, Burkhard C, 2011. "Strategic control of myopic best reply in repeated games," MPRA Paper 30219, University Library of Munich, Germany.
    8. Kalai, Ehud & Lehrer, Ehud, 1993. "Rational Learning Leads to Nash Equilibrium," Econometrica, Econometric Society, vol. 61(5), pages 1019-1045, September.
    9. Pearce, David G, 1984. "Rationalizable Strategic Behavior and the Problem of Perfection," Econometrica, Econometric Society, vol. 52(4), pages 1029-1050, July.
    10. Foster, Dean P. & Vohra, Rakesh V., 1997. "Calibrated Learning and Correlated Equilibrium," Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 40-55, October.
    11. Sergiu Hart & Andreu Mas-Colell, 2013. "A General Class Of Adaptive Strategies," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 3, pages 47-76, World Scientific Publishing Co. Pte. Ltd..
    12. Jordan, J. S., 1991. "Bayesian learning in normal form games," Games and Economic Behavior, Elsevier, vol. 3(1), pages 60-81, February.
    13. John H. Nachbar, 1997. "Prediction, Optimization, and Learning in Repeated Games," Econometrica, Econometric Society, vol. 65(2), pages 275-310, March.
    14. Dean Foster & H Peyton Young, 1999. "On the Impossibility of Predicting the Behavior of Rational Agents," Economics Working Paper Archive 423, The Johns Hopkins University,Department of Economics, revised Jun 2001.
    15. Lipman, Barton L, 1991. "How to Decide How to Decide How to. . . : Modeling Limited Rationality," Econometrica, Econometric Society, vol. 59(4), pages 1105-1125, July.
    16. Israeli, Eitan, 1999. "Sowing Doubt Optimally in Two-Person Repeated Games," Games and Economic Behavior, Elsevier, vol. 28(2), pages 203-216, August.
    17. Basu, Kaushik & Weibull, Jorgen W., 1991. "Strategy subsets closed under rational behavior," Economics Letters, Elsevier, vol. 36(2), pages 141-146, June.
    18. Drew Fudenberg & David K. Levine, 2008. "Reputation And Equilibrium Selection In Games With A Patient Player," World Scientific Book Chapters, in: Drew Fudenberg & David K Levine (ed.), A Long-Run Collaboration On Long-Run Games, chapter 7, pages 123-142, World Scientific Publishing Co. Pte. Ltd..
    19. Sergiu Hart & Andreu Mas-Colell, 2013. "Stochastic Uncoupled Dynamics And Nash Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 8, pages 165-189, World Scientific Publishing Co. Pte. Ltd..
    20. Bernheim, B Douglas, 1984. "Rationalizable Strategic Behavior," Econometrica, Econometric Society, vol. 52(4), pages 1007-1028, July.
    21. Foster, Dean P. & Young, H. Peyton, 2003. "Learning, hypothesis testing, and Nash equilibrium," Games and Economic Behavior, Elsevier, vol. 45(1), pages 73-96, October.
    22. Aumann, Robert J. & Sorin, Sylvain, 1989. "Cooperation and bounded recall," Games and Economic Behavior, Elsevier, vol. 1(1), pages 5-39, March.
    23. Fudenberg, Drew & Maskin, Eric, 1990. "Evolution and Cooperation in Noisy Repeated Games," American Economic Review, American Economic Association, vol. 80(2), pages 274-279, May.
    24. Cripps, Martin W. & Schmidt, Klaus M. & Thomas, Jonathan P., 1996. "Reputation in Perturbed Repeated Games," Journal of Economic Theory, Elsevier, vol. 69(2), pages 387-410, May.
    25. Chong, Juin-Kuan & Camerer, Colin F. & Ho, Teck H., 2006. "A learning-based model of repeated games with incomplete information," Games and Economic Behavior, Elsevier, vol. 55(2), pages 340-371, May.
    26. Sergiu Hart & Andreu Mas-Colell, 2013. "A Simple Adaptive Procedure Leading To Correlated Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 2, pages 17-46, World Scientific Publishing Co. Pte. Ltd..
    27. Fudenberg, Drew & Levine, David K., 1999. "Conditional Universal Consistency," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 104-130, October.
    28. Dürsch, Peter & Kolb, Albert & Oechssler, Jörg & Schipper, Burkhard C., 2005. "Rage Against the Machines: How Subjects Learn to Play Against Computers," Bonn Econ Discussion Papers 31/2005, University of Bonn, Bonn Graduate School of Economics (BGSE).
    29. Cripps, Martin W & Thomas, Jonathan P, 1995. "Reputation and Commitment in Two-Person Repeated Games without Discounting," Econometrica, Econometric Society, vol. 63(6), pages 1401-1419, November.
    30. Fabrizio Germano, 2007. "Stochastic Evolution of Rules for Playing Finite Normal Form Games," Theory and Decision, Springer, vol. 62(4), pages 311-333, May.
    31. Martin W. Cripps & Jonathan P. Thomas, 2003. "Some Asymptotic Results in Discounted Repeated Games of One-Sided Incomplete Information," Mathematics of Operations Research, INFORMS, vol. 28(3), pages 433-462, August.
    32. Shalev Jonathan, 1994. "Nonzero-Sum Two-Person Repeated Games with Incomplete Information and Known-Own Payoffs," Games and Economic Behavior, Elsevier, vol. 7(2), pages 246-259, September.
    33. John H. Nachbar, 2005. "Beliefs in Repeated Games," Econometrica, Econometric Society, vol. 73(2), pages 459-480, March.
    34. Schipper, Burkhard C, 2011. "Strategic control of myopic best reply in repeated games," MPRA Paper 30219, University Library of Munich, Germany.
    35. Aumann, Robert J., 1974. "Subjectivity and correlation in randomized strategies," Journal of Mathematical Economics, Elsevier, vol. 1(1), pages 67-96, March.
    36. Young, H. Peyton, 2004. "Strategic Learning and its Limits," OUP Catalogue, Oxford University Press, number 9780199269181.
    37. Sergiu Hart & Andreu Mas-Colell, 2013. "Uncoupled Dynamics Do Not Lead To Nash Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 7, pages 153-163, World Scientific Publishing Co. Pte. Ltd..
    38. Nikolaus Robalino & Arthur Robson, 2016. "The Evolution of Strategic Sophistication," American Economic Review, American Economic Association, vol. 106(4), pages 1046-1072, April.
    39. Milgrom, Paul & Roberts, John, 1990. "Rationalizability, Learning, and Equilibrium in Games with Strategic Complementarities," Econometrica, Econometric Society, vol. 58(6), pages 1255-1277, November.
    40. , P. & , Peyton, 2006. "Regret testing: learning to play Nash equilibrium without knowing you have an opponent," Theoretical Economics, Econometric Society, vol. 1(3), pages 341-367, September.
    41. Yakov Babichenko, 2010. "Uncoupled automata and pure Nash equilibria," International Journal of Game Theory, Springer;Game Theory Society, vol. 39(3), pages 483-502, July.
    42. Binmore, Kenneth G. & Samuelson, Larry, 1992. "Evolutionary stability in repeated games played by finite automata," Journal of Economic Theory, Elsevier, vol. 57(2), pages 278-305, August.
    43. Ellison, Glenn, 1997. "Learning from Personal Experience: One Rational Guy and the Justification of Myopia," Games and Economic Behavior, Elsevier, vol. 19(2), pages 180-210, May.
    44. Sorin, Sylvain, 1999. "Merging, Reputation, and Repeated Games with Incomplete Information," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 274-308, October.
    45. Fernando Vega-Redondo, 1997. "The Evolution of Walrasian Behavior," Econometrica, Econometric Society, vol. 65(2), pages 375-384, March.
    46. Mailath, George J. & Samuelson, Larry, 2006. "Repeated Games and Reputations: Long-Run Relationships," OUP Catalogue, Oxford University Press, number 9780195300796.
    47. John H. Nachbar, 2001. "Bayesian learning in repeated games of incomplete information," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 18(2), pages 303-326.
    48. Schipper, Burkhard C., 2009. "Imitators and optimizers in Cournot oligopoly," Journal of Economic Dynamics and Control, Elsevier, vol. 33(12), pages 1981-1990, December.
    49. Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
    50. Kim, Yong-Gwan, 1994. "Evolutionarily stable strategies in the repeated prisoner's dilemma," Mathematical Social Sciences, Elsevier, vol. 28(3), pages 167-197, December.
    51. Camerer, Colin F. & Ho, Teck-Hua & Chong, Juin-Kuan, 2002. "Sophisticated Experience-Weighted Attraction Learning and Strategic Teaching in Repeated Games," Journal of Economic Theory, Elsevier, vol. 104(1), pages 137-188, May.
    52. Young, H. Peyton, 2009. "Learning by trial and error," Games and Economic Behavior, Elsevier, vol. 65(2), pages 626-643, March.
    53. Kyle Hyndman & Erkut Y. Ozbay & Andrew Schotter & Wolf Ze’ev Ehrblatt, 2012. "Convergence: An Experimental Study Of Teaching And Learning In Repeated Games," Journal of the European Economic Association, European Economic Association, vol. 10(3), pages 573-604, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Heller, Yuval & Mohlin, Erik, 2019. "Coevolution of deception and preferences: Darwin and Nash meet Machiavelli," Games and Economic Behavior, Elsevier, vol. 113(C), pages 223-247.
    2. Burkhard C. Schipper, 2019. "Dynamic Exploitation of Myopic Best Response," Dynamic Games and Applications, Springer, vol. 9(4), pages 1143-1167, December.
    3. Ioannis Kordonis & Alexandros C. Charalampidis & George P. Papavassilopoulos, 2018. "Pretending in Dynamic Games, Alternative Outcomes and Application to Electricity Markets," Dynamic Games and Applications, Springer, vol. 8(4), pages 844-873, December.
    4. Jindani, Sam, 2022. "Learning efficient equilibria in repeated games," Journal of Economic Theory, Elsevier, vol. 205(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Burkhard Schipper, 2015. "Strategic teaching and learning in games," Working Papers 151, University of California, Davis, Department of Economics.
    2. Sergiu Hart & Yishay Mansour, 2013. "How Long To Equilibrium? The Communication Complexity Of Uncoupled Equilibrium Procedures," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 10, pages 215-249, World Scientific Publishing Co. Pte. Ltd..
    3. Germano, Fabrizio & Lugosi, Gabor, 2007. "Global Nash convergence of Foster and Young's regret testing," Games and Economic Behavior, Elsevier, vol. 60(1), pages 135-154, July.
    4. Burkhard C. Schipper, 2019. "Dynamic Exploitation of Myopic Best Response," Dynamic Games and Applications, Springer, vol. 9(4), pages 1143-1167, December.
    5. Dean P Foster & Peyton Young, 2006. "Regret Testing Leads to Nash Equilibrium," Levine's Working Paper Archive 784828000000000676, David K. Levine.
    6. Tom Johnston & Michael Savery & Alex Scott & Bassel Tarbush, 2023. "Game Connectivity and Adaptive Dynamics," Papers 2309.10609, arXiv.org, revised Oct 2024.
    7. Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
    8. Vivaldo M. Mendes & Diana A. Mendes & Orlando Gomes, 2008. "Learning to Play Nash in Deterministic Uncoupled Dynamics," Working Papers Series 1 ercwp1808, ISCTE-IUL, Business Research Unit (BRU-IUL).
    9. Chernov, G. & Susin, I., 2019. "Models of learning in games: An overview," Journal of the New Economic Association, New Economic Association, vol. 44(4), pages 77-125.
    10. H. Peyton Young, 2007. "The Possible and the Impossible in Multi-Agent Learning," Economics Series Working Papers 304, University of Oxford, Department of Economics.
    11. Ioannis Kordonis & Alexandros C. Charalampidis & George P. Papavassilopoulos, 2018. "Pretending in Dynamic Games, Alternative Outcomes and Application to Electricity Markets," Dynamic Games and Applications, Springer, vol. 8(4), pages 844-873, December.
    12. Rene Saran & Roberto Serrano, 2012. "Regret Matching with Finite Memory," Dynamic Games and Applications, Springer, vol. 2(1), pages 160-175, March.
    13. Schipper, Burkhard C, 2011. "Strategic control of myopic best reply in repeated games," MPRA Paper 30219, University Library of Munich, Germany.
    14. Jindani, Sam, 2022. "Learning efficient equilibria in repeated games," Journal of Economic Theory, Elsevier, vol. 205(C).
    15. Babichenko, Yakov, 2012. "Completely uncoupled dynamics and Nash equilibria," Games and Economic Behavior, Elsevier, vol. 76(1), pages 1-14.
    16. Eric Friedman & Scott Shenker & Amy Greenwald, 1998. "Learning in Networks Contexts: Experimental Results from Simulations," Departmental Working Papers 199825, Rutgers University, Department of Economics.
    17. Foster, Dean P. & Hart, Sergiu, 2018. "Smooth calibration, leaky forecasts, finite recall, and Nash dynamics," Games and Economic Behavior, Elsevier, vol. 109(C), pages 271-293.
    18. Norman, Thomas W.L., 2015. "Learning, hypothesis testing, and rational-expectations equilibrium," Games and Economic Behavior, Elsevier, vol. 90(C), pages 93-105.
    19. Sergiu Hart & Yishay Mansour, 2006. "The Communication Complexity of Uncoupled Nash Equilibrium Procedures," Levine's Bibliography 122247000000001299, UCLA Department of Economics.
    20. Peter Duersch & Albert Kolb & Jörg Oechssler & Burkhard Schipper, 2010. "Rage against the machines: how subjects play against learning algorithms," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 43(3), pages 407-430, June.

    More about this item

    Keywords

    Learning in games; Interactive learning; Higher-order learning;
    All these keywords.

    JEL classification:

    • C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games
    • C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cda:wpaper:232. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Letters and Science IT Services Unit (email available below). General contact details of provider: https://edirc.repec.org/data/educdus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.