Deep Learning to Play Games

Author

Listed:

Daniele Condorelli
Massimiliano Furlan

Abstract

We train two neural networks adversarially to play normal-form games. At each iteration, a row and column network take a new randomly generated game and output individual mixed strategies. The parameters of each network are independently updated via stochastic gradient descent to minimize expected regret given the opponent's strategy. Our simulations demonstrate that the joint behavior of the networks converges to strategies close to Nash equilibria in almost all games. For all $2 \times 2$ and in 80% of $3 \times 3$ games with multiple equilibria, the networks select the risk-dominant equilibrium. Our results show how Nash equilibrium emerges from learning across heterogeneous games.
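The training objective named in the abstract, expected regret given the opponent's strategy, can be illustrated with a small sketch. This is not the authors' code. It assumes regret is measured as the gap between the best pure-action payoff against the opponent's mixed strategy and the expected payoff of the player's own mixed strategy. The function names and the payoff layout (one row per own action, one column per opponent action) are hypothetical choices made for illustration.

```python
from typing import List, Sequence


def expected_payoffs(payoff: Sequence[Sequence[float]],
                     opponent: Sequence[float]) -> List[float]:
    """Expected payoff of each own pure action against the opponent's mix.

    payoff[i][j] is the player's payoff when the player picks action i and
    the opponent picks action j (an assumed layout, not from the paper).
    """
    return [sum(u * q for u, q in zip(row, opponent)) for row in payoff]


def expected_regret(payoff: Sequence[Sequence[float]],
                    own: Sequence[float],
                    opponent: Sequence[float]) -> float:
    """Best pure-action payoff minus the payoff of the mixed strategy `own`."""
    per_action = expected_payoffs(payoff, opponent)
    achieved = sum(p * v for p, v in zip(own, per_action))
    return max(per_action) - achieved


# Matching pennies, written from each player's own perspective.
ROW_PAYOFF = [[1.0, -1.0], [-1.0, 1.0]]   # rows: row player's actions
COL_PAYOFF = [[-1.0, 1.0], [1.0, -1.0]]   # rows: column player's actions

if __name__ == "__main__":
    uniform = [0.5, 0.5]
    first = [1.0, 0.0]
    # At the mixed Nash equilibrium neither player can gain by deviating.
    print(expected_regret(ROW_PAYOFF, uniform, uniform))  # 0.0
    # If both play their first action, the column player regrets it.
    print(expected_regret(COL_PAYOFF, first, first))      # 2.0
```

A strategy profile is a Nash equilibrium exactly when this quantity is zero for both players, which is why driving it toward zero is a natural learning target. In the setting the abstract describes, each network would output `own` from a freshly drawn game and be updated by stochastic gradient descent on this value. The sketch covers only the regret computation, not the networks or the updates.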

Suggested Citation

Daniele Condorelli & Massimiliano Furlan, 2024. "Deep Learning to Play Games," Papers 2409.15197, arXiv.org.

Handle: RePEc:arx:papers:2409.15197

References listed on IDEAS

John C. Harsanyi & Reinhard Selten, 1988. "A General Theory of Equilibrium Selection in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262582384, December.
Jacob K. Goeree & Charles A. Holt, 2001. "Ten Little Treasures of Game Theory and Ten Intuitive Contradictions," American Economic Review, American Economic Association, vol. 91(5), pages 1402-1422, December.
- Jacob K. Goeree & Charles A. Holt, 2000. "Ten Little Treasures of Game Theory and Ten Intuitive Contradictions," Virginia Economics Online Papers 333, University of Virginia, Department of Economics.
- Jacob K Goeree & Charles A Holt, 2004. "Ten Little Treasures of Game Theory and Ten Intuitive Contradictions," Levine's Working Paper Archive 618897000000000900, David K. Levine.
Spiliopoulos, Leonidas, 2012. "Interactive learning in 2×2 normal form games by neural network agents," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(22), pages 5557-5562.
Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
- Drew Fudenberg & David K. Levine, 1998. "Learning in Games," Levine's Working Paper Archive 2222, David K. Levine.
Lensberg, Terje & Schenk-Hoppé, Klaus Reiner, 2021. "Cold play: Learning across bimatrix games," Journal of Economic Behavior & Organization, Elsevier, vol. 185(C), pages 419-441.
- Lensberg, Terje & Schenk-Hoppé, Klaus R., 2020. "Cold play: Learning across bimatrix games," MPRA Paper 99095, University Library of Munich, Germany.
Jehiel, Philippe, 2005. "Analogy-based expectation equilibrium," Journal of Economic Theory, Elsevier, vol. 123(2), pages 81-104, August.
- Philippe Jehiel, 2001. "Analogy-Based Expectation Equilibrium," Economics Working Papers 0003, Institute for Advanced Study, School of Social Science.
- Philippe Jehiel, 2005. "Analogy-Based Expectation Equilibrium," Levine's Bibliography 784828000000000106, UCLA Department of Economics.
- Philippe Jehiel, 2005. "Analogy-based Expectation Equilibrium," Post-Print halshs-00754070, HAL.
Steiner, Jakub & Stewart, Colin, 2008. "Contagion through learning," Theoretical Economics, Econometric Society, vol. 3(4), December.
- Jakub Steiner, 2007. "Contagion through Learning," Edinburgh School of Economics Discussion Paper Series 151, Edinburgh School of Economics, University of Edinburgh.
David Cooper & John H. Kagel, 2003. "Lessons Learned: Generalizing Learning Across Games," American Economic Review, American Economic Association, vol. 93(2), pages 202-207, May.
Robert Aumann & Adam Brandenburger, 2014. "Epistemic Conditions for Nash Equilibrium," World Scientific Book Chapters, in: The Language of Game Theory Putting Epistemics into the Mathematics of Games, chapter 5, pages 113-136, World Scientific Publishing Co. Pte. Ltd..
- Aumann, Robert & Brandenburger, Adam, 1995. "Epistemic Conditions for Nash Equilibrium," Econometrica, Econometric Society, vol. 63(5), pages 1161-1180, September.
Mengel, Friederike, 2012. "Learning across games," Games and Economic Behavior, Elsevier, vol. 74(2), pages 601-619.
- Friederike Mengel, 2007. "Learning Across Games," Working Papers. Serie AD 2007-05, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
Kandori, Michihiro & Mailath, George J & Rob, Rafael, 1993. "Learning, Mutation, and Long Run Equilibria in Games," Econometrica, Econometric Society, vol. 61(1), pages 29-56, January.
- Kandori, M. & Mailath, G.J., 1991. "Learning, Mutation, And Long Run Equilibria In Games," Papers 71, Princeton, Woodrow Wilson School - John M. Olin Program.
- M. Kandori & G. Mailath & R. Rob, 1999. "Learning, Mutation and Long Run Equilibria in Games," Levine's Working Paper Archive 500, David K. Levine.
Samuelson, Larry, 2001. "Analogies, Adaptation, and Anomalies," Journal of Economic Theory, Elsevier, vol. 97(2), pages 320-366, April.
Drew Fudenberg & Annie Liang, 2019. "Predicting and Understanding Initial Play," American Economic Review, American Economic Association, vol. 109(12), pages 4112-4141, December.
- Drew Fudenberg & Annie Liang, 2017. "Predicting and Understanding Initial Play," PIER Working Paper Archive 17-026, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 04 Jan 2018.
- Drew Fudenberg & Annie Liang, 2017. "Predicting and Understanding Initial Play," PIER Working Paper Archive 18-009, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 30 Apr 2018.
Devetag, Giovanna, 2005. "Precedent transfer in coordination games: An experiment," Economics Letters, Elsevier, vol. 89(2), pages 227-232, November.
Grimm, Veronika & Mengel, Friederike, 2009. "Cooperation in viscous populations--Experimental evidence," Games and Economic Behavior, Elsevier, vol. 66(1), pages 202-220, May.
- Friederike Mengel & Veronika Grimm, 2007. "Cooperation In Viscous Populations - Experimental Evidence," Working Papers. Serie AD 2007-17, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
Young, H Peyton, 1993. "The Evolution of Conventions," Econometrica, Econometric Society, vol. 61(1), pages 57-84, January.
Sergiu Hart & Andreu Mas-Colell, 2013. "Uncoupled Dynamics Do Not Lead To Nash Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 7, pages 153-163, World Scientific Publishing Co. Pte. Ltd..
- Sergiu Hart & Andreu Mas-Colell, 2003. "Uncoupled Dynamics Do Not Lead to Nash Equilibrium," American Economic Review, American Economic Association, vol. 93(5), pages 1830-1836, December.
Marchiori, Davide & Di Guida, Sibilla & Polonio, Luca, 2021. "Plasticity of strategic sophistication in interactive decision-making," Journal of Economic Theory, Elsevier, vol. 196(C).
Ignacio Palacios-Huerta, 2003. "Professionals Play Minimax," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 70(2), pages 395-415.
- Ignacio Palacios-Huerta, 2001. "Professionals Play Minimax," Working Papers 2001-17, Brown University, Department of Economics.
Dale O. Stahl, 1999. "Evidence based rules and learning in symmetric normal-form games," International Journal of Game Theory, Springer;Game Theory Society, vol. 28(1), pages 111-130.
Mark Walker & John Wooders, 2001. "Minimax Play at Wimbledon," American Economic Review, American Economic Association, vol. 91(5), pages 1521-1538, December.
LiCalzi Marco, 1995. "Fictitious Play by Cases," Games and Economic Behavior, Elsevier, vol. 11(1), pages 64-89, October.
- M. Li Calzi, 2010. "Fictitious Play By Cases," Levine's Working Paper Archive 407, David K. Levine.
P.-A. Chiappori, 2002. "Testing Mixed-Strategy Equilibria When Players Are Heterogeneous: The Case of Penalty Kicks in Soccer," American Economic Review, American Economic Association, vol. 92(4), pages 1138-1151, September.
Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.
Kohlberg, Elon & Mertens, Jean-Francois, 1986. "On the Strategic Stability of Equilibria," Econometrica, Econometric Society, vol. 54(5), pages 1003-1037, September.
- KOHLBERG, Elon & MERTENS, Jean-François, 1986. "On the strategic stability of equilibria," LIDAM Reprints CORE 716, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
- E. Kohlberg & J.-F. Mertens, 1998. "On the Strategic Stability of Equilibria," Levine's Working Paper Archive 445, David K. Levine.
Sgroi, Daniel & Zizzo, Daniel John, 2009. "Learning to play 3×3 games: Neural networks as bounded-rational players," Journal of Economic Behavior & Organization, Elsevier, vol. 69(1), pages 27-38, January.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Lensberg, Terje & Schenk-Hoppé, Klaus Reiner, 2021. "Cold play: Learning across bimatrix games," Journal of Economic Behavior & Organization, Elsevier, vol. 185(C), pages 419-441.
- Lensberg, Terje & Schenk-Hoppé, Klaus R., 2020. "Cold play: Learning across bimatrix games," MPRA Paper 99095, University Library of Munich, Germany.
Mohlin, Erik, 2012. "Evolution of theories of mind," Games and Economic Behavior, Elsevier, vol. 75(1), pages 299-318.
- Mohlin, Erik, 2010. "Evolution of Theories of Mind," SSE/EFI Working Paper Series in Economics and Finance 0728, Stockholm School of Economics, revised 20 Mar 2012.
Christoph Kuzmics & Daniel Rodenburger, 2020. "A case of evolutionarily stable attainable equilibrium in the laboratory," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 70(3), pages 685-721, October.
Battalio,R. & Samuelson,L. & Huyck,J. van, 1998. "Risk dominance, payoff dominance and probabilistic choice learning," Working papers 2, Wisconsin Madison - Social Systems.
- Raymond Battalio & Larry Samuelson & John Van Huyck, 2010. "Risk Dominance, Payoff Dominance and Probabilistic Choice Learning," Levine's Working Paper Archive 50, David K. Levine.
Mengel, Friederike, 2012. "Learning across games," Games and Economic Behavior, Elsevier, vol. 74(2), pages 601-619.
- Friederike Mengel, 2007. "Learning Across Games," Working Papers. Serie AD 2007-05, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
Grimm, Veronika & Mengel, Friederike, 2012. "An experiment on learning in a multiple games environment," Journal of Economic Theory, Elsevier, vol. 147(6), pages 2220-2259.
- Grimm, V. & Mengel, F., 2009. "An Experiment on Learning in a Multiple Games Environment," Research Memorandum 007, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
Rossella Argenziano & Itzhak Gilboa, 2012. "History as a coordination device," Theory and Decision, Springer, vol. 73(4), pages 501-512, October.
- Gilboa, Itzhak & Argenziano, Rossella, 2006. "History as a Coordination Device," Foerder Institute for Economic Research Working Papers 275700, Tel-Aviv University > Foerder Institute for Economic Research.
- Rossella Argenziano & Itzhak Gilboa, 2012. "History as a coordination device," Post-Print hal-00745596, HAL.
- Argenziano, Rossella & Gilboa, Itzhak, 2010. "History as a Coordination Device," Foerder Institute for Economic Research Working Papers 275753, Tel-Aviv University > Foerder Institute for Economic Research.
Anke Gerber & Thorsten Hens & Bodo Vogt, "undated". "Coordination in a Repeated Stochastic Game with Imperfect Monitoring," IEW - Working Papers 126, Institute for Empirical Research in Economics - University of Zurich.
Gallice, Andrea, 2007. "Best Responding to What? A Behavioral Approach to One Shot Play in 2x2 Games," Discussion Papers in Economics 1365, University of Munich, Department of Economics.
Christoph March, 2011. "Adaptive social learning," Working Papers halshs-00572528, HAL.
- Christoph March, 2016. "Adaptive Social Learning," CESifo Working Paper Series 5783, CESifo.
- Christoph March, 2011. "Adaptive social learning," PSE Working Papers halshs-00572528, HAL.
He, Simin & Wu, Jiabin, 2020. "Compromise and coordination: An experimental study," Games and Economic Behavior, Elsevier, vol. 119(C), pages 216-233.
- He, Simin & Wu, Jiabin, 2018. "Compromise and Coordination: An Experimental Study," MPRA Paper 84713, University Library of Munich, Germany.
Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts: Theory, Applications, and Beyond," Working Papers halshs-03735680, HAL.
- Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts: Theory, Applications, and Beyond," PSE Working Papers halshs-03735680, HAL.
Friederike Mengel & Emanuela Sciubba, 2010. "Extrapolation in Games of Coordination and Dominance Solvable Games," Working Papers 2010.148, Fondazione Eni Enrico Mattei.
- Mengel, Friederike & Sciubba, Emanuela, 2010. "Extrapolation in Games of Coordination and Dominance Solvable Games," Sustainable Development Papers 98475, Fondazione Eni Enrico Mattei (FEEM).
- Mengel, F. & Sciubba, E., 2010. "Extrapolation in games of coordination and dominance solvable games," Research Memorandum 034, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
Marco LiCalzi & Roland Mühlenbernd, 2022. "Feature-weighted categorized play across symmetric games," Experimental Economics, Springer;Economic Science Association, vol. 25(3), pages 1052-1078, June.
Demichelis, Stefano & Ritzberger, Klaus, 2003. "From evolutionary to strategic stability," Journal of Economic Theory, Elsevier, vol. 113(1), pages 51-75, November.
- DEMICHELIS, Stefano & RITZBERGER, Klaus, 2000. "From evolutionary to strategic stability," LIDAM Discussion Papers CORE 2000059, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
Alós-Ferrer, Carlos & Weidenholzer, Simon, 2008. "Contagion and efficiency," Journal of Economic Theory, Elsevier, vol. 143(1), pages 251-274, November.
Spiliopoulos, Leonidas, 2012. "Pattern recognition and subjective belief learning in a repeated constant-sum game," Games and Economic Behavior, Elsevier, vol. 75(2), pages 921-935.
Alos-Ferrer, Carlos & Weidenholzer, Simon, 2007. "Partial bandwagon effects and local interactions," Games and Economic Behavior, Elsevier, vol. 61(2), pages 179-197, November.
Auriol, Emmanuelle & Platteau, Jean-Philippe & Camilotti, Giulia, 2017. "Eradicating Women-Hurting Customs: What Role for Social Engineering?," CEPR Discussion Papers 12107, C.E.P.R. Discussion Papers.
- Jean-Philippe Platteau & Giulia Camilotti & Emmanuelle Auriol, 2017. "Eradicating women-hurting customs: What role for social engineering?," WIDER Working Paper Series wp-2017-145, World Institute for Development Economic Research (UNU-WIDER).
Ge Jiang & Simon Weidenholzer, 2017. "Local interactions under switching costs," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 64(3), pages 571-588, October.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2024-10-28 (Big Data)
NEP-CMP-2024-10-28 (Computational Economics)
NEP-GTH-2024-10-28 (Game Theory)
NEP-NET-2024-10-28 (Network Economics)
