The Emergence of Strategic Reasoning of Large Language Models

My bibliography Save this paper

The Emergence of Strategic Reasoning of Large Language Models

Author

Listed:

Dongwoo Lee
Gavin Kader

Registered:

Abstract

Although large language models (LLMs) have demonstrated strong reasoning abilities in structured tasks (e.g., coding and mathematics), it remains unexplored whether these abilities extend to strategic multi-agent environments. We investigate strategic reasoning capabilities -- the process of choosing an optimal course of action by predicting and adapting to others' actions -- of LLMs by analyzing their performance in three classical games from behavioral economics. We evaluate three standard LLMs (ChatGPT-4, Claude-2.1, Gemini 1.5) and three specialized reasoning LLMs (GPT-o1, Claude-3.5-Sonnet, Gemini Flash Thinking 2.0) using hierarchical models of bounded rationality. Our results show that reasoning LLMs exhibit superior strategic reasoning compared to standard LLMs (which do not demonstrate substantial capabilities), and often match or exceed human performance. Since strategic reasoning is fundamental to future AI systems (including Agentic AI and Artificial General Intelligence), our findings demonstrate the importance of dedicated reasoning capabilities in achieving effective strategic reasoning.

Suggested Citation

Dongwoo Lee & Gavin Kader, 2024. "The Emergence of Strategic Reasoning of Large Language Models," Papers 2412.13013, arXiv.org, revised Feb 2025.

Handle: RePEc:arx:papers:2412.13013

Download full text from publisher

References listed on IDEAS

Stahl, Dale II & Wilson, Paul W., 1994. "Experimental evidence on players' models of other players," Journal of Economic Behavior & Organization, Elsevier, vol. 25(3), pages 309-327, December.
Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
Henning Hermes & Daniel Schunk, 2022. "If you could read my mind–an experimental beauty-contest game with children," Experimental Economics, Springer;Economic Science Association, vol. 25(1), pages 229-253, February.
- Hermes, Henning & Schunk, Daniel, 2019. "If You Could Read My Mind—An Experimental Beauty-Contest Game with Children," Discussion Paper Series in Economics 23/2019, Norwegian School of Economics, Department of Economics.
- Henning Hermes & Daniel Schunk, 2019. "If You Could Read My Mind—An Experimental Beauty-Contest Game with Children," Working Papers 1913, Gutenberg School of Management and Economics, Johannes Gutenberg-Universität Mainz.
Vincent P. Crawford & Miguel A. Costa-Gomes, 2006. "Cognition and Behavior in Two-Person Guessing Games: An Experimental Study," American Economic Review, American Economic Association, vol. 96(5), pages 1737-1768, December.
- Miguel A. Costa-Gomes & Vincent P. Crawford, 2004. "Cognition and Behavior in Two-Person Guessing Games: An Experimental Study," ISER Discussion Paper 0613, Institute of Social and Economic Research, Osaka University.
- Miguel A. Costa-Gomes & Vincent P. Crawford, 2006. "Cognition and Behavior in Two-Person Guessing Games: An Experimental Study," Levine's Bibliography 321307000000000336, UCLA Department of Economics.
- Costa-Gomes, Miguel A. & Crawford, Vincent P., 2004. "Cognition and Behavior in Two-Person Guessing Games: An Experimental Study," University of California at San Diego, Economics Working Paper Series qt449812fx, Department of Economics, UC San Diego.
- Miguel A. Costa-Gomes & Vincent P. Crawford, 2004. "Cognition and Behavior in Two-Person Guessing Games: An Experimental Study," Levine's Bibliography 122247000000000113, UCLA Department of Economics.
- Miguel Costa-Gomes & Vincent P. Crawford, 2004. "Cognition And Behavior In Two-Person Guessing Games: An Experimental Study," Levine's Bibliography 122247000000000143, UCLA Department of Economics.
Grosskopf, Brit & Nagel, Rosemarie, 2008. "The two-person beauty contest," Games and Economic Behavior, Elsevier, vol. 62(1), pages 93-99, January.
Georganas, Sotiris & Healy, Paul J. & Weber, Roberto A., 2015. "On the persistence of strategic sophistication," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 369-400.
- Sotiris Georganas & Paul J. Healy & Roberto A. Weber, 2014. "On the Persistence of Strategic Sophistication," CESifo Working Paper Series 4653, CESifo.
Ho, Teck-Hua & Camerer, Colin & Weigelt, Keith, 1998. "Iterated Dominance and Iterated Best Response in Experimental "p-Beauty Contests."," American Economic Review, American Economic Association, vol. 88(4), pages 947-969, September.
- Ho, Teck Hua & Weigelt, Keith & Camerer, Colin, 1996. "Iterated Dominance and Iterated Best-Response in Experimental P-Beauty Contests," Working Papers 974, California Institute of Technology, Division of the Humanities and Social Sciences.
Taylor Webb & Keith J. Holyoak & Hongjing Lu, 2023. "Emergent analogical reasoning in large language models," Nature Human Behaviour, Nature, vol. 7(9), pages 1526-1541, September.
Ayala Arad & Ariel Rubinstein, 2012. "The 11-20 Money Request Game: A Level-k Reasoning Study," American Economic Review, American Economic Association, vol. 102(7), pages 3561-3573, December.
Nagel, Rosemarie, 1995. "Unraveling in Guessing Games: An Experimental Study," American Economic Review, American Economic Association, vol. 85(5), pages 1313-1326, December.
Vincent P. Crawford & Miguel A. Costa-Gomes & Nagore Iriberri, 2013. "Structural Models of Nonequilibrium Strategic Thinking: Theory, Evidence, and Applications," Journal of Economic Literature, American Economic Association, vol. 51(1), pages 5-62, March.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Wanqun Zhao, 2020. "Cost of Reasoning and Strategic Sophistication," Games, MDPI, vol. 11(3), pages 1-27, September.
Georganas, Sotiris & Healy, Paul J. & Weber, Roberto A., 2015. "On the persistence of strategic sophistication," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 369-400.
- Sotiris Georganas & Paul J. Healy & Roberto A. Weber, 2014. "On the Persistence of Strategic Sophistication," CESifo Working Paper Series 4653, CESifo.
Nagel, Rosemarie & Bühren, Christoph & Frank, Björn, 2017. "Inspired and inspiring: Hervé Moulin and the discovery of the beauty contest game," Mathematical Social Sciences, Elsevier, vol. 90(C), pages 191-207.
- Rosemarie Nagel & Christoph Bühren & Björn Frank, 2016. "Inspired and inspiring: Hervé Moulin and the discovery of the beauty contest game," Economics Working Papers 1539, Department of Economics and Business, Universitat Pompeu Fabra, revised Nov 2016.
Berger, Ulrich & De Silva, Hannelore & Fellner-Röhling, Gerlinde, 2016. "Cognitive hierarchies in the minimizer game," Journal of Economic Behavior & Organization, Elsevier, vol. 130(C), pages 337-348.
- Berger, Ulrich & De Silva, Hannelore & Fellner-Röhling, Gerlinde, 2016. "Cognitive Hierarchies in the Minimizer Game," Department of Economics Working Paper Series 211, WU Vienna University of Economics and Business.
- Ulrich Berger & Hannelore De Silva & Gerlinde Fellner-Röhling, 2016. "Cognitive Hierarchies in the Minimizer Game," Department of Economics Working Papers wuwp211, Vienna University of Economics and Business, Department of Economics.
Lindner, Florian & Sutter, Matthias, 2013. "Level-k reasoning and time pressure in the 11–20 money request game," Economics Letters, Elsevier, vol. 120(3), pages 542-545.
- Florian Lindner & Matthias Sutter, 2013. "Level-k reasoning and time pressure in the 11-20 money request game," Working Papers 2013-13, Faculty of Economics and Statistics, Universität Innsbruck.
- Lindner, Florian & Sutter, Matthias, 2013. "Level-k reasoning and time pressure in the 11-20 money request game," Munich Reprints in Economics 19234, University of Munich, Department of Economics.
Ye Jin, 2021. "Does level-k behavior imply level-k thinking?," Experimental Economics, Springer;Economic Science Association, vol. 24(1), pages 330-353, March.
Alaoui, Larbi & Janezic, Katharina A. & Penta, Antonio, 2020. "Reasoning about others' reasoning," Journal of Economic Theory, Elsevier, vol. 189(C).
- Katharina A. Janezic & Antonio Penta & Larbi Alaoui, 2017. "Reasoning about Others' Reasoning," Working Papers 1003, Barcelona School of Economics.
- Larbi Alaoui & Katharina A. Janezic & Antonio Penta, 2017. "Reasoning about others’ reasoning," Economics Working Papers 1587, Department of Economics and Business, Universitat Pompeu Fabra.
Choo, Lawrence C.Y & Kaplan, Todd R., 2014. "Explaining Behavior in the "11-20" Game," MPRA Paper 52808, University Library of Munich, Germany.
- Lawrence C.Y Choo & Todd R. Kaplan, 2014. "Explaining Behavior in the "11-20” Game," Discussion Papers 1401, University of Exeter, Department of Economics.
Bayer, Ralph C. & Renou, Ludovic, 2016. "Logical omniscience at the laboratory," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 64(C), pages 41-49.
Hanaki, Nobuyuki & Koriyama, Yukio & Sutan, Angela & Willinger, Marc, 2019. "The strategic environment effect in beauty contest games," Games and Economic Behavior, Elsevier, vol. 113(C), pages 587-610.
- Nobuyuki Hanaki & Angela Sutan & Marc Willinger, 2016. "The strategic environment effect in beauty contest games," Working Papers halshs-01294915, HAL.
- Nobuyuki Hanaki & Yukio Koriyama & Angela Sutan & Marc Willinger, 2019. "The strategic environment effect in beauty contest games," Post-Print halshs-01929113, HAL.
- Nobuyuki Hanaki & Angela Sutan & Marc Willinger, 2016. "The Strategic Environment Effect in Beauty Contest Games," GREDEG Working Papers 2016-05, Groupe de REcherche en Droit, Economie, Gestion (GREDEG CNRS), Université Côte d'Azur, France.
- Nobuyuki Hanaki & Yukio Koriyama & Angela Sutan & Marc Willinger, 2018. "The strategic environment effect in beauty contest games," Working Papers hal-01954922, HAL.
- Nobuyuki Hanaki & Angela Sutan & Marc Willnger, 2016. "The strategic environment effect in beauty contest games," Working Papers 03-16, LAMETA, Universtiy of Montpellier.
- Nobuyuki Hanaki & Yukio Koriyama & Angela Sutan & Marc Willinger, 2018. "The strategic environment effect in beauty contest games," CEE-M Working Papers hal-01954922, CEE-M, Universtiy of Montpellier, CNRS, INRA, Montpellier SupAgro.
Larbi Alaoui & Antonio Penta, 2016. "Endogenous Depth of Reasoning," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 83(4), pages 1297-1333.
- Larbi Alaoui & Antonio Penta, 2012. "Endogenous depth of reasoning," Economics Working Papers 1332, Department of Economics and Business, Universitat Pompeu Fabra, revised Mar 2014.
- Antonio Penta & Larbi Alaoui, 2015. "Endogenous Depth of Reasoning," Working Papers 653, Barcelona School of Economics.
Bayer, R.-C. & Renou, Ludovic, 2016. "Logical abilities and behavior in strategic-form games," Journal of Economic Psychology, Elsevier, vol. 56(C), pages 39-59.
Carlos Alós-Ferrer & Johannes Buckenmaier, 2021. "Cognitive sophistication and deliberation times," Experimental Economics, Springer;Economic Science Association, vol. 24(2), pages 558-592, June.
- Carlos Alós-Ferrer & Johannes Buckenmaier, 2018. "Cognitive sophistication and deliberation times," ECON - Working Papers 292, Department of Economics - University of Zurich, revised Apr 2019.
King King Li & Kang Rong, 2024. "A two-step guessing game," Theory and Decision, Springer, vol. 97(1), pages 89-108, August.
Mauersberger, Felix & Nagel, Rosemarie & Bühren, Christoph, 2020. "Bounded rationality in Keynesian beauty contests: A lesson for central bankers?," Economics - The Open-Access, Open-Assessment E-Journal (2007-2020), Kiel Institute for the World Economy (IfW Kiel), vol. 14, pages 1-38.
- Mauersberger, Felix & Nagel, Rosemarie & Bühren, Christoph, 2019. "Bounded rationality in Keynesian beauty contests: A lesson for central bankers?," Economics Discussion Papers 2019-53, Kiel Institute for the World Economy (IfW Kiel).
Allred, Sarah & Duffy, Sean & Smith, John, 2016. "Cognitive load and strategic sophistication," Journal of Economic Behavior & Organization, Elsevier, vol. 125(C), pages 162-178.
- Allred, Sarah & Duffy, Sean & Smith, John, 2013. "Cognitive Load and Strategic Sophistication," MPRA Paper 47997, University Library of Munich, Germany.
- Allred, Sarah & Duffy, Sean & Smith, John, 2014. "Cognitive load and strategic sophistication," MPRA Paper 59441, University Library of Munich, Germany.
María Cubel & Santiago Sanchez-Pages, 2014. "Gender differences and stereotypes in the beauty contest," Working Papers 2014/13, Institut d'Economia de Barcelona (IEB).
Dimitris Batzilis & Sonia Jaffe & Steven Levitt & John A. List & Jeffrey Picel, 2019. "Behavior in Strategic Settings: Evidence from a Million Rock-Paper-Scissors Games," Games, MDPI, vol. 10(2), pages 1-34, April.
Teck-Hua Ho & So-Eun Park & Xuanming Su, 2021. "A Bayesian Level- k Model in n -Person Games," Management Science, INFORMS, vol. 67(3), pages 1622-1638, March.
Choo, Lawrence & Kaplan, Todd R. & Zhou, Xiaoyu, 2019. "Can auctions select people by their level-k types?," MPRA Paper 95987, University Library of Munich, Germany.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-AIN-2025-01-27 (Artificial Intelligence)
NEP-BIG-2025-01-27 (Big Data)
NEP-CMP-2025-01-27 (Computational Economics)
NEP-EVO-2025-01-27 (Evolutionary Economics)
NEP-NEU-2025-01-27 (Neuroeconomics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2412.13013. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

The Emergence of Strategic Reasoning of Large Language Models

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data