Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria

My bibliography Save this article

Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria

Author

Listed:

Christoph Graf
(New York University
Stanford University)
Viktor Zobernig
(University of Natural Resources and Life Sciences)
Johannes Schmidt
(University of Natural Resources and Life Sciences)
Claude Klöckl
(University of Natural Resources and Life Sciences)

Registered:

Abstract

We test the performance of deep deterministic policy gradient—a deep reinforcement learning algorithm, able to handle continuous state and action spaces—to find Nash equilibria in a setting where firms compete in offer prices through a uniform price auction. These algorithms are typically considered “model-free” although a large set of parameters is utilized by the algorithm. These parameters may include learning rates, memory buffers, state space dimensioning, normalizations, or noise decay rates, and the purpose of this work is to systematically test the effect of these parameter configurations on convergence to the analytically derived Bertrand equilibrium. We find parameter choices that can reach convergence rates of up to 99%. We show that the algorithm also converges in more complex settings with multiple players and different cost structures. Its reliable convergence may make the method a useful tool to studying strategic behavior of firms even in more complex settings.

Suggested Citation

Christoph Graf & Viktor Zobernig & Johannes Schmidt & Claude Klöckl, 2024. "Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria," Computational Economics, Springer;Society for Computational Economics, vol. 63(2), pages 529-576, February.

Handle: RePEc:kap:compec:v:63:y:2024:i:2:d:10.1007_s10614-022-10351-6
DOI: 10.1007/s10614-022-10351-6

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Emmanuel Guerre & Isabelle Perrigne & Quang Vuong, 2000. "Optimal Nonparametric Estimation of First-Price Auctions," Econometrica, Econometric Society, vol. 68(3), pages 525-574, May.
Johann Lussange & Ivan Lazarevich & Sacha Bourgeois-Gironde & Stefano Palminteri & Boris Gutkin, 2021. "Modelling Stock Markets by Multi-agent Reinforcement Learning," Computational Economics, Springer;Society for Computational Economics, vol. 57(1), pages 113-147, January.
Viossat, Yannick & Zapechelnyuk, Andriy, 2013. "No-regret dynamics and fictitious play," Journal of Economic Theory, Elsevier, vol. 148(2), pages 825-842.
- Yannick Viossat & Andriy Zapechelnyuk, 2013. "No-regret Dynamics and Fictitious Play," Post-Print hal-00713871, HAL.
Noe, Thomas H. & Rebello, Michael & Wang, Jun, 2012. "Learning to bid: The design of auctions under uncertainty and adaptation," Games and Economic Behavior, Elsevier, vol. 74(2), pages 620-636.
Christopher Boyer & B. Brorsen, 2014. "Implications of a Reserve Price in an Agent-Based Common-Value Auction," Computational Economics, Springer;Society for Computational Economics, vol. 43(1), pages 33-51, January.
Drew Fudenberg & Eric Maskin, 2008. "The Folk Theorem In Repeated Games With Discounting Or With Incomplete Information," World Scientific Book Chapters, in: Drew Fudenberg & David K Levine (ed.), A Long-Run Collaboration On Long-Run Games, chapter 11, pages 209-230, World Scientific Publishing Co. Pte. Ltd..
- Fudenberg, Drew & Maskin, Eric, 1986. "The Folk Theorem in Repeated Games with Discounting or with Incomplete Information," Econometrica, Econometric Society, vol. 54(3), pages 533-554, May.
Harrison, Glenn W, 1989. "Theory and Misbehavior of First-Price Auctions," American Economic Review, American Economic Association, vol. 79(4), pages 749-762, September.
- Glenn W. Harrison, 1987. "Theory and Misbehavior of First-Price Auctions," University of Western Ontario, Departmental Research Report Series 8710, University of Western Ontario, Department of Economics.
Emilio Calvano & Giacomo Calzolari & Vincenzo Denicolò & Sergio Pastorello, 2020. "Artificial Intelligence, Algorithmic Pricing, and Collusion," American Economic Review, American Economic Association, vol. 110(10), pages 3267-3297, October.
- Calzolari, Giacomo & Calvano, Emilio & Denicolo, Vincenzo & Pastorello, Sergio, 2018. "Artificial intelligence, algorithmic pricing and collusion," CEPR Discussion Papers 13405, C.E.P.R. Discussion Papers.
Aliabadi, Danial Esmaeili & Kaya, Murat & Şahin, Güvenç, 2017. "An agent-based simulation of power generation company behavior in electricity markets under different market-clearing mechanisms," Energy Policy, Elsevier, vol. 100(C), pages 191-205.
Jian Yao & Ilan Adler & Shmuel S. Oren, 2008. "Modeling and Computing Two-Settlement Oligopolistic Equilibrium in a Congested Electricity Network," Operations Research, INFORMS, vol. 56(1), pages 34-47, February.
Mar Reguant, 2014. "Complementary Bidding Mechanisms and Startup Costs in Electricity Markets," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(4), pages 1708-1742.
- Mar Reguant, 2014. "Complementary Bidding Mechanisms and Startup Costs in Electricity Markets," CESifo Working Paper Series 4811, CESifo.
Andreoni James & Miller John H., 1995. "Auctions with Artificial Adaptive Agents," Games and Economic Behavior, Elsevier, vol. 10(1), pages 39-64, July.
Julian Schrittwieser & Ioannis Antonoglou & Thomas Hubert & Karen Simonyan & Laurent Sifre & Simon Schmitt & Arthur Guez & Edward Lockhart & Demis Hassabis & Thore Graepel & Timothy Lillicrap & David , 2020. "Mastering Atari, Go, chess and shogi by planning with a learned model," Nature, Nature, vol. 588(7839), pages 604-609, December.
Koichiro Ito & Mar Reguant, 2016. "Sequential Markets, Market Power, and Arbitrage," American Economic Review, American Economic Association, vol. 106(7), pages 1921-1957, July.
- Koichiro Ito & Mar Reguant, 2014. "Sequential Markets, Market Power and Arbitrage," NBER Working Papers 20782, National Bureau of Economic Research, Inc.
- Koichiro ITO & Mar REGUANT, 2015. "Sequential Markets, Market Power and Arbitrage," Discussion papers 15015, Research Institute of Economy, Trade and Industry (RIETI).
Hommes, Cars H., 2006. "Heterogeneous Agent Models in Economics and Finance," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 23, pages 1109-1186, Elsevier.
- Cars H. Hommes, 2005. "Heterogeneous Agent Models in Economics and Finance," Tinbergen Institute Discussion Papers 05-056/1, Tinbergen Institute.
Elodie Guerre & I. Perrigne & Q.H. Vuong, 2000. "Optimal nonparametric estimation of first-price auctions [[Estimation nonparamétrique optimale des enchères au premier prix]]," Post-Print hal-02697497, HAL.
Justin Sirignano & Rama Cont, 2019. "Universal features of price formation in financial markets: perspectives from deep learning," Quantitative Finance, Taylor & Francis Journals, vol. 19(9), pages 1449-1459, September.
Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
Jakub Kastl, 2011. "Discrete Bids and Empirical Inference in Divisible Good Auctions," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 78(3), pages 974-1014.
Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
El Hadi Caoui, 2022. "A Study of Umbrella Damages from Bid Rigging," Journal of Law and Economics, University of Chicago Press, vol. 65(2), pages 239-277.
Foster, Dean P. & Vohra, Rakesh V., 1997. "Calibrated Learning and Correlated Equilibrium," Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 40-55, October.
David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
Viehmann, Johannes & Lorenczik, Stefan & Malischek, Raimund, 2021. "Multi-unit multiple bid auctions in balancing markets: An agent-based Q-learning approach," Energy Economics, Elsevier, vol. 93(C).

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Christoph Graf & Viktor Zobernig & Johannes Schmidt & Claude Klockl, 2021. "Computational Performance of Deep Reinforcement Learning to find Nash Equilibria," Papers 2104.12895, arXiv.org.
Sang Won Kim & Marcelo Olivares & Gabriel Y. Weintraub, 2014. "Measuring the Performance of Large-Scale Combinatorial Auctions: A Structural Estimation Approach," Management Science, INFORMS, vol. 60(5), pages 1180-1201, May.
Esmaeili Aliabadi, Danial & Chan, Katrina, 2022. "The emerging threat of artificial intelligence on competition in liberalized electricity markets: A deep Q-network approach," Applied Energy, Elsevier, vol. 325(C).
Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.
Huang, Ruchen & He, Hongwen & Gao, Miaojue, 2023. "Training-efficient and cost-optimal energy management for fuel cell hybrid electric bus based on a novel distributed deep reinforcement learning framework," Applied Energy, Elsevier, vol. 346(C).
Grundl, Serafin & Zhu, Yu, 2023. "Robust inference in first-price auctions: Overbidding as an identifying restriction," Journal of Econometrics, Elsevier, vol. 235(2), pages 484-506.
Lamy, Laurent & Patnam, Manasa & Visser, Michael, 2016. "Correcting for Sample Selection From Competitive Bidding, with an Application to Estimating the Effect of Wages on Performance," CEPR Discussion Papers 11376, C.E.P.R. Discussion Papers.
- Laurent Lamy & Manasa Patnam & Michael Visser, 2017. "Correcting for Sample Selection From Competitive Bidding, with an Application to Estimating the Effect of Wages on Performance," Post-Print hal-01688267, HAL.
Quang Vuong & Ayse Pehlivan, 2015. "Supply Function Competition and Exporters: Nonparametric Identification and Estimation of Productivity Distributions and Marginal Costs," 2015 Meeting Papers 1414, Society for Economic Dynamics.
Hunt Allcott, 2012. "The Smart Grid, Entry, and Imperfect Competition in Electricity Markets," NBER Working Papers 18071, National Bureau of Economic Research, Inc.
Jason Allen & Jakub Kastl & Milena Wittwer, 2020. "Primary Dealers and the Demand for Government Debt," Working Papers 2020-27, Princeton University. Economics Department..
Ali Hortaçsu & Jakub Kastl & Allen Zhang, 2018. "Bid Shading and Bidder Surplus in the US Treasury Auction System," American Economic Review, American Economic Association, vol. 108(1), pages 147-169, January.
- Ali Hortaçsu & Jakub Kastl & Allen Zhang, 2017. "Bid Shading and Bidder Surplus in the U.S. Treasury Auction System," NBER Working Papers 24024, National Bureau of Economic Research, Inc.
Jason Allen & Jakub Kastl & Milena Wittwer, 2020. "Maturity Composition and the Demand for Government Debt," Staff Working Papers 20-29, Bank of Canada.
- Jason Allen & Jakub Kastl & Milena Wittwer, 2022. "Maturity Composition and the Demand for Government Debt," Working Papers 2022-12, Princeton University. Economics Department..
Pesendorfer, Martin & Cantillon, Estelle, 2007. "Combination Bidding in Multi-Unit Auctions," CEPR Discussion Papers 6083, C.E.P.R. Discussion Papers.
- Cantillon, Estelle & Pesendorfer, Martin, 2013. "Combination bidding in multi-unit auctions," LSE Research Online Documents on Economics 54289, London School of Economics and Political Science, LSE Library.
Philip A Haile & Yuichi Kitamura, 2019. "Unobserved heterogeneity in auctions," The Econometrics Journal, Royal Economic Society, vol. 22(1), pages 1-19.
- Philip A. Haile & Yuichi Kitamura, 2018. "Unobserved Heterogeneity in Auctions," Cowles Foundation Discussion Papers 2141, Cowles Foundation for Research in Economics, Yale University.
Gabrielli, M. Florencia & Willington, Manuel, 2023. "Estimating damages from bidding rings in first-price auctions," Economic Modelling, Elsevier, vol. 126(C).
Weifan Long & Taixian Hou & Xiaoyi Wei & Shichao Yan & Peng Zhai & Lihua Zhang, 2023. "A Survey on Population-Based Deep Reinforcement Learning," Mathematics, MDPI, vol. 11(10), pages 1-17, May.
Patrick Bajari & Ali Hortacsu, 2005. "Are Structural Estimates of Auction Models Reasonable? Evidence from Experimental Data," Journal of Political Economy, University of Chicago Press, vol. 113(4), pages 703-741, August.
- Patrick Bajari & Ali Hortacsu, 2003. "Are Structural Estimates of Auction Models Reasonable? Evidence from Experimental Data," NBER Working Papers 9889, National Bureau of Economic Research, Inc.
- Patrick Bajari & Ali Hortacsu, 2003. "Are Structural Estimates of Auction Models Reasonable? Evidence from Experimental Data," Working Papers 03002, Stanford University, Department of Economics.
Maria Chiara D?Errico, 2020. "Competition in the Italian electricity market: The unforeseen social welfare losses of reform," ECONOMICS AND POLICY OF ENERGY AND THE ENVIRONMENT, FrancoAngeli Editore, vol. 2020(2), pages 75-91.
Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
Jason Allen & Ali Hortaçsu & Eric Richert & Milena Wittwer, 2024. "Entry and Exit in Treasury Auctions," Staff Working Papers 24-29, Bank of Canada.

More about this item

Keywords

Bertrand equilibrium; Competition in uniform price auctions; Deep deterministic policy gradient algorithm; DDPG; Parameter sensitivity analysis;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:compec:v:63:y:2024:i:2:d:10.1007_s10614-022-10351-6. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data