IDEAS home Printed from https://ideas.repec.org/a/kap/compec/v63y2024i2d10.1007_s10614-022-10351-6.html
   My bibliography  Save this article

Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria

Author

Listed:
  • Christoph Graf

    (New York University
    Stanford University)

  • Viktor Zobernig

    (University of Natural Resources and Life Sciences)

  • Johannes Schmidt

    (University of Natural Resources and Life Sciences)

  • Claude Klöckl

    (University of Natural Resources and Life Sciences)

Abstract

We test the performance of deep deterministic policy gradient—a deep reinforcement learning algorithm, able to handle continuous state and action spaces—to find Nash equilibria in a setting where firms compete in offer prices through a uniform price auction. These algorithms are typically considered “model-free” although a large set of parameters is utilized by the algorithm. These parameters may include learning rates, memory buffers, state space dimensioning, normalizations, or noise decay rates, and the purpose of this work is to systematically test the effect of these parameter configurations on convergence to the analytically derived Bertrand equilibrium. We find parameter choices that can reach convergence rates of up to 99%. We show that the algorithm also converges in more complex settings with multiple players and different cost structures. Its reliable convergence may make the method a useful tool to studying strategic behavior of firms even in more complex settings.

Suggested Citation

  • Christoph Graf & Viktor Zobernig & Johannes Schmidt & Claude Klöckl, 2024. "Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria," Computational Economics, Springer;Society for Computational Economics, vol. 63(2), pages 529-576, February.
  • Handle: RePEc:kap:compec:v:63:y:2024:i:2:d:10.1007_s10614-022-10351-6
    DOI: 10.1007/s10614-022-10351-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10614-022-10351-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10614-022-10351-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Emmanuel Guerre & Isabelle Perrigne & Quang Vuong, 2000. "Optimal Nonparametric Estimation of First-Price Auctions," Econometrica, Econometric Society, vol. 68(3), pages 525-574, May.
    2. Johann Lussange & Ivan Lazarevich & Sacha Bourgeois-Gironde & Stefano Palminteri & Boris Gutkin, 2021. "Modelling Stock Markets by Multi-agent Reinforcement Learning," Computational Economics, Springer;Society for Computational Economics, vol. 57(1), pages 113-147, January.
    3. Viossat, Yannick & Zapechelnyuk, Andriy, 2013. "No-regret dynamics and fictitious play," Journal of Economic Theory, Elsevier, vol. 148(2), pages 825-842.
    4. Noe, Thomas H. & Rebello, Michael & Wang, Jun, 2012. "Learning to bid: The design of auctions under uncertainty and adaptation," Games and Economic Behavior, Elsevier, vol. 74(2), pages 620-636.
    5. Christopher Boyer & B. Brorsen, 2014. "Implications of a Reserve Price in an Agent-Based Common-Value Auction," Computational Economics, Springer;Society for Computational Economics, vol. 43(1), pages 33-51, January.
    6. Drew Fudenberg & Eric Maskin, 2008. "The Folk Theorem In Repeated Games With Discounting Or With Incomplete Information," World Scientific Book Chapters, in: Drew Fudenberg & David K Levine (ed.), A Long-Run Collaboration On Long-Run Games, chapter 11, pages 209-230, World Scientific Publishing Co. Pte. Ltd..
    7. Harrison, Glenn W, 1989. "Theory and Misbehavior of First-Price Auctions," American Economic Review, American Economic Association, vol. 79(4), pages 749-762, September.
    8. Emilio Calvano & Giacomo Calzolari & Vincenzo Denicolò & Sergio Pastorello, 2020. "Artificial Intelligence, Algorithmic Pricing, and Collusion," American Economic Review, American Economic Association, vol. 110(10), pages 3267-3297, October.
    9. Aliabadi, Danial Esmaeili & Kaya, Murat & Şahin, Güvenç, 2017. "An agent-based simulation of power generation company behavior in electricity markets under different market-clearing mechanisms," Energy Policy, Elsevier, vol. 100(C), pages 191-205.
    10. Jian Yao & Ilan Adler & Shmuel S. Oren, 2008. "Modeling and Computing Two-Settlement Oligopolistic Equilibrium in a Congested Electricity Network," Operations Research, INFORMS, vol. 56(1), pages 34-47, February.
    11. Mar Reguant, 2014. "Complementary Bidding Mechanisms and Startup Costs in Electricity Markets," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(4), pages 1708-1742.
    12. Andreoni James & Miller John H., 1995. "Auctions with Artificial Adaptive Agents," Games and Economic Behavior, Elsevier, vol. 10(1), pages 39-64, July.
    13. Julian Schrittwieser & Ioannis Antonoglou & Thomas Hubert & Karen Simonyan & Laurent Sifre & Simon Schmitt & Arthur Guez & Edward Lockhart & Demis Hassabis & Thore Graepel & Timothy Lillicrap & David , 2020. "Mastering Atari, Go, chess and shogi by planning with a learned model," Nature, Nature, vol. 588(7839), pages 604-609, December.
    14. Koichiro Ito & Mar Reguant, 2016. "Sequential Markets, Market Power, and Arbitrage," American Economic Review, American Economic Association, vol. 106(7), pages 1921-1957, July.
    15. Hommes, Cars H., 2006. "Heterogeneous Agent Models in Economics and Finance," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 23, pages 1109-1186, Elsevier.
    16. Elodie Guerre & I. Perrigne & Q.H. Vuong, 2000. "Optimal nonparametric estimation of first-price auctions [[Estimation nonparamétrique optimale des enchères au premier prix]]," Post-Print hal-02697497, HAL.
    17. Justin Sirignano & Rama Cont, 2019. "Universal features of price formation in financial markets: perspectives from deep learning," Quantitative Finance, Taylor & Francis Journals, vol. 19(9), pages 1449-1459, September.
    18. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    19. Jakub Kastl, 2011. "Discrete Bids and Empirical Inference in Divisible Good Auctions," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 78(3), pages 974-1014.
    20. Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
    21. El Hadi Caoui, 2022. "A Study of Umbrella Damages from Bid Rigging," Journal of Law and Economics, University of Chicago Press, vol. 65(2), pages 239-277.
    22. Foster, Dean P. & Vohra, Rakesh V., 1997. "Calibrated Learning and Correlated Equilibrium," Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 40-55, October.
    23. David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
    24. Viehmann, Johannes & Lorenczik, Stefan & Malischek, Raimund, 2021. "Multi-unit multiple bid auctions in balancing markets: An agent-based Q-learning approach," Energy Economics, Elsevier, vol. 93(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christoph Graf & Viktor Zobernig & Johannes Schmidt & Claude Klockl, 2021. "Computational Performance of Deep Reinforcement Learning to find Nash Equilibria," Papers 2104.12895, arXiv.org.
    2. Esmaeili Aliabadi, Danial & Chan, Katrina, 2022. "The emerging threat of artificial intelligence on competition in liberalized electricity markets: A deep Q-network approach," Applied Energy, Elsevier, vol. 325(C).
    3. Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.
    4. Sang Won Kim & Marcelo Olivares & Gabriel Y. Weintraub, 2014. "Measuring the Performance of Large-Scale Combinatorial Auctions: A Structural Estimation Approach," Management Science, INFORMS, vol. 60(5), pages 1180-1201, May.
    5. Lamy, Laurent & Patnam, Manasa & Visser, Michael, 2016. "Correcting for Sample Selection From Competitive Bidding, with an Application to Estimating the Effect of Wages on Performance," CEPR Discussion Papers 11376, C.E.P.R. Discussion Papers.
    6. Quang Vuong & Ayse Pehlivan, 2015. "Supply Function Competition and Exporters: Nonparametric Identification and Estimation of Productivity Distributions and Marginal Costs," 2015 Meeting Papers 1414, Society for Economic Dynamics.
    7. Hunt Allcott, 2012. "The Smart Grid, Entry, and Imperfect Competition in Electricity Markets," NBER Working Papers 18071, National Bureau of Economic Research, Inc.
    8. Jason Allen & Jakub Kastl & Milena Wittwer, 2020. "Primary Dealers and the Demand for Government Debt," Working Papers 2020-27, Princeton University. Economics Department..
    9. Ali Hortaçsu & Jakub Kastl & Allen Zhang, 2018. "Bid Shading and Bidder Surplus in the US Treasury Auction System," American Economic Review, American Economic Association, vol. 108(1), pages 147-169, January.
    10. Gabrielli, M. Florencia & Willington, Manuel, 2023. "Estimating damages from bidding rings in first-price auctions," Economic Modelling, Elsevier, vol. 126(C).
    11. Patrick Bajari & Ali Hortacsu, 2005. "Are Structural Estimates of Auction Models Reasonable? Evidence from Experimental Data," Journal of Political Economy, University of Chicago Press, vol. 113(4), pages 703-741, August.
    12. Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
    13. Justin P. Johnson & Andrew Rhodes & Matthijs Wildenbeest, 2023. "Platform Design When Sellers Use Pricing Algorithms," Econometrica, Econometric Society, vol. 91(5), pages 1841-1879, September.
    14. Ngo, Vu Minh & Nguyen, Huan Huu & Van Nguyen, Phuc, 2023. "Does reinforcement learning outperform deep learning and traditional portfolio optimization models in frontier and developed financial markets?," Research in International Business and Finance, Elsevier, vol. 65(C).
    15. Chloé Le Coq & Sebastian Schwenen, 2020. "Financial contracts as coordination device," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 29(2), pages 241-259, April.
    16. Kastl, Jakub, 2012. "On the properties of equilibria in private value divisible good auctions with constrained bidding," Journal of Mathematical Economics, Elsevier, vol. 48(6), pages 339-352.
    17. Jakub Kastl & Ali Hortacsu, 2007. "Testing for Common Valuation in Treasury Bills Auctions," 2007 Meeting Papers 222, Society for Economic Dynamics.
    18. Hickman Brent R. & Hubbard Timothy P. & Sağlam Yiğit, 2012. "Structural Econometric Methods in Auctions: A Guide to the Literature," Journal of Econometric Methods, De Gruyter, vol. 1(1), pages 67-106, August.
    19. De Moor, Bram J. & Gijsbrechts, Joren & Boute, Robert N., 2022. "Reward shaping to improve the performance of deep reinforcement learning in perishable inventory management," European Journal of Operational Research, Elsevier, vol. 301(2), pages 535-545.
    20. Susan Athey & Philip A. Haile, 2006. "Empirical Models of Auctions," NBER Working Papers 12126, National Bureau of Economic Research, Inc.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:compec:v:63:y:2024:i:2:d:10.1007_s10614-022-10351-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.