IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2104.12895.html
   My bibliography  Save this paper

Computational Performance of Deep Reinforcement Learning to find Nash Equilibria

Author

Listed:
  • Christoph Graf
  • Viktor Zobernig
  • Johannes Schmidt
  • Claude Klockl

Abstract

We test the performance of deep deterministic policy gradient (DDPG), a deep reinforcement learning algorithm, able to handle continuous state and action spaces, to learn Nash equilibria in a setting where firms compete in prices. These algorithms are typically considered model-free because they do not require transition probability functions (as in e.g., Markov games) or predefined functional forms. Despite being model-free, a large set of parameters are utilized in various steps of the algorithm. These are e.g., learning rates, memory buffers, state-space dimensioning, normalizations, or noise decay rates and the purpose of this work is to systematically test the effect of these parameter configurations on convergence to the analytically derived Bertrand equilibrium. We find parameter choices that can reach convergence rates of up to 99%. The reliable convergence may make the method a useful tool to study strategic behavior of firms even in more complex settings. Keywords: Bertrand Equilibrium, Competition in Uniform Price Auctions, Deep Deterministic Policy Gradient Algorithm, Parameter Sensitivity Analysis

Suggested Citation

  • Christoph Graf & Viktor Zobernig & Johannes Schmidt & Claude Klockl, 2021. "Computational Performance of Deep Reinforcement Learning to find Nash Equilibria," Papers 2104.12895, arXiv.org.
  • Handle: RePEc:arx:papers:2104.12895
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2104.12895
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Emmanuel Guerre & Isabelle Perrigne & Quang Vuong, 2000. "Optimal Nonparametric Estimation of First-Price Auctions," Econometrica, Econometric Society, vol. 68(3), pages 525-574, May.
    2. Jakub Kastl, 2011. "Discrete Bids and Empirical Inference in Divisible Good Auctions," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 78(3), pages 974-1014.
    3. Jian Yao & Ilan Adler & Shmuel S. Oren, 2008. "Modeling and Computing Two-Settlement Oligopolistic Equilibrium in a Congested Electricity Network," Operations Research, INFORMS, vol. 56(1), pages 34-47, February.
    4. Elodie Guerre & I. Perrigne & Q.H. Vuong, 2000. "Optimal nonparametric estimation of first-price auctions [[Estimation nonparamétrique optimale des enchères au premier prix]]," Post-Print hal-02697497, HAL.
    5. Christoph Graf & Federico Quaglia & Frank A. Wolak, 2020. "Simplified Electricity Market Models with Significant Intermittent Renewable Capacity: Evidence from Italy," NBER Working Papers 27262, National Bureau of Economic Research, Inc.
    6. Foster, Dean P. & Vohra, Rakesh V., 1997. "Calibrated Learning and Correlated Equilibrium," Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 40-55, October.
    7. Mar Reguant, 2014. "Complementary Bidding Mechanisms and Startup Costs in Electricity Markets," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(4), pages 1708-1742.
    8. Julian Schrittwieser & Ioannis Antonoglou & Thomas Hubert & Karen Simonyan & Laurent Sifre & Simon Schmitt & Arthur Guez & Edward Lockhart & Demis Hassabis & Thore Graepel & Timothy Lillicrap & David , 2020. "Mastering Atari, Go, chess and shogi by planning with a learned model," Nature, Nature, vol. 588(7839), pages 604-609, December.
    9. Byford, Martin C., 2015. "A theoretical foundation for the undercut-proof equilibrium," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 209-220.
    10. Shy,Oz, 2001. "The Economics of Network Industries," Cambridge Books, Cambridge University Press, number 9780521805001, January.
    11. David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
    12. John Asker & Chaim Fershtman & Ariel Pakes, 2021. "Artificial Intelligence and Pricing: The Impact of Algorithm Design," NBER Working Papers 28535, National Bureau of Economic Research, Inc.
    13. Viehmann, Johannes & Lorenczik, Stefan & Malischek, Raimund, 2021. "Multi-unit multiple bid auctions in balancing markets: An agent-based Q-learning approach," Energy Economics, Elsevier, vol. 93(C).
    14. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christoph Graf & Viktor Zobernig & Johannes Schmidt & Claude Klöckl, 2024. "Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria," Computational Economics, Springer;Society for Computational Economics, vol. 63(2), pages 529-576, February.
    2. Sang Won Kim & Marcelo Olivares & Gabriel Y. Weintraub, 2014. "Measuring the Performance of Large-Scale Combinatorial Auctions: A Structural Estimation Approach," Management Science, INFORMS, vol. 60(5), pages 1180-1201, May.
    3. Huang, Ruchen & He, Hongwen & Gao, Miaojue, 2023. "Training-efficient and cost-optimal energy management for fuel cell hybrid electric bus based on a novel distributed deep reinforcement learning framework," Applied Energy, Elsevier, vol. 346(C).
    4. Lamy, Laurent & Patnam, Manasa & Visser, Michael, 2016. "Correcting for Sample Selection From Competitive Bidding, with an Application to Estimating the Effect of Wages on Performance," CEPR Discussion Papers 11376, C.E.P.R. Discussion Papers.
    5. Quang Vuong & Ayse Pehlivan, 2015. "Supply Function Competition and Exporters: Nonparametric Identification and Estimation of Productivity Distributions and Marginal Costs," 2015 Meeting Papers 1414, Society for Economic Dynamics.
    6. Hunt Allcott, 2012. "The Smart Grid, Entry, and Imperfect Competition in Electricity Markets," NBER Working Papers 18071, National Bureau of Economic Research, Inc.
    7. Jason Allen & Jakub Kastl & Milena Wittwer, 2020. "Primary Dealers and the Demand for Government Debt," Working Papers 2020-27, Princeton University. Economics Department..
    8. Ali Hortaçsu & Jakub Kastl & Allen Zhang, 2018. "Bid Shading and Bidder Surplus in the US Treasury Auction System," American Economic Review, American Economic Association, vol. 108(1), pages 147-169, January.
    9. Jason Allen & Jakub Kastl & Milena Wittwer, 2020. "Maturity Composition and the Demand for Government Debt," Staff Working Papers 20-29, Bank of Canada.
    10. Pesendorfer, Martin & Cantillon, Estelle, 2007. "Combination Bidding in Multi-Unit Auctions," CEPR Discussion Papers 6083, C.E.P.R. Discussion Papers.
    11. Philip A Haile & Yuichi Kitamura, 2019. "Unobserved heterogeneity in auctions," The Econometrics Journal, Royal Economic Society, vol. 22(1), pages 1-19.
    12. Maria Chiara D?Errico, 2020. "Competition in the Italian electricity market: The unforeseen social welfare losses of reform," ECONOMICS AND POLICY OF ENERGY AND THE ENVIRONMENT, FrancoAngeli Editore, vol. 2020(2), pages 75-91.
    13. Jason Allen & Ali Hortaçsu & Eric Richert & Milena Wittwer, 2024. "Entry and Exit in Treasury Auctions," Staff Working Papers 24-29, Bank of Canada.
    14. Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.
    15. Susan Athey & Philip A. Haile, 2006. "Empirical Models of Auctions," NBER Working Papers 12126, National Bureau of Economic Research, Inc.
    16. Pietro Bonaldi & Ali Hortaçsu & Jakub Kastl, 2015. "Empirical Analysis of Funding Cost Spillovers in the EURO Zone with Application to Systemic Risk," Working Papers 2015-5, Princeton University. Economics Department..
    17. Hickman Brent R. & Hubbard Timothy P. & Sağlam Yiğit, 2012. "Structural Econometric Methods in Auctions: A Guide to the Literature," Journal of Econometric Methods, De Gruyter, vol. 1(1), pages 67-106, August.
    18. Simona Bigerna & Carlo Andrea Bollino & Maria Chiara D’Errico & Paolo Polinori, 2023. "A new design for market power monitoring in the electricity market. A simulation for Italy," Economia Politica: Journal of Analytical and Institutional Economics, Springer;Fondazione Edison, vol. 40(1), pages 285-317, April.
    19. Pietro Bonaldi & Ali Hortaçsu & Jakub Kastl, 2015. "Empirical Analysis of Funding Cost Spillovers in the EURO Zone with Application to Systemic Risk," Working Papers 2015-4, Princeton University. Economics Department..
    20. Kastl, Jakub, 2012. "On the properties of equilibria in private value divisible good auctions with constrained bidding," Journal of Mathematical Economics, Elsevier, vol. 48(6), pages 339-352.

    More about this item

    Keywords

    bertrand equilibrium; competition in uniform price auctions; deep deterministic policy gradient algorithm; parameter sensitivity analysis;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2104.12895. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.