IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2108.02755.html
   My bibliography  Save this paper

The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning

Author

Listed:
  • Stephan Zheng
  • Alexander Trott
  • Sunil Srinivasa
  • David C. Parkes
  • Richard Socher

Abstract

AI and reinforcement learning (RL) have improved many areas, but are not yet widely adopted in economic policy design, mechanism design, or economics at large. At the same time, current economic methodology is limited by a lack of counterfactual data, simplistic behavioral models, and limited opportunities to experiment with policies and evaluate behavioral responses. Here we show that machine-learning-based economic simulation is a powerful policy and mechanism design framework to overcome these limitations. The AI Economist is a two-level, deep RL framework that trains both agents and a social planner who co-adapt, providing a tractable solution to the highly unstable and novel two-level RL challenge. From a simple specification of an economy, we learn rational agent behaviors that adapt to learned planner policies and vice versa. We demonstrate the efficacy of the AI Economist on the problem of optimal taxation. In simple one-step economies, the AI Economist recovers the optimal tax policy of economic theory. In complex, dynamic economies, the AI Economist substantially improves both utilitarian social welfare and the trade-off between equality and productivity over baselines. It does so despite emergent tax-gaming strategies, while accounting for agent interactions and behavioral change more accurately than economic theory. These results demonstrate for the first time that two-level, deep RL can be used for understanding and as a complement to theory for economic design, unlocking a new computational learning-based approach to understanding economic policy.

Suggested Citation

  • Stephan Zheng & Alexander Trott & Sunil Srinivasa & David C. Parkes & Richard Socher, 2021. "The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning," Papers 2108.02755, arXiv.org.
  • Handle: RePEc:arx:papers:2108.02755
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2108.02755
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Narayana R. Kocherlakota, 2010. "The New Dynamic Public Finance," Economics Books, Princeton University Press, edition 1, number 9222.
    2. A. J. Auerbach & M. Feldstein (ed.), 2002. "Handbook of Public Economics," Handbook of Public Economics, Elsevier, edition 1, volume 4, number 4.
    3. A. J. Auerbach & M. Feldstein (ed.), 2002. "Handbook of Public Economics," Handbook of Public Economics, Elsevier, edition 1, volume 3, number 3.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alexander Trott & Sunil Srinivasa & Douwe van der Wal & Sebastien Haneuse & Stephan Zheng, 2021. "Building a Foundation for Data-Driven, Interpretable, and Robust Policy Design using the AI Economist," Papers 2108.02904, arXiv.org.
    2. Ariel Alexi & Teddy Lazebnik & Labib Shami, 2024. "Microfounded Tax Revenue Forecast Model with Heterogeneous Population and Genetic Algorithm Approach," Computational Economics, Springer;Society for Computational Economics, vol. 63(5), pages 1705-1734, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. James Alm, 2018. "Is the Haig‐Simons Standard Dead? The Uneasy Case for a Comprehensive Income Tax," National Tax Journal, National Tax Association;National Tax Journal, vol. 71(2), pages 379-398, June.
    2. Tran, Chung & Wende, Sebastian, 2021. "On the marginal excess burden of taxation in an overlapping generations model," Journal of Macroeconomics, Elsevier, vol. 70(C).
    3. Filistrucchi, L. & Ozbugday, F.C., 2012. "Mandatory Quality Disclosure and Quality Supply : Evidence from German Hospitals," Other publications TiSEM 680b0e3e-d3f5-4b91-9803-8, Tilburg University, School of Economics and Management.
    4. Montalvo, José G. & Piolatto, Amedeo & Raya, Josep, 2020. "Transaction-tax evasion in the housing market," Regional Science and Urban Economics, Elsevier, vol. 81(C).
    5. Nikolov, Plamen & Adelman, Alan, 2019. "Do private household transfers to the elderly respond to public pension benefits? Evidence from rural China," The Journal of the Economics of Ageing, Elsevier, vol. 14(C).
    6. Desai, Mihir A. & Hines, James R. Jr., 2002. "Expectations and Expatriations: Tracing the Causes and Consequences of Corporate Inversions," National Tax Journal, National Tax Association;National Tax Journal, vol. 55(3), pages 409-440, September.
    7. Magda Iga & Kiełczewska Aneta & Brandt Nicola, 2020. "The effect of child benefit on female labor supply," IZA Journal of Labor Policy, Sciendo & Forschungsinstitut zur Zukunft der Arbeit GmbH (IZA), vol. 10(1), pages 1-18, March.
    8. Carbonnier Cl´ement, 2014. "The incidence of non-linear consumption taxes," Научный результат. Серия «Экономические исследования», CyberLeninka;Федеральное государственное автономное образовательное учреждение высшего образования «Белгородский государственный национальный исследовательский университет», issue 1, pages 5-18.
    9. Auerbach, Alan & Kueng, Lorenz & Lee, Ronald & Yatsynovich, Yury, 2018. "Propagation and smoothing of shocks in alternative social security systems," Journal of Public Economics, Elsevier, vol. 164(C), pages 91-105.
    10. Boone, Jan & Müller, Wieland, 2012. "The distribution of harm in price-fixing cases," International Journal of Industrial Organization, Elsevier, vol. 30(2), pages 265-276.
    11. Jean-Pierre Laffargue, 2009. "Intergenerational Transfers and the Stability of Public Debt with Short-Lived Governments," Mathematical Population Studies, Taylor & Francis Journals, vol. 16(1), pages 79-104.
    12. Louis Kaplow, 2009. "Utility from Accumulation," NBER Working Papers 15595, National Bureau of Economic Research, Inc.
    13. Helliwell, John & Huang, Haifang, 2011. "New measures of the costs of unemployment: Evidence from the subjective well-being of 2.3 million Americans," Working Papers 2011-3, University of Alberta, Department of Economics.
    14. Håkan Selin, 2012. "Marginal Tax Rates and Tax‐Favoured Pension Savings of the Self‐Employed: Evidence from Sweden," Scandinavian Journal of Economics, Wiley Blackwell, vol. 114(1), pages 79-100, March.
    15. Giesecke, Matthias & Jäger, Philipp, 2021. "Pension incentives and labor supply: Evidence from the introduction of universal old-age assistance in the UK," Journal of Public Economics, Elsevier, vol. 203(C).
    16. Berriel, Tiago Couto & Zilberman, Eduardo, 2011. "Targeting the poor: a macroeconomic analysis of cash transfer programs," FGV EPGE Economics Working Papers (Ensaios Economicos da EPGE) 726, EPGE Brazilian School of Economics and Finance - FGV EPGE (Brazil).
    17. Louis Kaplow, 2014. "Government Policy and Labor Supply with Myopic or Targeted Savings Decisions," NBER Chapters, in: Tax Policy and the Economy, Volume 29, pages 159-193, National Bureau of Economic Research, Inc.
    18. Keane, Claire & Walsh, John R. & Callan, Tim & Savage, Michael, 2012. "Property Tax in Ireland: Key Choices," Papers EC11, Economic and Social Research Institute (ESRI).
    19. Andersen, Torben M. & Bhattacharya, Joydeep & Gestsson, Marias H., 2021. "Pareto-improving transition to fully funded pensions under myopia," Journal of Demographic Economics, Cambridge University Press, vol. 87(2), pages 169-212, June.
    20. Nils aus dem Moore, 2014. "Taxes and Corporate Financing Decisions – Evidence from the Belgian ACE Reform," Ruhr Economic Papers 0533, Rheinisch-Westfälisches Institut für Wirtschaftsforschung, Ruhr-Universität Bochum, Universität Dortmund, Universität Duisburg-Essen.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2108.02755. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.