IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2002.10990.html
   My bibliography  Save this paper

G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning

Author

Listed:
  • Matthew Dixon
  • Igor Halperin

Abstract

We present a reinforcement learning approach to goal based wealth management problems such as optimization of retirement plans or target dated funds. In such problems, an investor seeks to achieve a financial goal by making periodic investments in the portfolio while being employed, and periodically draws from the account when in retirement, in addition to the ability to re-balance the portfolio by selling and buying different assets (e.g. stocks). Instead of relying on a utility of consumption, we present G-Learner: a reinforcement learning algorithm that operates with explicitly defined one-step rewards, does not assume a data generation process, and is suitable for noisy data. Our approach is based on G-learning - a probabilistic extension of the Q-learning method of reinforcement learning. In this paper, we demonstrate how G-learning, when applied to a quadratic reward and Gaussian reference policy, gives an entropy-regulated Linear Quadratic Regulator (LQR). This critical insight provides a novel and computationally tractable tool for wealth management tasks which scales to high dimensional portfolios. In addition to the solution of the direct problem of G-learning, we also present a new algorithm, GIRL, that extends our goal-based G-learning approach to the setting of Inverse Reinforcement Learning (IRL) where rewards collected by the agent are not observed, and should instead be inferred. We demonstrate that GIRL can successfully learn the reward parameters of a G-Learner agent and thus imitate its behavior. Finally, we discuss potential applications of the G-Learner and GIRL algorithms for wealth management and robo-advising.

Suggested Citation

  • Matthew Dixon & Igor Halperin, 2020. "G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning," Papers 2002.10990, arXiv.org.
  • Handle: RePEc:arx:papers:2002.10990
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2002.10990
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Merton, Robert C., 1971. "Optimum consumption and portfolio rules in a continuous-time model," Journal of Economic Theory, Elsevier, vol. 3(4), pages 373-413, December.
    2. Browne, S., 1996. "Reaching Goals by a Deadline: Digital Options and Continuous-Time Active Portfolio Management," Papers 96-16, Columbia - Graduate School of Business.
    3. Igor Halperin & Ilya Feldshteyn, 2018. "Market Self-Learning of Signals, Impact and Optimal Trading: Invisible Hand Inference with Free Energy," Papers 1805.06126, arXiv.org.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Chung I Lu, 2023. "Evaluation of Deep Reinforcement Learning Algorithms for Portfolio Optimisation," Papers 2307.07694, arXiv.org, revised Jul 2023.
    2. Ben Hambly & Renyuan Xu & Huining Yang, 2023. "Recent advances in reinforcement learning in finance," Mathematical Finance, Wiley Blackwell, vol. 33(3), pages 437-503, July.
    3. Tessa Bauman & Bruno Gav{s}perov & Stjepan Beguv{s}i'c & Zvonko Kostanjv{c}ar, 2023. "Deep Reinforcement Learning for Robust Goal-Based Wealth Management," Papers 2307.13501, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Auffret, Philippe, 2001. "An alternative unifying measure of welfare gains from risk-sharing," Policy Research Working Paper Series 2676, The World Bank.
    2. Chen, An & Hieber, Peter & Sureth, Caren, 2022. "Pay for tax certainty? Advance tax rulings for risky investment under multi-dimensional tax uncertainty," arqus Discussion Papers in Quantitative Tax Research 273, arqus - Arbeitskreis Quantitative Steuerlehre.
    3. Sanchez-Romero, Miguel, 2006. "“Demand for Private Annuities and Social Security: Consequences to Individual Wealth”," Working Papers in Economic Theory 2006/07, Universidad Autónoma de Madrid (Spain), Department of Economic Analysis (Economic Theory and Economic History).
    4. Andreas Fagereng & Luigi Guiso & Davide Malacrino & Luigi Pistaferri, 2020. "Heterogeneity and Persistence in Returns to Wealth," Econometrica, Econometric Society, vol. 88(1), pages 115-170, January.
    5. John H. Cochrane, 1999. "New facts in finance," Economic Perspectives, Federal Reserve Bank of Chicago, vol. 23(Q III), pages 36-58.
    6. Luca Di Persio & Luca Prezioso & Kai Wallbaum, 2019. "Closed-End Formula for options linked to Target Volatility Strategies," Papers 1902.08821, arXiv.org.
    7. Larrain, Borja, 2011. "World betas, consumption growth, and financial integration," Journal of International Money and Finance, Elsevier, vol. 30(6), pages 999-1018, October.
    8. Song, Dandan & Wang, Huamao & Yang, Zhaojun, 2014. "Learning, pricing, timing and hedging of the option to invest for perpetual cash flows with idiosyncratic risk," Journal of Mathematical Economics, Elsevier, vol. 51(C), pages 1-11.
    9. Devereux, Michael B. & Saito, Makoto, 1997. "Growth and risk-sharing with incomplete international assets markets," Journal of International Economics, Elsevier, vol. 42(3-4), pages 453-481, May.
    10. John Y. Campbell & Luis M. Viceira & Joshua S. White, 2003. "Foreign Currency for Long-Term Investors," Economic Journal, Royal Economic Society, vol. 113(486), pages 1-25, March.
    11. repec:dau:papers:123456789/56 is not listed on IDEAS
    12. Stephen Satchell & Susan Thorp, 2007. "Scenario Analysis with Recursive Utility: Dynamic Consumption Plans for Charitable Endowments," Research Paper Series 209, Quantitative Finance Research Centre, University of Technology, Sydney.
    13. Cuoco, Domenico & Liu, Hong, 2000. "Optimal consumption of a divisible durable good," Journal of Economic Dynamics and Control, Elsevier, vol. 24(4), pages 561-613, April.
    14. Renaud Bourlès & Dominique Henriet, 2012. "Risk-sharing Contracts with Asymmetric Information," The Geneva Risk and Insurance Review, Palgrave Macmillan;International Association for the Study of Insurance Economics (The Geneva Association), vol. 37(1), pages 27-56, March.
    15. Hong‐Chih Huang, 2010. "Optimal Multiperiod Asset Allocation: Matching Assets to Liabilities in a Discrete Model," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 77(2), pages 451-472, June.
    16. Carlos Garriga & Mark P. Keightley, 2007. "A general equilibrium theory of college with education subsidies, in-school labor supply, and borrowing constraints," Working Papers 2007-051, Federal Reserve Bank of St. Louis.
    17. Orszag, J. Michael & Yang, Hong, 1995. "Portfolio choice with Knightian uncertainty," Journal of Economic Dynamics and Control, Elsevier, vol. 19(5-7), pages 873-900.
    18. Bjork, Tomas, 2009. "Arbitrage Theory in Continuous Time," OUP Catalogue, Oxford University Press, edition 3, number 9780199574742.
    19. Andrew Papanicolaou, 2018. "Backward SDEs for Control with Partial Information," Papers 1807.08222, arXiv.org.
    20. E. Nasakkala & J. Keppo, 2008. "Hydropower with Financial Information," Applied Mathematical Finance, Taylor & Francis Journals, vol. 15(5-6), pages 503-529.
    21. Letendre, Marc-Andre & Smith, Gregor W., 2001. "Precautionary saving and portfolio allocation: DP by GMM," Journal of Monetary Economics, Elsevier, vol. 48(1), pages 197-215, August.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2002.10990. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.