Printed from https://ideas.repec.org/p/arx/papers/2405.15975.html

Inference of Utilities and Time Preference in Sequential Decision-Making

Authors
  • Haoyang Cao
  • Zhengqi Wu
  • Renyuan Xu

Abstract

This paper introduces a novel stochastic control framework to enhance the capabilities of automated investment managers, or robo-advisors, by accurately inferring clients' investment preferences from past activities. Our approach leverages a continuous-time model that incorporates utility functions and a generic discounting scheme of a time-varying rate, tailored to each client's risk tolerance, valuation of daily consumption, and significant life goals. We address the resulting time inconsistency issue through state augmentation and the establishment of the dynamic programming principle and the verification theorem. Additionally, we provide sufficient conditions for the identifiability of client investment preferences. To complement our theoretical developments, we propose a learning algorithm based on maximum likelihood estimation within a discrete-time Markov Decision Process framework, augmented with entropy regularization. We prove that the log-likelihood function is locally concave, facilitating the fast convergence of our proposed algorithm. Practical effectiveness and efficiency are showcased through two numerical examples, including Merton's problem and an investment problem with unhedgeable risks. Our proposed framework not only advances financial technology by improving personalized investment advice but also contributes broadly to other fields such as healthcare, economics, and artificial intelligence, where understanding individual preferences is crucial.
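
As a rough illustration of the learning step the abstract describes (maximum likelihood estimation in a discrete-time, entropy-regularized Markov Decision Process), the following minimal sketch recovers a single risk-aversion parameter from observed choices. It is not the authors' algorithm. The wealth grid, consumption fractions, transition rule, CRRA utility form, known discount factor `gamma`, temperature `lam`, the idealized "expected count" data, and the grid search are all assumptions made for this example.

```python
import numpy as np

def crra_reward(theta, wealth, frac):
    """CRRA utility of consumption x = frac * wealth (assumed form, theta != 1)."""
    x = np.outer(wealth, frac)                      # shape (S, A)
    return x ** (1.0 - theta) / (1.0 - theta)

def soft_policy(reward, P, gamma, lam, n_iter=500):
    """Entropy-regularized (soft) value iteration; returns pi(a|s), shape (S, A)."""
    V = np.zeros(reward.shape[0])
    for _ in range(n_iter):
        Q = reward + gamma * (P @ V)                # (S, A, S) @ (S,) -> (S, A)
        m = Q.max(axis=1)
        # Soft Bellman backup: V(s) = lam * log sum_a exp(Q(s, a) / lam)
        V = m + lam * np.log(np.exp((Q - m[:, None]) / lam).sum(axis=1))
    Q = reward + gamma * (P @ V)
    w = np.exp((Q - Q.max(axis=1, keepdims=True)) / lam)
    return w / w.sum(axis=1, keepdims=True)         # softmax of Q / lam

def log_likelihood(theta, counts, wealth, frac, P, gamma, lam):
    """Log-likelihood of state-action counts under the policy induced by theta."""
    pi = soft_policy(crra_reward(theta, wealth, frac), P, gamma, lam)
    return float((counts * np.log(pi)).sum())

# Hypothetical toy environment: consuming more makes a wealth drop more likely.
wealth = np.array([1.0, 2.0, 3.0, 4.0])
frac = np.array([0.2, 0.5, 0.8])
S, A = len(wealth), len(frac)
P = np.zeros((S, A, S))
for s in range(S):
    for a, c in enumerate(frac):
        P[s, a, min(s + 1, S - 1)] += 1.0 - c       # wealth moves up one level
        P[s, a, max(s - 1, 0)] += c                 # wealth moves down one level

gamma, lam, theta_true = 0.9, 1.0, 2.0
pi_true = soft_policy(crra_reward(theta_true, wealth, frac), P, gamma, lam)

# Idealized data: expected action counts from 100 visits to each state.
counts = 100.0 * pi_true

# Maximum likelihood by grid search over candidate risk-aversion values.
grid = np.array([0.5, 1.5, 2.0, 2.5, 3.0])
ll = np.array([log_likelihood(t, counts, wealth, frac, P, gamma, lam) for t in grid])
theta_hat = grid[np.argmax(ll)]
print("estimated risk aversion:", theta_hat)
```

The grid search stands in for whatever optimizer one would actually use. The abstract's local concavity result concerns the paper's own parametrization and is not something this toy example establishes.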

Suggested Citation

  • Haoyang Cao & Zhengqi Wu & Renyuan Xu, 2024. "Inference of Utilities and Time Preference in Sequential Decision-Making," Papers 2405.15975, arXiv.org, revised Jun 2024.
  • Handle: RePEc:arx:papers:2405.15975

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2405.15975
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Humoud Alsabah & Agostino Capponi & Octavio Ruiz Lacedelli & Matt Stern, 2021. "Robo-Advising: Learning Investors’ Risk Preferences via Portfolio Choices," Journal of Financial Econometrics, Oxford University Press, vol. 19(2), pages 369-392.
    2. Ying Hu & Hanqing Jin & Xun Yu Zhou, 2012. "Time-Inconsistent Stochastic Linear-Quadratic Control," Post-Print hal-00691816, HAL.
    3. Thaleia Zariphopoulou, 2001. "A solution approach to valuation with unhedgeable risks," Finance and Stochastics, Springer, vol. 5(1), pages 61-82.
    4. Dybvig, Philip H & Rogers, L C G, 1997. "Recovery of Preferences from Observed Wealth in a Single Realization," The Review of Financial Studies, Society for Financial Studies, vol. 10(1), pages 151-174.
    5. Alexander M. G. Cox & David Hobson & Jan Obłój, 2014. "Utility Theory Front To Back — Inferring Utility From Agents' Choices," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 17(03), pages 1-44.
    6. Francesco D’Acunto & Nagpurnanand Prabhala & Alberto G Rossi, 2019. "The Promises and Pitfalls of Robo-Advising," The Review of Financial Studies, Society for Financial Studies, vol. 32(5), pages 1983-2020.
    7. Agostino Capponi & Sveinn Ólafsson & Thaleia Zariphopoulou, 2022. "Personalized Robo-Advising: Enhancing Investment Through Client Interaction," Management Science, INFORMS, vol. 68(4), pages 2485-2512, April.
    8. Sargent, Thomas J, 1978. "Estimation of Dynamic Labor Demand Schedules under Rational Expectations," Journal of Political Economy, University of Chicago Press, vol. 86(6), pages 1009-1044, December.
    9. Nicole Bäuerle & Ulrich Rieder, 2014. "More Risk-Sensitive Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 39(1), pages 105-120, February.
    10. Nicole El Karoui & Mohamed Mrad, 2021. "Recover Dynamic Utility from Observable Process: Application to the economic equilibrium," Post-Print hal-01966312, HAL.
    11. Ying Hu & Hanqing Jin & Xun Yu Zhou, 2017. "Time-Inconsistent Stochastic Linear-Quadratic Control: Characterization and Uniqueness of Equilibrium," Post-Print hal-01139343, HAL.
    12. Hanqing Jin & Xun Yu Zhou, 2008. "Behavioral Portfolio Selection In Continuous Time," Mathematical Finance, Wiley Blackwell, vol. 18(3), pages 385-426, July.
    13. Jiwoong Shin & Jungju Yu, 2021. "Targeted Advertising and Consumer Inference," Marketing Science, INFORMS, vol. 40(5), pages 900-922, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marcel Nutz & Yuchong Zhang, 2019. "Conditional Optimal Stopping: A Time-Inconsistent Optimization," Papers 1901.05802, arXiv.org, revised Oct 2019.
    2. Jiaqin Wei & Jianming Xia & Qian Zhao, 2024. "Time-Consistent Portfolio Selection for Rank-Dependent Utilities in an Incomplete Market," Papers 2409.19259, arXiv.org.
    3. Ying Hu & Hanqing Jin & Xun Yu Zhou, 2021. "Consistent investment of sophisticated rank‐dependent utility agents in continuous time," Mathematical Finance, Wiley Blackwell, vol. 31(3), pages 1056-1095, July.
    4. Keffert, Henk, 2024. "Robo-advising: Optimal investment with mismeasured and unstable risk preferences," European Journal of Operational Research, Elsevier, vol. 315(1), pages 378-392.
    5. Cardillo, Giovanni & Chiappini, Helen, 2024. "Robo-advisors: A systematic literature review," Finance Research Letters, Elsevier, vol. 62(PA).
    6. Zhiping Chen & Liyuan Wang & Ping Chen & Haixiang Yao, 2019. "Continuous-Time Mean–Variance Optimization For Defined Contribution Pension Funds With Regime-Switching," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 22(06), pages 1-33, September.
    7. Camilo Hernández & Dylan Possamaï, 2020. "Me, myself and I: a general theory of non-Markovian time-inconsistent stochastic control for sophisticated agents," Papers 2002.12572, arXiv.org, revised Jul 2021.
    8. Ying Hu & Hanqing Jin & Xun Yu Zhou, 2020. "Consistent Investment of Sophisticated Rank-Dependent Utility Agents in Continuous Time," Working Papers hal-02624308, HAL.
    9. Wei Ji, 2024. "Closed-loop and open-loop equilibrium of a class time-inconsistent linear-quadratic differential games," International Journal of Game Theory, Springer;Game Theory Society, vol. 53(2), pages 635-651, June.
    10. Yushi Hamaguchi, 2019. "Time-inconsistent consumption-investment problems in incomplete markets under general discount functions," Papers 1912.01281, arXiv.org, revised Mar 2021.
    11. Xue Dong He & Xun Yu Zhou, 2021. "Who Are I: Time Inconsistency and Intrapersonal Conflict and Reconciliation," Papers 2105.01829, arXiv.org.
    12. Ying Hu & Hanqing Jin & Xun Yu Zhou, 2020. "Consistent Investment of Sophisticated Rank-Dependent Utility Agents in Continuous Time," Papers 2006.01979, arXiv.org.
    13. Zongxia Liang & Sheng Wang & Jianming Xia & Fengyi Yuan, 2024. "Dynamic portfolio selection under generalized disappointment aversion," Papers 2401.08323, arXiv.org, revised Mar 2024.
    14. Hanqing Jin & Yimin Yang, 2014. "Time-Inconsistent Mean-Utility Portfolio Selection with Moving Target," Papers 1402.6760, arXiv.org.
    15. Zhongyang Sun & Xianping Guo, 2019. "Equilibrium for a Time-Inconsistent Stochastic Linear–Quadratic Control System with Jumps and Its Application to the Mean-Variance Problem," Journal of Optimization Theory and Applications, Springer, vol. 181(2), pages 383-410, May.
    16. Haiyang Wang & Ruimin Xu, 2023. "Time-Inconsistent LQ Games for Large-Population Systems and Applications," Journal of Optimization Theory and Applications, Springer, vol. 197(3), pages 1249-1268, June.
    17. Bingyan Han & Chi Seng Pun & Hoi Ying Wong, 2023. "Robust Time-inconsistent Linear-Quadratic Stochastic Controls: A Stochastic Differential Game Approach," Papers 2306.16982, arXiv.org, revised Sep 2024.
    18. Zongxia Liang & Fengyi Yuan, 2021. "Weak equilibria for time-inconsistent control: with applications to investment-withdrawal decisions," Papers 2105.06607, arXiv.org, revised Jun 2023.
    19. Yan, Tingjin & Wong, Hoi Ying, 2020. "Open-loop equilibrium reinsurance-investment strategy under mean–variance criterion with stochastic volatility," Insurance: Mathematics and Economics, Elsevier, vol. 90(C), pages 105-119.
    20. Bingyan Han & Hoi Ying Wong, 2019. "Time-inconsistency with rough volatility," Papers 1907.11378, arXiv.org, revised Dec 2021.
