Inference of Utilities and Time Preference in Sequential Decision-Making

My bibliography Save this paper

Inference of Utilities and Time Preference in Sequential Decision-Making

Author

Listed:

Haoyang Cao
Zhengqi Wu
Renyuan Xu

Registered:

Abstract

This paper introduces a novel stochastic control framework to enhance the capabilities of automated investment managers, or robo-advisors, by accurately inferring clients' investment preferences from past activities. Our approach leverages a continuous-time model that incorporates utility functions and a generic discounting scheme of a time-varying rate, tailored to each client's risk tolerance, valuation of daily consumption, and significant life goals. We address the resulting time inconsistency issue through state augmentation and the establishment of the dynamic programming principle and the verification theorem. Additionally, we provide sufficient conditions for the identifiability of client investment preferences. To complement our theoretical developments, we propose a learning algorithm based on maximum likelihood estimation within a discrete-time Markov Decision Process framework, augmented with entropy regularization. We prove that the log-likelihood function is locally concave, facilitating the fast convergence of our proposed algorithm. Practical effectiveness and efficiency are showcased through two numerical examples, including Merton's problem and an investment problem with unhedgeable risks. Our proposed framework not only advances financial technology by improving personalized investment advice but also contributes broadly to other fields such as healthcare, economics, and artificial intelligence, where understanding individual preferences is crucial.

Suggested Citation

Haoyang Cao & Zhengqi Wu & Renyuan Xu, 2024. "Inference of Utilities and Time Preference in Sequential Decision-Making," Papers 2405.15975, arXiv.org, revised Jun 2024.

Handle: RePEc:arx:papers:2405.15975

Download full text from publisher

References listed on IDEAS

Humoud Alsabah & Agostino Capponi & Octavio Ruiz Lacedelli & Matt Stern, 2021. "Robo-Advising: Learning Investors’ Risk Preferences via Portfolio Choices [Mean-variance versus Full-scale Optimisation: In and out of Sample]," Journal of Financial Econometrics, Oxford University Press, vol. 19(2), pages 369-392.
Ying Hu & Hanqing Jin & Xun Yu Zhou, 2012. "Time-Inconsistent Stochastic Linear--Quadratic Control," Post-Print hal-00691816, HAL.
Thaleia Zariphopoulou, 2001. "A solution approach to valuation with unhedgeable risks," Finance and Stochastics, Springer, vol. 5(1), pages 61-82.
Dybvig, Philip H & Rogers, L C G, 1997. "Recovery of Preferences from Observed Wealth in a Single Realization," The Review of Financial Studies, Society for Financial Studies, vol. 10(1), pages 151-174.
Alexander M. G. Cox & David Hobson & Jan Obłój, 2014. "Utility Theory Front To Back — Inferring Utility From Agents' Choices," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 17(03), pages 1-44.
Francesco D’Acunto & Nagpurnanand Prabhala & Alberto G Rossi, 2019. "The Promises and Pitfalls of Robo-Advising," The Review of Financial Studies, Society for Financial Studies, vol. 32(5), pages 1983-2020.
Agostino Capponi & Sveinn Ólafsson & Thaleia Zariphopoulou, 2022. "Personalized Robo-Advising: Enhancing Investment Through Client Interaction," Management Science, INFORMS, vol. 68(4), pages 2485-2512, April.
Sargent, Thomas J, 1978. "Estimation of Dynamic Labor Demand Schedules under Rational Expectations," Journal of Political Economy, University of Chicago Press, vol. 86(6), pages 1009-1044, December.
- Thomas J. Sargent, 1978. "Estimation of dynamic labor demand schedules under rational expectations," Staff Report 27, Federal Reserve Bank of Minneapolis.
Nicole Bäuerle & Ulrich Rieder, 2014. "More Risk-Sensitive Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 39(1), pages 105-120, February.
Nicole El Karoui & Mohamed Mrad, 2021. "Recover Dynamic Utility from Observable Process: Application to the economic equilibrium," Post-Print hal-01966312, HAL.
Ying Hu & Hanqing Jin & Xun Yu Zhou, 2017. "Time-Inconsistent Stochastic Linear--Quadratic Control: Characterization and Uniqueness of Equilibrium," Post-Print hal-01139343, HAL.
Hanqing Jin & Xun Yu Zhou, 2008. "Behavioral Portfolio Selection In Continuous Time," Mathematical Finance, Wiley Blackwell, vol. 18(3), pages 385-426, July.
Jiwoong Shin & Jungju Yu, 2021. "Targeted Advertising and Consumer Inference," Marketing Science, INFORMS, vol. 40(5), pages 900-922, September.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Marcel Nutz & Yuchong Zhang, 2019. "Conditional Optimal Stopping: A Time-Inconsistent Optimization," Papers 1901.05802, arXiv.org, revised Oct 2019.
Jiaqin Wei & Jianming Xia & Qian Zhao, 2024. "Time-Consistent Portfolio Selection for Rank-Dependent Utilities in an Incomplete Market," Papers 2409.19259, arXiv.org.
Ying Hu & Hanqing Jin & Xun Yu Zhou, 2021. "Consistent investment of sophisticated rank‐dependent utility agents in continuous time," Mathematical Finance, Wiley Blackwell, vol. 31(3), pages 1056-1095, July.
Keffert, Henk, 2024. "Robo-advising: Optimal investment with mismeasured and unstable risk preferences," European Journal of Operational Research, Elsevier, vol. 315(1), pages 378-392.
Cardillo, Giovanni & Chiappini, Helen, 2024. "Robo-advisors: A systematic literature review," Finance Research Letters, Elsevier, vol. 62(PA).
Hu, Xiao & Kang, Siqin & Ren, Long & Zhu, Shaokeng, 2024. "Interactive preference analysis: A reinforcement learning framework," European Journal of Operational Research, Elsevier, vol. 319(3), pages 983-998.
Zhiping Chen & Liyuan Wang & Ping Chen & Haixiang Yao, 2019. "Continuous-Time Mean–Variance Optimization For Defined Contribution Pension Funds With Regime-Switching," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 22(06), pages 1-33, September.
Camilo Hern'andez & Dylan Possamai, 2020. "Me, myself and I: a general theory of non-Markovian time-inconsistent stochastic control for sophisticated agents," Papers 2002.12572, arXiv.org, revised Jul 2021.
Ying Hu & Hanqing Jin & Xun Yu Zhou, 2020. "Consistent Investment of Sophisticated Rank-Dependent Utility Agents in Continuous Time," Working Papers hal-02624308, HAL.
Wei Ji, 2024. "Closed-loop and open-loop equilibrium of a class time-inconsistent linear-quadratic differential games," International Journal of Game Theory, Springer;Game Theory Society, vol. 53(2), pages 635-651, June.
Yushi Hamaguchi, 2019. "Time-inconsistent consumption-investment problems in incomplete markets under general discount functions," Papers 1912.01281, arXiv.org, revised Mar 2021.
Zongxia Liang & Sheng Wang & Jianming Xia, 2024. "An Integral Equation in Portfolio Selection with Time-Inconsistent Preferences," Papers 2412.02446, arXiv.org, revised Jan 2025.
Fabian Wagner, 2024. "Determinants of conventional and digital investment advisory decisions: a systematic literature review," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-32, December.
Xue Dong He & Xun Yu Zhou, 2021. "Who Are I: Time Inconsistency and Intrapersonal Conflict and Reconciliation," Papers 2105.01829, arXiv.org.
Qian Lei & Chi Seng Pun, 2024. "A Malliavin Calculus Approach to Backward Stochastic Volterra Integral Equations," Papers 2412.19236, arXiv.org, revised Jan 2025.
Ying Hu & Hanqing Jin & Xun Yu Zhou, 2020. "Consistent Investment of Sophisticated Rank-Dependent Utility Agents in Continuous Time," Papers 2006.01979, arXiv.org.
Zongxia Liang & Sheng Wang & Jianming Xia & Fengyi Yuan, 2024. "Dynamic portfolio selection under generalized disappointment aversion," Papers 2401.08323, arXiv.org, revised Mar 2024.
Hanqing Jin & Yimin Yang, 2014. "Time-Inconsistent Mean-Utility Portfolio Selection with Moving Target," Papers 1402.6760, arXiv.org.
Kaawach, Said & Kowalewski, Oskar & Talavera, Oleksandr, 2024. "Automatic versus manual investing: Role of past performance," Journal of Financial Stability, Elsevier, vol. 74(C).
Nicole Maria Namyslo & Dominik Jung & Timo Sturm, 2025. "The state of robo-advisory design: A systematic consolidation of design requirements and recommendations," Electronic Markets, Springer;IIM University of St. Gallen, vol. 35(1), pages 1-29, December.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-UPT-2024-07-08 (Utility Models and Prospect Theory)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2405.15975. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Inference of Utilities and Time Preference in Sequential Decision-Making

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data