IDEAS home Printed from https://ideas.repec.org/a/gam/jjrfmx/v17y2024i1p25-d1316599.html
   My bibliography  Save this article

Exploratory Dividend Optimization with Entropy Regularization

Author

Listed:
  • Sang Hu

    (School of Data Science, The Chinese University of Hong Kong, Shenzhen 518172, China)

  • Zihan Zhou

    (School of Data Science, The Chinese University of Hong Kong, Shenzhen 518172, China)

Abstract

This study investigates the dividend optimization problem in the entropy regularization framework in the continuous-time reinforcement learning setting. The exploratory HJB is established, and the optimal exploratory dividend policy is a truncated exponential distribution. We show that, for suitable choices of the maximal dividend-paying rate and the temperature parameter, the value function of the exploratory dividend optimization problem can be significantly different from the value function in the classical dividend optimization problem. In particular, the value function of the exploratory dividend optimization problem can be classified into three cases based on its monotonicity. Additionally, numerical examples are presented to show the effect of the temperature parameter on the solution. Our results suggest that insurance companies can adopt new exploratory dividend payout strategies in unknown market environments.

Suggested Citation

  • Sang Hu & Zihan Zhou, 2024. "Exploratory Dividend Optimization with Entropy Regularization," JRFM, MDPI, vol. 17(1), pages 1-23, January.
  • Handle: RePEc:gam:jjrfmx:v:17:y:2024:i:1:p:25-:d:1316599
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1911-8074/17/1/25/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1911-8074/17/1/25/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Haoran Wang & Xun Yu Zhou, 2020. "Continuous‐time mean–variance portfolio selection: A reinforcement learning framework," Mathematical Finance, Wiley Blackwell, vol. 30(4), pages 1273-1308, October.
    2. Hans Gerber & Elias Shiu, 2006. "On Optimal Dividend Strategies In The Compound Poisson Model," North American Actuarial Journal, Taylor & Francis Journals, vol. 10(2), pages 76-93.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alexandre Carbonneau & Fr'ed'eric Godin, 2021. "Deep equal risk pricing of financial derivatives with non-translation invariant risk measures," Papers 2107.11340, arXiv.org.
    2. Xu, Ran & Woo, Jae-Kyung, 2020. "Optimal dividend and capital injection strategy with a penalty payment at ruin: Restricted dividend payments," Insurance: Mathematics and Economics, Elsevier, vol. 92(C), pages 1-16.
    3. Yang, Hu & Zhang, Zhimin, 2008. "Gerber-Shiu discounted penalty function in a Sparre Andersen model with multi-layer dividend strategy," Insurance: Mathematics and Economics, Elsevier, vol. 42(3), pages 984-991, June.
    4. Lin, X. Sheldon & Sendova, Kristina P., 2008. "The compound Poisson risk model with multiple thresholds," Insurance: Mathematics and Economics, Elsevier, vol. 42(2), pages 617-627, April.
    5. Yongwu Li & Zhongfei Li & Yan Zeng, 2016. "Equilibrium Dividend Strategy with Non-exponential Discounting in a Dual Model," Journal of Optimization Theory and Applications, Springer, vol. 168(2), pages 699-722, February.
    6. Chi, Yichun & Lin, X. Sheldon, 2011. "On the threshold dividend strategy for a generalized jump-diffusion risk model," Insurance: Mathematics and Economics, Elsevier, vol. 48(3), pages 326-337, May.
    7. Chonghu Guan & Jiacheng Fan & Zuo Quan Xu, 2023. "Optimal dividend payout with path-dependent drawdown constraint," Papers 2312.01668, arXiv.org.
    8. Renaud, Jean-François, 2009. "The distribution of tax payments in a Lévy insurance risk model with a surplus-dependent taxation structure," Insurance: Mathematics and Economics, Elsevier, vol. 45(2), pages 242-246, October.
    9. Min Dai & Yuchao Dong & Yanwei Jia & Xun Yu Zhou, 2023. "Learning Merton's Strategies in an Incomplete Market: Recursive Entropy Regularization and Biased Gaussian Exploration," Papers 2312.11797, arXiv.org.
    10. Ran Xu & Wenyuan Wang & Jose Garrido, 2022. "Optimal Dividend Strategy Under Parisian Ruin with Affine Penalty," Methodology and Computing in Applied Probability, Springer, vol. 24(3), pages 1385-1409, September.
    11. Ng, Andrew C.Y., 2009. "On a dual model with a dividend threshold," Insurance: Mathematics and Economics, Elsevier, vol. 44(2), pages 315-324, April.
    12. Thonhauser, Stefan & Albrecher, Hansjorg, 2007. "Dividend maximization under consideration of the time value of ruin," Insurance: Mathematics and Economics, Elsevier, vol. 41(1), pages 163-184, July.
    13. Wing Fung Chong & Haoen Cui & Yuxuan Li, 2021. "Pseudo-Model-Free Hedging for Variable Annuities via Deep Reinforcement Learning," Papers 2107.03340, arXiv.org, revised Oct 2022.
    14. Chonghu Guan & Zuo Quan Xu, 2023. "Optimal ratcheting of dividend payout under Brownian motion surplus," Papers 2308.15048, arXiv.org.
    15. Xia Han & Ruodu Wang & Xun Yu Zhou, 2022. "Choquet regularization for reinforcement learning," Papers 2208.08497, arXiv.org.
    16. Feng, Runhuan, 2009. "On the total operating costs up to default in a renewal risk model," Insurance: Mathematics and Economics, Elsevier, vol. 45(2), pages 305-314, October.
    17. Xiaofei Shi & Daran Xu & Zhanhao Zhang, 2021. "Deep Learning Algorithms for Hedging with Frictions," Papers 2111.01931, arXiv.org, revised Dec 2022.
    18. Yang, Hu & Zhang, Zhimin & Lan, Chunmei, 2008. "On the time value of absolute ruin for a multi-layer compound Poisson model under interest force," Statistics & Probability Letters, Elsevier, vol. 78(13), pages 1835-1845, September.
    19. Lu, Yi & Li, Shuanming, 2009. "The Markovian regime-switching risk model with a threshold dividend strategy," Insurance: Mathematics and Economics, Elsevier, vol. 44(2), pages 296-303, April.
    20. Shi, Yafeng & Liu, Peng & Zhang, Chunsheng, 2013. "On the compound Poisson risk model with dependence and a threshold dividend strategy," Statistics & Probability Letters, Elsevier, vol. 83(9), pages 1998-2006.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jjrfmx:v:17:y:2024:i:1:p:25-:d:1316599. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.