IDEAS home Printed from https://ideas.repec.org/a/gam/jrisks/v9y2021i3p47-d509780.html
   My bibliography  Save this article

Applications of Clustering with Mixed Type Data in Life Insurance

Author

Listed:
  • Shuang Yin

    (Department of Statistics, University of Connecticut, 215 Glenbrook Road, Storrs, CT 06269-4120, USA)

  • Guojun Gan

    (Department of Mathematics, University of Connecticut, 341 Mansfield Road, Storrs, CT 06269-1009, USA)

  • Emiliano A. Valdez

    (Department of Mathematics, University of Connecticut, 341 Mansfield Road, Storrs, CT 06269-1009, USA)

  • Jeyaraj Vadiveloo

    (Department of Mathematics, University of Connecticut, 341 Mansfield Road, Storrs, CT 06269-1009, USA)

Abstract

Death benefits are generally the largest cash flow items that affect the financial statements of life insurers; some may still not have a systematic process to track and monitor death claims. In this article, we explore data clustering to examine and understand how actual death claims differ from what is expected—an early stage of developing a monitoring system crucial for risk management. We extended the k -prototype clustering algorithm to draw inferences from a life insurance dataset using only the insured’s characteristics and policy information without regard to known mortality. This clustering has the feature of efficiently handling categorical, numerical, and spatial attributes. Using gap statistics, the optimal clusters obtained from the algorithm are then used to compare actual to expected death claims experience of the life insurance portfolio. Our empirical data contained observations of approximately 1.14 million policies with a total insured amount of over 650 billion dollars. For this portfolio, the algorithm produced three natural clusters, with each cluster having lower actual to expected death claims but with differing variability. The analytical results provide management a process to identify policyholders’ attributes that dominate significant mortality deviations, and thereby enhance decision making for taking necessary actions.

Suggested Citation

  • Shuang Yin & Guojun Gan & Emiliano A. Valdez & Jeyaraj Vadiveloo, 2021. "Applications of Clustering with Mixed Type Data in Life Insurance," Risks, MDPI, vol. 9(3), pages 1-19, March.
  • Handle: RePEc:gam:jrisks:v:9:y:2021:i:3:p:47-:d:509780
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-9091/9/3/47/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-9091/9/3/47/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Sfyridis, Alexandros & Agnolucci, Paolo, 2020. "Annual average daily traffic estimation in England and Wales: An application of clustering and regression modelling," Journal of Transport Geography, Elsevier, vol. 83(C).
    2. Dickson,David C. M. & Hardy,Mary R. & Waters,Howard R., 2013. "Actuarial Mathematics for Life Contingent Risks," Cambridge Books, Cambridge University Press, number 9781107044074, October.
    3. Gan Guojun & Valdez Emiliano A., 2016. "An empirical comparison of some experimental designs for the valuation of large variable annuity portfolios," Dependence Modeling, De Gruyter, vol. 4(1), pages 1-19, December.
    4. Dickson,David C. M. & Hardy,Mary R. & Waters,Howard R., 2013. "Solutions Manual for Actuarial Mathematics for Life Contingent Risks," Cambridge Books, Cambridge University Press, number 9781107620261, February.
    5. Robert Tibshirani & Guenther Walther & Trevor Hastie, 2001. "Estimating the number of clusters in a data set via the gap statistic," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 63(2), pages 411-423.
    6. Gan, Guojun & Lin, X. Sheldon, 2015. "Valuation of large variable annuity portfolios under nested simulation: A functional data approach," Insurance: Mathematics and Economics, Elsevier, vol. 62(C), pages 138-150.
    7. Gan, Guojun, 2013. "Application of data clustering and machine learning in variable annuity valuation," Insurance: Mathematics and Economics, Elsevier, vol. 53(3), pages 795-801.
    8. Guojun Gan & Emiliano A. Valdez, 2020. "Data Clustering with Actuarial Applications," North American Actuarial Journal, Taylor & Francis Journals, vol. 24(2), pages 168-186, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Rabea Aschenbruck & Gero Szepannek & Adalbert F. X. Wilhelm, 2023. "Imputation Strategies for Clustering Mixed-Type Data with Missing Values," Journal of Classification, Springer;The Classification Society, vol. 40(1), pages 2-24, April.
    2. Alokananda Dey & Siddhartha Bhattacharyya & Sandip Dey & Debanjan Konar & Jan Platos & Vaclav Snasel & Leo Mrsic & Pankaj Pal, 2023. "A Review of Quantum-Inspired Metaheuristic Algorithms for Automatic Clustering," Mathematics, MDPI, vol. 11(9), pages 1-44, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jiang, Ruihong & Saunders, David & Weng, Chengguo, 2023. "Two-phase selection of representative contracts for valuation of large variable annuity portfolios," Insurance: Mathematics and Economics, Elsevier, vol. 113(C), pages 293-309.
    2. Guojun Gan, 2018. "Valuation of Large Variable Annuity Portfolios Using Linear Models with Interactions," Risks, MDPI, vol. 6(3), pages 1-19, July.
    3. Daniel Doyle & Chris Groendyke, 2018. "Using Neural Networks to Price and Hedge Variable Annuity Guarantees," Risks, MDPI, vol. 7(1), pages 1-19, December.
    4. Jin Sun & Eckhard Platen, 2019. "Benchmarked Risk Minimizing Hedging Strategies for Life Insurance Policies," Research Paper Series 399, Quantitative Finance Research Centre, University of Technology, Sydney.
    5. Wang, Gu & Zou, Bin, 2021. "Optimal fee structure of variable annuities," Insurance: Mathematics and Economics, Elsevier, vol. 101(PB), pages 587-601.
    6. Thorsten Moenig, 2021. "Efficient valuation of variable annuity portfolios with dynamic programming," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 88(4), pages 1023-1055, December.
    7. Raj Kumari Bahl & Sotirios Sabanis, 2017. "General Price Bounds for Guaranteed Annuity Options," Papers 1707.00807, arXiv.org.
    8. Lee, Hangsuck & Ahn, Jae Youn & Ko, Bangwon, 2019. "Construction of multiple decrement tables under generalized fractional age assumptions," Computational Statistics & Data Analysis, Elsevier, vol. 133(C), pages 104-119.
    9. Runhuan Feng & Peng Li, 2021. "Sample Recycling Method -- A New Approach to Efficient Nested Monte Carlo Simulations," Papers 2106.06028, arXiv.org.
    10. Massimo Costabile & Fabio Viviano, 2021. "Modeling the Future Value Distribution of a Life Insurance Portfolio," Risks, MDPI, vol. 9(10), pages 1-17, October.
    11. Deelstra, Griselda & Grasselli, Martino & Van Weverberg, Christopher, 2016. "The role of the dependence between mortality and interest rates when pricing Guaranteed Annuity Options," Insurance: Mathematics and Economics, Elsevier, vol. 71(C), pages 205-219.
    12. Thomas Bernhardt & Catherine Donnelly, 2019. "Modern tontine with bequest: innovation in pooled annuity products," Papers 1903.05990, arXiv.org.
    13. Manuel L. Esquível & Gracinda R. Guerreiro & Matilde C. Oliveira & Pedro Corte Real, 2021. "Calibration of Transition Intensities for a Multistate Model: Application to Long-Term Care," Risks, MDPI, vol. 9(2), pages 1-17, February.
    14. Seyed Amir Hejazi & Kenneth R. Jackson, 2016. "A Neural Network Approach to Efficient Valuation of Large Portfolios of Variable Annuities," Papers 1606.07831, arXiv.org.
    15. Guojun Gan & Emiliano A. Valdez, 2018. "Nested Stochastic Valuation of Large Variable Annuity Portfolios: Monte Carlo Simulation and Synthetic Datasets," Data, MDPI, vol. 3(3), pages 1-21, September.
    16. Lin, X. Sheldon & Yang, Shuai, 2020. "Fast and efficient nested simulation for large variable annuity portfolios: A surrogate modeling approach," Insurance: Mathematics and Economics, Elsevier, vol. 91(C), pages 85-103.
    17. Gan Guojun & Valdez Emiliano A., 2017. "Valuation of large variable annuity portfolios: Monte Carlo simulation and synthetic datasets," Dependence Modeling, De Gruyter, vol. 5(1), pages 354-374, December.
    18. Barigou, Karim & Goffard, Pierre-Olivier & Loisel, Stéphane & Salhi, Yahia, 2023. "Bayesian model averaging for mortality forecasting using leave-future-out validation," International Journal of Forecasting, Elsevier, vol. 39(2), pages 674-690.
    19. Wing Fung Chong & Haoen Cui & Yuxuan Li, 2021. "Pseudo-Model-Free Hedging for Variable Annuities via Deep Reinforcement Learning," Papers 2107.03340, arXiv.org, revised Oct 2022.
    20. Nicholas Bett & Juma Kasozi & Daniel Ruturwa, 2022. "Temporal Clustering of the Causes of Death for Mortality Modelling," Risks, MDPI, vol. 10(5), pages 1-34, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jrisks:v:9:y:2021:i:3:p:47-:d:509780. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.