IDEAS home Printed from https://ideas.repec.org/a/gam/jrisks/v12y2024i9p141-d1472255.html
   My bibliography  Save this article

Insurance Analytics with Clustering Techniques

Author

Listed:
  • Charlotte Jamotton

    (Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA), Université Catholique de Louvain (UCLouvain), 1348 Louvain-la-Neuve, Belgium)

  • Donatien Hainaut

    (Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA), Université Catholique de Louvain (UCLouvain), 1348 Louvain-la-Neuve, Belgium)

  • Thomas Hames

    (Detralytics, Rue Belliard 2-B, 1040 Brussels, Belgium)

Abstract

The K-means algorithm and its variants are well-known clustering techniques. In actuarial applications, these partitioning methods can identify clusters of policies with similar attributes. The resulting partitions provide an actuarial framework for creating maps of dominant risks and unsupervised pricing grids. This research article aims to adapt well-established clustering methods to complex insurance datasets containing both categorical and numerical variables. To achieve this, we propose a novel approach based on Burt distance. We begin by reviewing the K-means algorithm to establish the foundation for our Burt distance-based framework. Next, we extend the scope of application of the mini-batch and fuzzy K-means variants to heterogeneous insurance data. Additionally, we adapt spectral clustering, a technique based on graph theory that accommodates non-convex cluster shapes. To mitigate the computational complexity associated with spectral clustering’s O ( n 3 ) runtime, we introduce a data reduction method for large-scale datasets using our Burt distance-based approach.

Suggested Citation

  • Charlotte Jamotton & Donatien Hainaut & Thomas Hames, 2024. "Insurance Analytics with Clustering Techniques," Risks, MDPI, vol. 12(9), pages 1-28, September.
  • Handle: RePEc:gam:jrisks:v:12:y:2024:i:9:p:141-:d:1472255
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-9091/12/9/141/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-9091/12/9/141/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hainaut, Donatien, 2019. "A self-organizing predictive map for non-life insurance," LIDAM Reprints ISBA 2019026, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    2. Guojun Gan & Emiliano A. Valdez, 2020. "Data Clustering with Actuarial Applications," North American Actuarial Journal, Taylor & Francis Journals, vol. 24(2), pages 168-186, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jamotton, Charlotte & Hainaut, Donatien & Hames, Thomas, 2023. "Insurance analytics with clustering techniques," LIDAM Discussion Papers ISBA 2023002, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    2. Jiang, Ruihong & Saunders, David & Weng, Chengguo, 2023. "Two-phase selection of representative contracts for valuation of large variable annuity portfolios," Insurance: Mathematics and Economics, Elsevier, vol. 113(C), pages 293-309.
    3. Jamotton, Charlotte & Hainaut, Donatien, 2024. "Latent Dirichlet Allocation for structured insurance data," LIDAM Discussion Papers ISBA 2024008, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    4. Nicholas Bett & Juma Kasozi & Daniel Ruturwa, 2022. "Temporal Clustering of the Causes of Death for Mortality Modelling," Risks, MDPI, vol. 10(5), pages 1-34, May.
    5. Romain Gauchon & Stéphane Loisel & Jean-Louis Rullière, 2020. "Health-policyholder clustering using health consumption," Post-Print hal-02156058, HAL.
    6. Shuang Yin & Guojun Gan & Emiliano A. Valdez & Jeyaraj Vadiveloo, 2021. "Applications of Clustering with Mixed Type Data in Life Insurance," Risks, MDPI, vol. 9(3), pages 1-19, March.
    7. Shengkun Xie & Kun Shi, 2023. "Generalised Additive Modelling of Auto Insurance Data with Territory Design: A Rate Regulation Perspective," Mathematics, MDPI, vol. 11(2), pages 1-24, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jrisks:v:12:y:2024:i:9:p:141-:d:1472255. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.