IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0238000.html
   My bibliography  Save this article

Stochastic gradient boosting frequency-severity model of insurance claims

Author

Listed:
  • Xiaoshan Su
  • Manying Bai

Abstract

The standard GLM and GAM frequency-severity models assume independence between the claim frequency and severity. To overcome restrictions of linear or additive forms and to relax the independence assumption, we develop a data-driven dependent frequency-severity model, where we combine a stochastic gradient boosting algorithm and a profile likelihood approach to estimate parameters for both of the claim frequency and average claim severity distributions, and where we introduce the dependence between the claim frequency and severity by treating the claim frequency as a predictor in the regression model for the average claim severity. The model can flexibly capture the nonlinear relation between the claim frequency (severity) and predictors and complex interactions among predictors and can fully capture the nonlinear dependence between the claim frequency and severity. A simulation study shows excellent prediction performance of our model. Then, we demonstrate the application of our model with a French auto insurance claim data. The results show that our model is superior to other state-of-the-art models.

Suggested Citation

  • Xiaoshan Su & Manying Bai, 2020. "Stochastic gradient boosting frequency-severity model of insurance claims," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-24, August.
  • Handle: RePEc:plo:pone00:0238000
    DOI: 10.1371/journal.pone.0238000
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0238000
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0238000&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0238000?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Frees, Edward W. & Shi, Peng & Valdez, Emiliano A., 2009. "Actuarial Applications of a Hierarchical Insurance Claims Model," ASTIN Bulletin, Cambridge University Press, vol. 39(1), pages 165-197, May.
    2. Krämer, Nicole & Brechmann, Eike C. & Silvestrini, Daniel & Czado, Claudia, 2013. "Total loss estimation using copula-based regression models," Insurance: Mathematics and Economics, Elsevier, vol. 53(3), pages 829-839.
    3. Edward Frees & Jie Gao & Marjorie Rosenberg, 2011. "Predicting the Frequency and Amount of Health Care Expenditures," North American Actuarial Journal, Taylor & Francis Journals, vol. 15(3), pages 377-392.
    4. Edward W. (Jed) Frees & Glenn Meyers & A. David Cummings, 2014. "Insurance Ratemaking and a Gini Index," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 81(2), pages 335-366, June.
    5. Garrido, J. & Genest, C. & Schulz, J., 2016. "Generalized linear models for dependent frequency and severity of insurance claims," Insurance: Mathematics and Economics, Elsevier, vol. 70(C), pages 205-215.
    6. Yi Yang & Wei Qian & Hui Zou, 2018. "Insurance Premium Prediction via Gradient Tree-Boosted Tweedie Compound Poisson Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(3), pages 456-470, July.
    7. Huang, Helai & Song, Bo & Xu, Pengpeng & Zeng, Qiang & Lee, Jaeyoung & Abdel-Aty, Mohamed, 2016. "Macro and micro models for zonal crash prediction with application in hot zones identification," Journal of Transport Geography, Elsevier, vol. 54(C), pages 248-256.
    8. Friedman, Jerome H., 2002. "Stochastic gradient boosting," Computational Statistics & Data Analysis, Elsevier, vol. 38(4), pages 367-378, February.
    9. Sigrist, Fabio & Hirnschall, Christoph, 2019. "Grabit: Gradient tree-boosted Tobit models for default prediction," Journal of Banking & Finance, Elsevier, vol. 102(C), pages 177-192.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Dong-Young Lim, 2021. "A Neural Frequency-Severity Model and Its Application to Insurance Claims," Papers 2106.10770, arXiv.org, revised Feb 2024.
    2. Marian Reiff & Erik Šoltés & Silvia Komara & Tatiana Šoltésová & Silvia Zelinová, 2022. "Segmentation and estimation of claim severity in motor third-party liability insurance through contrast analysis," Equilibrium. Quarterly Journal of Economics and Economic Policy, Institute of Economic Research, vol. 17(3), pages 803-842, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jeong, Himchan & Valdez, Emiliano A., 2020. "Predictive compound risk models with dependence," Insurance: Mathematics and Economics, Elsevier, vol. 94(C), pages 182-195.
    2. Oh, Rosy & Jeong, Himchan & Ahn, Jae Youn & Valdez, Emiliano A., 2021. "A multi-year microlevel collective risk model," Insurance: Mathematics and Economics, Elsevier, vol. 100(C), pages 309-328.
    3. Gao, Guangyuan & Li, Jiahong, 2023. "Dependence modeling of frequency-severity of insurance claims using waiting time," Insurance: Mathematics and Economics, Elsevier, vol. 109(C), pages 29-51.
    4. Dong-Young Lim, 2021. "A Neural Frequency-Severity Model and Its Application to Insurance Claims," Papers 2106.10770, arXiv.org, revised Feb 2024.
    5. Cossette, Hélène & Marceau, Etienne & Mtalai, Itre, 2019. "Collective risk models with dependence," Insurance: Mathematics and Economics, Elsevier, vol. 87(C), pages 153-168.
    6. Vernic, Raluca & Bolancé, Catalina & Alemany, Ramon, 2022. "Sarmanov distribution for modeling dependence between the frequency and the average severity of insurance claims," Insurance: Mathematics and Economics, Elsevier, vol. 102(C), pages 111-125.
    7. Carina Clemente & Gracinda R. Guerreiro & Jorge M. Bravo, 2023. "Modelling Motor Insurance Claim Frequency and Severity Using Gradient Boosting," Risks, MDPI, vol. 11(9), pages 1-20, September.
    8. Lee, Gee Y. & Shi, Peng, 2019. "A dependent frequency–severity approach to modeling longitudinal insurance claims," Insurance: Mathematics and Economics, Elsevier, vol. 87(C), pages 115-129.
    9. Pierre-Olivier Goffard & Patrick Laub, 2021. "Approximate Bayesian Computations to fit and compare insurance loss models," Working Papers hal-02891046, HAL.
    10. Garrido, J. & Genest, C. & Schulz, J., 2016. "Generalized linear models for dependent frequency and severity of insurance claims," Insurance: Mathematics and Economics, Elsevier, vol. 70(C), pages 205-215.
    11. Hua, Lei, 2015. "Tail negative dependence and its applications for aggregate loss modeling," Insurance: Mathematics and Economics, Elsevier, vol. 61(C), pages 135-145.
    12. Zifeng Zhao & Peng Shi & Xiaoping Feng, 2021. "Knowledge Learning of Insurance Risks Using Dependence Models," INFORMS Journal on Computing, INFORMS, vol. 33(3), pages 1177-1196, July.
    13. Michel Denuit & Yang Lu, 2021. "Wishart‐gamma random effects models with applications to nonlife insurance," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 88(2), pages 443-481, June.
    14. Edward W. Frees & Gee Lee & Lu Yang, 2016. "Multivariate Frequency-Severity Regression Models in Insurance," Risks, MDPI, vol. 4(1), pages 1-36, February.
    15. Araichi, Sawssen & Peretti, Christian de & Belkacem, Lotfi, 2017. "Reserve modelling and the aggregation of risks using time varying copula models," Economic Modelling, Elsevier, vol. 67(C), pages 149-158.
    16. Syuhada, Khreshna & Tjahjono, Venansius & Hakim, Arief, 2024. "Compound Poisson–Lindley process with Sarmanov dependence structure and its application for premium-based spectral risk forecasting," Applied Mathematics and Computation, Elsevier, vol. 467(C).
    17. Goffard, Pierre-Olivier & Laub, Patrick J., 2021. "Approximate Bayesian Computations to fit and compare insurance loss models," Insurance: Mathematics and Economics, Elsevier, vol. 100(C), pages 350-371.
    18. Cheung, Eric C.K. & Ni, Weihong & Oh, Rosy & Woo, Jae-Kyung, 2021. "Bayesian credibility under a bivariate prior on the frequency and the severity of claims," Insurance: Mathematics and Economics, Elsevier, vol. 100(C), pages 274-295.
    19. Övgücan Karadağ Erdemir, 2023. "A Comparative Perspective on Multivariate Modeling of Insurance Compensation Payments with Regression-Based and Copula-Based Models," EKOIST Journal of Econometrics and Statistics, Istanbul University, Faculty of Economics, vol. 0(39), pages 161-171, December.
    20. Shi, Peng & Feng, Xiaoping & Ivantsova, Anastasia, 2015. "Dependent frequency–severity modeling of insurance claims," Insurance: Mathematics and Economics, Elsevier, vol. 64(C), pages 417-428.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0238000. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.