Printed from https://ideas.repec.org/a/gam/jrisks/v8y2020i1p19-d322684.html

Delta Boosting Implementation of Negative Binomial Regression in Actuarial Pricing

Author

Listed:
  • Simon CK Lee

    (Department of Statistics and Actuarial Science, The University of Hong Kong, Pokfulam Road, Hong Kong)

Abstract

This study proposes an effective approach to analyzing over-dispersed insurance frequency data. Insurers need decisive, informative insights to underwrite and price insurance products precisely, retain their existing customer base, and gain an edge in the highly competitive retail insurance market. The delta boosting implementation of negative binomial regression, both by one-parameter estimation and by a novel two-parameter estimation, was tested on empirical data. Accurate parameter estimation of the negative binomial regression is complicated by incomplete insurance exposures, negative convexity, and co-linearity. These issues mainly originate from the unique nature of insurance operations and the adoption of a distribution outside the exponential family. We studied how the issues could significantly impact the quality of estimation. In addition to a novel approach to simultaneously estimating two parameters in regression through boosting, we further enrich the study by proposing an alteration of the base algorithm to address the problems. The algorithm held its own against popular regression methodologies on a real-life dataset. Common diagnostics were applied to compare the performance of the relevant candidates, leading to our conclusion to move from the light-tailed Poisson to the negative binomial for over-dispersed data, from the generalized linear model (GLM) to boosting for non-linear and interaction patterns, and from one-parameter to two-parameter estimation to reflect reality more closely.
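
To make the abstract's terms concrete, here is a minimal sketch (not the paper's delta boosting algorithm) of what "over-dispersed" means and how a negative binomial log-likelihood with partial exposure differs from the Poisson one. The function names, the NB2 parameterization (variance = mu + alpha * mu^2), and the sample counts are illustrative assumptions, not taken from the paper.

```python
import math


def dispersion_ratio(counts):
    """Sample variance divided by sample mean; values well above 1 suggest over-dispersion."""
    n = len(counts)
    mean = sum(counts) / n
    var = sum((c - mean) ** 2 for c in counts) / (n - 1)
    return var / mean


def poisson_log_likelihood(y, rate, exposure=1.0):
    """Poisson log-likelihood of count y; the mean scales with exposure (e.g. policy-years)."""
    mu = rate * exposure
    return y * math.log(mu) - mu - math.lgamma(y + 1)


def nb_log_likelihood(y, rate, alpha, exposure=1.0):
    """Negative binomial (NB2, assumed parameterization) log-likelihood.

    mean = rate * exposure, variance = mean + alpha * mean**2.
    alpha > 0 is the dispersion parameter; alpha -> 0 recovers the Poisson case.
    """
    mu = rate * exposure
    r = 1.0 / alpha  # shape parameter
    return (
        math.lgamma(y + r) - math.lgamma(r) - math.lgamma(y + 1)
        + r * math.log(r / (r + mu))
        + y * math.log(mu / (r + mu))
    )


# Hypothetical claim counts for ten policies (made-up numbers for illustration only).
claims = [0, 0, 0, 0, 1, 0, 0, 5, 0, 2]
ratio = dispersion_ratio(claims)

# A policy observed for half a year (incomplete exposure) with an annual rate of 0.8.
ll_poisson = poisson_log_likelihood(2, rate=0.8, exposure=0.5)
ll_nb = nb_log_likelihood(2, rate=0.8, alpha=1.5, exposure=0.5)
```

In a one-parameter fit only the rate would be modeled as a function of covariates with alpha held fixed; a two-parameter fit would let both vary. How the paper does this through boosting is not reproduced here.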

Suggested Citation

  • Simon CK Lee, 2020. "Delta Boosting Implementation of Negative Binomial Regression in Actuarial Pricing," Risks, MDPI, vol. 8(1), pages 1-21, February.
  • Handle: RePEc:gam:jrisks:v:8:y:2020:i:1:p:19-:d:322684

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-9091/8/1/19/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-9091/8/1/19/
    Download Restriction: no

    References listed on IDEAS

    1. repec:cup:cbooks:9780521879149 is not listed on IDEAS
    2. Kevin Kuo, 2018. "DeepTriangle: A Deep Learning Approach to Loss Reserving," Papers 1804.09253, arXiv.org, revised Sep 2019.
    3. Yi Yang & Wei Qian & Hui Zou, 2018. "Insurance Premium Prediction via Gradient Tree-Boosted Tweedie Compound Poisson Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(3), pages 456-470, July.
    4. David Scollnik, 2001. "Actuarial Modeling with MCMC and BUGs," North American Actuarial Journal, Taylor & Francis Journals, vol. 5(2), pages 96-124.
    5. Kevin Kuo, 2019. "DeepTriangle: A Deep Learning Approach to Loss Reserving," Risks, MDPI, vol. 7(3), pages 1-12, September.
    6. Mario V. Wüthrich, 2018. "Machine learning in individual claims reserving," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2018(6), pages 465-480, July.
    7. David Mihaela & Jemna Dănuţ-Vasile, 2015. "Modeling the Frequency of Auto Insurance Claims by Means of Poisson and Negative Binomial Models," Scientific Annals of Economics and Business, Sciendo, vol. 62(2), pages 151-168, July.
    8. Maximilien Baudry & Christian Y. Robert, 2019. "A machine learning approach for individual claims reserving in insurance," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 35(5), pages 1127-1155, September.
    9. Jean‐Philippe Boucher & Michel Denuit & Montserrat Guillen, 2009. "Number of Accidents or Number of Claims? An Approach with Zero‐Inflated Poisson Models for Panel Data," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 76(4), pages 821-846, December.
    10. Jozef L. Teugels & Petra Vynckier, 1996. "The structure distribution in a mixed Poisson process," International Journal of Stochastic Analysis, Hindawi, vol. 9, pages 1-8, January.
    11. Yip, Karen C.H. & Yau, Kelvin K.W., 2005. "On modeling claim frequency data in general insurance with extra zeros," Insurance: Mathematics and Economics, Elsevier, vol. 36(2), pages 153-163, April.
    12. Simon C. K. Lee & Sheldon Lin, 2018. "Delta Boosting Machine with Application to General Insurance," North American Actuarial Journal, Taylor & Francis Journals, vol. 22(3), pages 405-425, July.
    13. Greg Taylor, 2019. "Loss Reserving Models: Granular and Machine Learning Forms," Risks, MDPI, vol. 7(3), pages 1-18, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christopher Blier-Wong & Hélène Cossette & Luc Lamontagne & Etienne Marceau, 2020. "Machine Learning in P&C Insurance: A Review for Pricing and Reserving," Risks, MDPI, vol. 9(1), pages 1-26, December.
    2. Stephan M. Bischofberger, 2020. "In-Sample Hazard Forecasting Based on Survival Models with Operational Time," Risks, MDPI, vol. 8(1), pages 1-17, January.
    3. Łukasz Delong & Mario V. Wüthrich, 2020. "Neural Networks for the Joint Development of Individual Payments and Claim Incurred," Risks, MDPI, vol. 8(2), pages 1-34, April.
    4. Greg Taylor, 2019. "Risks Special Issue on “Granular Models and Machine Learning Models”," Risks, MDPI, vol. 8(1), pages 1-2, December.
    5. Kevin Kuo & Daniel Lupton, 2020. "Towards Explainability of Machine Learning Models in Insurance Pricing," Papers 2003.10674, arXiv.org.
    6. Avanzi, Benjamin & Taylor, Greg & Wang, Melantha & Wong, Bernard, 2021. "SynthETIC: An individual insurance claim simulator with feature control," Insurance: Mathematics and Economics, Elsevier, vol. 100(C), pages 296-308.
    7. Payandeh Najafabadi Amir T. & MohammadPour Saeed, 2018. "A k-Inflated Negative Binomial Mixture Regression Model: Application to Rate–Making Systems," Asia-Pacific Journal of Risk and Insurance, De Gruyter, vol. 12(2), pages 1-31, July.
    8. Benjamin Avanzi & Yanfeng Li & Bernard Wong & Alan Xian, 2022. "Ensemble distributional forecasting for insurance loss reserving," Papers 2206.08541, arXiv.org, revised Jun 2024.
    9. Xu, Shuzhe & Zhang, Chuanlong & Hong, Don, 2022. "BERT-based NLP techniques for classification and severity modeling in basic warranty data study," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 57-67.
    10. Zhao, Xiaobing & Zhou, Xian, 2012. "Copula models for insurance claim numbers with excess zeros and time-dependence," Insurance: Mathematics and Economics, Elsevier, vol. 50(1), pages 191-199.
    11. Zhiyu Quan & Changyue Hu & Panyi Dong & Emiliano A. Valdez, 2024. "Improving Business Insurance Loss Models by Leveraging InsurTech Innovation," Papers 2401.16723, arXiv.org.
    12. Marjan Qazvini, 2019. "On the Validation of Claims with Excess Zeros in Liability Insurance: A Comparative Study," Risks, MDPI, vol. 7(3), pages 1-17, June.
    13. Xuejun Jiang & Yunxian Li & Aijun Yang & Ruowei Zhou, 2020. "Bayesian semiparametric quantile regression modeling for estimating earthquake fatality risk," Empirical Economics, Springer, vol. 58(5), pages 2085-2103, May.
    14. Zhengmin Duan & Yonglian Chang & Qi Wang & Tianyao Chen & Qing Zhao, 2018. "A Logistic Regression Based Auto Insurance Rate-Making Model Designed for the Insurance Rate Reform," IJFS, MDPI, vol. 6(1), pages 1-16, February.
    15. Yang Qiao & Chou-Wen Wang & Wenjun Zhu, 2024. "Machine learning in long-term mortality forecasting," The Geneva Papers on Risk and Insurance - Issues and Practice, Palgrave Macmillan;The Geneva Association, vol. 49(2), pages 340-362, April.
    16. Muhammed Taher Al-Mudafer & Benjamin Avanzi & Greg Taylor & Bernard Wong, 2021. "Stochastic loss reserving with mixture density neural networks," Papers 2108.07924, arXiv.org.
    17. Trufin, Julien & Denuit, Michel, 2021. "Boosting cost-complexity pruned trees On Tweedie responses: the ABT machine," LIDAM Discussion Papers ISBA 2021015, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    18. Hainaut, Donatien & Trufin, Julien & Denuit, Michel, 2021. "Response versus gradient boosting trees, GLMs and neural networks under Tweedie loss and log-link," LIDAM Discussion Papers ISBA 2021012, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    19. Valandis Elpidorou & Carolin Margraf & María Dolores Martínez-Miranda & Bent Nielsen, 2019. "A Likelihood Approach to Bornhuetter–Ferguson Analysis," Risks, MDPI, vol. 7(4), pages 1-20, December.
    20. Ramon Alemany & Catalina Bolancé & Roberto Rodrigo & Raluca Vernic, 2020. "Bivariate Mixed Poisson and Normal Generalised Linear Models with Sarmanov Dependence—An Application to Model Claim Frequency and Optimal Transformed Average Severity," Mathematics, MDPI, vol. 9(1), pages 1-18, December.
