IDEAS home Printed from https://ideas.repec.org/a/gam/jrisks/v11y2023i9p164-d1242230.html
   My bibliography  Save this article

Machine Learning in Forecasting Motor Insurance Claims

Author

Listed:
  • Thomas Poufinas

    (Department of Economics, Democritus University of Thrace, 69100 Komotini, Greece)

  • Periklis Gogas

    (Department of Economics, Democritus University of Thrace, 69100 Komotini, Greece)

  • Theophilos Papadimitriou

    (Department of Economics, Democritus University of Thrace, 69100 Komotini, Greece)

  • Emmanouil Zaganidis

    (Department of Economics, Democritus University of Thrace, 69100 Komotini, Greece)

Abstract

Accurate forecasting of insurance claims is of the utmost importance for insurance activity as the evolution of claims determines cash outflows and the pricing, and thus the profitability, of the underlying insurance coverage. These are used as inputs when the insurance company drafts its business plan and determines its risk appetite, and the respective solvency capital required (by the regulators) to absorb the assumed risks. The conventional claim forecasting methods attempt to fit (each of) the claims frequency and severity with a known probability distribution function and use it to project future claims. This study offers a fresh approach in insurance claims forecasting. First, we introduce two novel sets of variables, i.e., weather conditions and car sales, and second, we employ a battery of Machine Learning (ML) algorithms (Support Vector Machines—SVM, Decision Trees, Random Forests, and Boosting) to forecast the average (mean) insurance claim per insured car per quarter. Finally, we identify the variables that are the most influential in forecasting insurance claims. Our dataset comes from the motor portfolio of an insurance company operating in Athens, Greece and spans a period from 2008 to 2020. We found evidence that the three most informative variables pertain to the new car sales with a 3-quarter and 1-quarter lag and the minimum temperature of Elefsina (one of the weather stations in Athens) with a 3-quarter lag. Among the models tested, Random Forest with limited depth and XGboost run on the 15 most informative variables, and these exhibited the best performance. These findings can be useful in the hands of insurers as they can consider the weather conditions and the new car sales among the parameters that are considered to perform claims forecasting.

Suggested Citation

  • Thomas Poufinas & Periklis Gogas & Theophilos Papadimitriou & Emmanouil Zaganidis, 2023. "Machine Learning in Forecasting Motor Insurance Claims," Risks, MDPI, vol. 11(9), pages 1-19, September.
  • Handle: RePEc:gam:jrisks:v:11:y:2023:i:9:p:164-:d:1242230
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-9091/11/9/164/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-9091/11/9/164/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Maximilien Baudry & Christian Y. Robert, 2019. "A machine learning approach for individual claims reserving in insurance," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 35(5), pages 1127-1155, September.
    2. Jan Reig Torra & Montserrat Guillen & Ana M. Pérez-Marín & Lorena Rey Gámez & Giselle Aguer, 2023. "Weather Conditions and Telematics Panel Data in Monthly Motor Insurance Claim Frequency Models," Risks, MDPI, vol. 11(3), pages 1-18, March.
    3. Paruchuri, Harish, 2020. "The Impact of Machine Learning on the Future of Insurance Industry," American Journal of Trade and Policy, Asian Business Consortium, vol. 7(3), pages 85-90.
    4. Jessica Pesantez-Narvaez & Montserrat Guillen & Manuela Alcañiz, 2019. "Predicting Motor Insurance Claims Using Telematics Data—XGBoost versus Logistic Regression," Risks, MDPI, vol. 7(2), pages 1-16, June.
    5. Roel Henckaerts & Marie-Pier Côté & Katrien Antonio & Roel Verbelen, 2021. "Boosting Insights in Insurance Tariff Plans with Tree-Based Machine Learning Methods," North American Actuarial Journal, Taylor & Francis Journals, vol. 25(2), pages 255-285, April.
    6. Knighton, James & Buchanan, Brian & Guzman, Christian & Elliott, Rebecca & White, Eric & Rahm, Brian, 2020. "Predicting flood insurance claims with hydrologic and socioeconomic demographics via machine learning: exploring the roles of topography, minority populations, and political dissimilarity," LSE Research Online Documents on Economics 105761, London School of Economics and Political Science, LSE Library.
    7. Marjan Qazvini, 2019. "On the Validation of Claims with Excess Zeros in Liability Insurance: A Comparative Study," Risks, MDPI, vol. 7(3), pages 1-17, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Seyed Farshid Ghorashi & Maziyar Bahri & Atousa Goodarzi, 2024. "Developing and comparing machine learning approaches for predicting insurance penetration rates based on each country," Letters in Spatial and Resource Sciences, Springer, vol. 17(1), pages 1-29, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhiyu Quan & Changyue Hu & Panyi Dong & Emiliano A. Valdez, 2024. "Improving Business Insurance Loss Models by Leveraging InsurTech Innovation," Papers 2401.16723, arXiv.org.
    2. Zuleyka Díaz Martínez & José Fernández Menéndez & Luis Javier García Villalba, 2023. "Tariff Analysis in Automobile Insurance: Is It Time to Switch from Generalized Linear Models to Generalized Additive Models?," Mathematics, MDPI, vol. 11(18), pages 1-16, September.
    3. Emer Owens & Barry Sheehan & Martin Mullins & Martin Cunneen & Juliane Ressel & German Castignani, 2022. "Explainable Artificial Intelligence (XAI) in Insurance," Risks, MDPI, vol. 10(12), pages 1-50, December.
    4. Vali Asimit & Ioannis Kyriakou & Jens Perch Nielsen, 2020. "Special Issue “Machine Learning in Insurance”," Risks, MDPI, vol. 8(2), pages 1-2, May.
    5. Christopher Grumiau & Mina Mostoufi & Solon Pavlioglou & Tim Verdonck, 2020. "Address Identification Using Telematics: An Algorithm to Identify Dwell Locations," Risks, MDPI, vol. 8(3), pages 1-12, September.
    6. Nemanja Milanović & Miloš Milosavljević & Slađana Benković & Dušan Starčević & Željko Spasenić, 2020. "An Acceptance Approach for Novel Technologies in Car Insurance," Sustainability, MDPI, vol. 12(24), pages 1-15, December.
    7. Eduardo Ramos-P'erez & Pablo J. Alonso-Gonz'alez & Jos'e Javier N'u~nez-Vel'azquez, 2020. "Stochastic reserving with a stacked model based on a hybridized Artificial Neural Network," Papers 2008.07564, arXiv.org.
    8. Gu, Zheng & Li, Yunxian & Zhang, Minghui & Liu, Yifei, 2023. "Modelling economic losses from earthquakes using regression forests: Application to parametric insurance," Economic Modelling, Elsevier, vol. 125(C).
    9. Mahya Norallahi & Hesam Seyed Kaboli, 2021. "Urban flood hazard mapping using machine learning models: GARP, RF, MaxEnt and NB," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 106(1), pages 119-137, March.
    10. T. E. Ologunorisa & O. Obioma & A. O. Eludoyin, 2022. "Urban flood event and associated damage in the Benue valley, Nigeria," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 111(1), pages 261-282, March.
    11. Francis Duval & Jean‐Philippe Boucher & Mathieu Pigeon, 2023. "Enhancing claim classification with feature extraction from anomaly‐detection‐derived routine and peculiarity profiles," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 90(2), pages 421-458, June.
    12. Yanez, Juan Sebastian & Pigeon, Mathieu, 2021. "Micro-level parametric duration-frequency-severity modeling for outstanding claim payments," Insurance: Mathematics and Economics, Elsevier, vol. 98(C), pages 106-119.
    13. Gao, Lisa & Shi, Peng, 2022. "Leveraging high-resolution weather information to predict hail damage claims: A spatial point process for replicated point patterns," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 161-179.
    14. Hamed Ghaedi & Kelsea Best & Allison Reilly & Deb Niemeier, 2024. "Statistical learning to identify salient factors influencing FEMA public assistance outlays," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 120(12), pages 10559-10582, September.
    15. Christopher Blier-Wong & Hélène Cossette & Luc Lamontagne & Etienne Marceau, 2020. "Machine Learning in P&C Insurance: A Review for Pricing and Reserving," Risks, MDPI, vol. 9(1), pages 1-26, December.
    16. Aristodemos Pnevmatikakis & Stathis Kanavos & George Matikas & Konstantina Kostopoulou & Alfredo Cesario & Sofoklis Kyriazakos, 2021. "Risk Assessment for Personalized Health Insurance Based on Real-World Data," Risks, MDPI, vol. 9(3), pages 1-15, March.
    17. Yaojun Zhang & Lanpeng Ji & Georgios Aivaliotis & Charles C. Taylor, 2024. "Bayesian CART models for aggregate claim modeling," Papers 2409.01908, arXiv.org.
    18. Stephan M. Bischofberger, 2020. "In-Sample Hazard Forecasting Based on Survival Models with Operational Time," Risks, MDPI, vol. 8(1), pages 1-17, January.
    19. Maria Goldshtein & Erin K. Chiou & Rod D. Roscoe, 2024. "‘I Just Don’t Trust Them’: Reasons for Distrust and Non-Disclosure in Demographic Questionnaires for Individuals in STEM," Societies, MDPI, vol. 14(7), pages 1-21, June.
    20. Jessica Pesantez-Narvaez & Montserrat Guillen & Manuela Alcañiz, 2021. "RiskLogitboost Regression for Rare Events in Binary Response: An Econometric Approach," Mathematics, MDPI, vol. 9(5), pages 1-21, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jrisks:v:11:y:2023:i:9:p:164-:d:1242230. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.