IDEAS home Printed from https://ideas.repec.org/a/bla/jrinsu/v90y2023i3p743-768.html
   My bibliography  Save this article

Detecting insurance fraud using supervised and unsupervised machine learning

Author

Listed:
  • Jörn Debener
  • Volker Heinke
  • Johannes Kriebel

Abstract

Fraud is a significant issue for insurance companies, generating much interest in machine learning solutions. Although supervised learning for insurance fraud detection has long been a research focus, unsupervised learning has rarely been studied in this context, and there remains insufficient evidence to guide the choice between these branches of machine learning for insurance fraud detection. Accordingly, this study evaluates supervised and unsupervised learning using proprietary insurance claim data. Furthermore, we conduct a field experiment in cooperation with an insurance company to investigate the performance of each approach in terms of identifying new fraudulent claims. We derive several important findings. Unsupervised learning, especially isolation forests, can successfully detect insurance fraud. Supervised learning also performs strongly, despite few labeled fraud cases. Interestingly, unsupervised and supervised learning detect new fraudulent claims based on different input information. Therefore, for implementation, we suggest understanding supervised and unsupervised methods as complements rather than substitutes.

Suggested Citation

  • Jörn Debener & Volker Heinke & Johannes Kriebel, 2023. "Detecting insurance fraud using supervised and unsupervised machine learning," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 90(3), pages 743-768, September.
  • Handle: RePEc:bla:jrinsu:v:90:y:2023:i:3:p:743-768
    DOI: 10.1111/jori.12427
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/jori.12427
    Download Restriction: no

    File URL: https://libkey.io/10.1111/jori.12427?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Alexander Vosseler, 2022. "Unsupervised Insurance Fraud Prediction Based on Anomaly Detector Ensembles," Risks, MDPI, vol. 10(7), pages 1-20, June.
    2. Viaene, Stijn & Ayuso, Mercedes & Guillen, Montserrat & Van Gheel, Dirk & Dedene, Guido, 2007. "Strategies for detecting fraudulent claims in the automobile insurance industry," European Journal of Operational Research, Elsevier, vol. 176(1), pages 565-583, January.
    3. Georges Dionne & Florence Giuliano & Pierre Picard, 2009. "Optimal Auditing with Scoring: Theory and Application to Insurance Fraud," Management Science, INFORMS, vol. 55(1), pages 58-70, January.
    4. Picard, Pierre, 1996. "Auditing claims in the insurance market with fraud: The credibility issue," Journal of Public Economics, Elsevier, vol. 63(1), pages 27-56, December.
    5. Steven B. Caudill & Mercedes Ayuso & Montserrat Guillén, 2005. "Fraud Detection Using a Multinomial Logit Model With Missing Information," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 72(4), pages 539-550, December.
    6. Artis, Manuel & Ayuso, Mercedes & Guillen, Montserrat, 1999. "Modelling different types of automobile insurance fraud behaviour in the Spanish market," Insurance: Mathematics and Economics, Elsevier, vol. 24(1-2), pages 67-81, March.
    7. Simon Fritzsch & Philipp Scharner & Gregor Weiß, 2021. "Estimating the relation between digitalization and the market value of insurers," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 88(3), pages 529-567, September.
    8. Gunnarsson, Björn Rafn & vanden Broucke, Seppe & Baesens, Bart & Óskarsdóttir, María & Lemahieu, Wilfried, 2021. "Deep learning for credit scoring: Do or don’t?," European Journal of Operational Research, Elsevier, vol. 295(1), pages 292-305.
    9. Jing Ai & Patrick L. Brockett & Linda L. Golden & Montserrat Guillén, 2013. "A Robust Unsupervised Method for Fraud Rate Estimation," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 80(1), pages 121-143, March.
    10. Michele Tumminello & Andrea Consiglio & Pietro Vassallo & Riccardo Cesari & Fabio Farabullini, 2023. "Insurance fraud detection: A statistically validated network approach," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 90(2), pages 381-419, June.
    11. Chamal Gomes & Zhuo Jin & Hailiang Yang, 2021. "Insurance fraud detection with unsupervised deep learning," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 88(3), pages 591-624, September.
    12. Keith J. Crocker & John Morgan, 1998. "Is Honesty the Best Policy? Curtailing Insurance Fraud through Optimal Incentive Contracts," Journal of Political Economy, University of Chicago Press, vol. 106(2), pages 355-375, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yang Qiao & Chou-Wen Wang & Wenjun Zhu, 2024. "Machine learning in long-term mortality forecasting," The Geneva Papers on Risk and Insurance - Issues and Practice, Palgrave Macmillan;The Geneva Association, vol. 49(2), pages 340-362, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jing Ai & Patrick L. Brockett & Linda L. Golden & Montserrat Guillén, 2013. "A Robust Unsupervised Method for Fraud Rate Estimation," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 80(1), pages 121-143, March.
    2. Ming-Jyh Wang & Chieh-Hua Wen & Lawrence W Lan, 2010. "Modelling Different Types of Bundled Automobile Insurance Choice Behaviour: The Case of Taiwan*," The Geneva Papers on Risk and Insurance - Issues and Practice, Palgrave Macmillan;The Geneva Association, vol. 35(2), pages 290-308, April.
    3. Urbina, Jilber & Guillén, Montserrat, 2013. "An application of capital allocation principles to operational risk," MPRA Paper 75726, University Library of Munich, Germany, revised Dec 2013.
    4. Dionne, Georges, 2012. "The empirical measure of information problems with emphasis on insurance fraud and dynamic data," Working Papers 12-10, HEC Montreal, Canada Research Chair in Risk Management.
    5. Lammers, Frauke & Schiller, Jörg, 2010. "Contract design and insurance fraud: An experimental investigation," FZID Discussion Papers 19-2010, University of Hohenheim, Center for Research on Innovation and Services (FZID).
    6. Dionne, Georges & Wang, Kili, 2011. "Does opportunistic fraud in automobile theft insurance fluctuate with the business cycle?," Working Papers 11-4, HEC Montreal, Canada Research Chair in Risk Management.
    7. Pierre Picard, 2012. "Economic Analysis of Insurance Fraud," Working Papers hal-00725561, HAL.
    8. Yankol-Schalck, Meryem, 2022. "The value of cross-data set analysis for automobile insurance fraud detection," Research in International Business and Finance, Elsevier, vol. 63(C).
    9. M. Martin Boyer & Jörg Schiller, 2003. "Merging Automobile Insurance Regulatory Bodies: The Case of Atlantic Canada," CIRANO Working Papers 2003s-70, CIRANO.
    10. Katja Müller & Hato Schmeiser & Joël Wagner, 2016. "The impact of auditing strategies on insurers’ profitability," Journal of Risk Finance, Emerald Group Publishing, vol. 17(1), pages 46-79, January.
    11. Georges Dionne & Kili Wang, 2013. "Does insurance fraud in automobile theft insurance fluctuate with the business cycle?," Journal of Risk and Uncertainty, Springer, vol. 47(1), pages 67-92, August.
    12. Andersson, Jonas & Olden, Andreas & Rusina, Aija, 2020. "Fraud detection by a multinomial model: Separating honesty from unobserved fraud," Discussion Papers 2020/15, Norwegian School of Economics, Department of Business and Management Science.
    13. Richard Watt, 2003. "Curtailing Ex-Post Fraud in Risk Sharing Arrangements," European Journal of Law and Economics, Springer, vol. 16(2), pages 247-263, September.
    14. Bermúdez, Ll. & Pérez, J.M. & Ayuso, M. & Gómez, E. & Vázquez, F.J., 2008. "A Bayesian dichotomous model with asymmetric link for fraud in insurance," Insurance: Mathematics and Economics, Elsevier, vol. 42(2), pages 779-786, April.
    15. Galeotti, Marcello & Rabitti, Giovanni & Vannucci, Emanuele, 2020. "An evolutionary approach to fraud management," European Journal of Operational Research, Elsevier, vol. 284(3), pages 1167-1177.
    16. Frauke von Bieberstein & Jörg Schiller, 2018. "Contract design and insurance fraud: an experimental investigation," Review of Managerial Science, Springer, vol. 12(3), pages 711-736, July.
    17. Boyer, M. Martin & Schiller, Jörg, 2003. "Merging automobile regulatory bodies: The case of Atlantic Canada," Working Papers on Risk and Insurance 11, University of Hamburg, Institute for Risk and Insurance.
    18. Denisa BANULESCU-RADU & Meryem YANKOL-SCHALCK, 2021. "Fraud detection in the era of Machine Learning: a household insurance case," LEO Working Papers / DR LEO 2904, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    19. Ingela Alger & Régis Renault, 2006. "Screening Ethics When Honest Agents Care About Fairness ," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 47(1), pages 59-85, February.
    20. Dionne, Georges, 1998. "La mesure empirique des problèmes d’information," L'Actualité Economique, Société Canadienne de Science Economique, vol. 74(4), pages 585-606, décembre.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jrinsu:v:90:y:2023:i:3:p:743-768. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/ariaaea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.