IDEAS home Printed from https://ideas.repec.org/a/eee/insuma/v71y2016icp244-252.html
   My bibliography  Save this article

Bayesian multinomial latent variable modeling for fraud and abuse detection in health insurance

Author

Listed:
  • Bayerstadler, Andreas
  • van Dijk, Linda
  • Winter, Fabian

Abstract

Healthcare fraud and abuse are a serious challenge to healthcare payers and to the entire society. This article presents a predictive model for fraud and abuse detection in health insurance based on a training dataset of manually reviewed claims. The goal of the analysis is to predict different fraud and abuse probabilities for new invoices. The prediction is based on a wide framework of fraud and abuse reports which examine the behavior of medical providers and insured members by measuring systematic deviation from usual patterns in medical claims data. We show that models which directly use the results of the reports as model covariates do not exploit the full potential in terms of predictive quality. Instead, we propose a multinomial Bayesian latent variable model which summarizes behavioral patterns in latent variables, and predicts different fraud and abuse probabilities. The estimation of model parameters is based on a Markov Chain Monte Carlo (MCMC) algorithm using Bayesian shrinkage techniques. The presented approach improves the identification of fraudulent and abusive claims compared to different benchmark approaches.

Suggested Citation

  • Bayerstadler, Andreas & van Dijk, Linda & Winter, Fabian, 2016. "Bayesian multinomial latent variable modeling for fraud and abuse detection in health insurance," Insurance: Mathematics and Economics, Elsevier, vol. 71(C), pages 244-252.
  • Handle: RePEc:eee:insuma:v:71:y:2016:i:c:p:244-252
    DOI: 10.1016/j.insmatheco.2016.09.013
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167668715302845
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.insmatheco.2016.09.013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Faisal Zahid & Gerhard Tutz, 2013. "Multinomial logit models with implicit variable selection," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 7(4), pages 393-416, December.
    2. Jing Li & Kuei-Ying Huang & Jionghua Jin & Jianjun Shi, 2008. "A survey on statistical methods for health care fraud detection," Health Care Management Science, Springer, vol. 11(3), pages 275-287, September.
    3. Samson B. Adebayo & Ludwig Fahrmeir & Christian Seiler & Christian Heumann, 2011. "Geoadditive Latent Variable Modeling of Count Data on Multiple Sexual Partnering in Nigeria," Biometrics, The International Biometric Society, vol. 67(2), pages 620-628, June.
    4. Park, Trevor & Casella, George, 2008. "The Bayesian Lasso," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 681-686, June.
    5. Goldberger, Arthur S, 1972. "Structural Equation Methods in the Social Sciences," Econometrica, Econometric Society, vol. 40(6), pages 979-1001, November.
    6. Yufei Jin & Roderick Rejesus & Bertis Little, 2005. "Binary choice models for rare events data: a crop insurance fraud application," Applied Economics, Taylor & Francis Journals, vol. 37(7), pages 841-848.
    7. Ludwig Fahrmeir & Alexander Raach, 2007. "A Bayesian Semiparametric Latent Variable Model for Mixed Responses," Psychometrika, Springer;The Psychometric Society, vol. 72(3), pages 327-346, September.
    8. Mary Dupuis Sammel & Louise M. Ryan & Julie M. Legler, 1997. "Latent Variable Models for Mixed Discrete and Continuous Outcomes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(3), pages 667-678.
    9. Martin, Andrew D. & Quinn, Kevin M. & Park, Jong Hee, 2011. "MCMCpack: Markov Chain Monte Carlo in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 42(i09).
    10. Bermúdez, Ll. & Pérez, J.M. & Ayuso, M. & Gómez, E. & Vázquez, F.J., 2008. "A Bayesian dichotomous model with asymmetric link for fraud in insurance," Insurance: Mathematics and Economics, Elsevier, vol. 42(2), pages 779-786, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Galeotti, Marcello & Rabitti, Giovanni & Vannucci, Emanuele, 2020. "An evolutionary approach to fraud management," European Journal of Operational Research, Elsevier, vol. 284(3), pages 1167-1177.
    2. Sun, Huan & Wang, Haiyan & Steffensen, Sonja, 2022. "Mechanism design of multi-strategy health insurance plans under asymmetric information," Omega, Elsevier, vol. 107(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ruixin Guo & Hongtu Zhu & Sy-Miin Chow & Joseph G. Ibrahim, 2012. "Bayesian Lasso for Semiparametric Structural Equation Models," Biometrics, The International Biometric Society, vol. 68(2), pages 567-577, June.
    2. Ji, Yonggang & Lin, Nan & Zhang, Baoxue, 2012. "Model selection in binary and tobit quantile regression using the Gibbs sampler," Computational Statistics & Data Analysis, Elsevier, vol. 56(4), pages 827-839.
    3. Samson B. Adebayo & Ludwig Fahrmeir & Christian Seiler & Christian Heumann, 2011. "Geoadditive Latent Variable Modeling of Count Data on Multiple Sexual Partnering in Nigeria," Biometrics, The International Biometric Society, vol. 67(2), pages 620-628, June.
    4. Feng, Xiangnan & Lu, Bin & Song, Xinyuan & Ma, Shuang, 2019. "Financial literacy and household finances: A Bayesian two-part latent variable modeling approach," Journal of Empirical Finance, Elsevier, vol. 51(C), pages 119-137.
    5. Nalan Basturk & Cem Cakmakli & S. Pinar Ceyhan & Herman K. van Dijk, 2014. "On the Rise of Bayesian Econometrics after Cowles Foundation Monographs 10, 14," Tinbergen Institute Discussion Papers 14-085/III, Tinbergen Institute, revised 04 Sep 2014.
    6. Nalan Basturk & Cem Cakmakli & S. Pinar Ceyhan & Herman K. van Dijk, 2013. "Historical Developments in Bayesian Econometrics after Cowles Foundation Monographs 10, 14," Tinbergen Institute Discussion Papers 13-191/III, Tinbergen Institute.
    7. Babak Fazelabdolabadi, 2019. "A hybrid Bayesian-network proposition for forecasting the crude oil price," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 5(1), pages 1-21, December.
    8. Li, Chunyu & Lou, Chenxin & Luo, Dan & Xing, Kai, 2021. "Chinese corporate distress prediction using LASSO: The role of earnings management," International Review of Financial Analysis, Elsevier, vol. 76(C).
    9. Anne Musson & Damien Rousselière, 2020. "Exploring the effect of crisis on cooperatives: a Bayesian performance analysis of French craftsmen cooperatives," Applied Economics, Taylor & Francis Journals, vol. 52(25), pages 2657-2678, May.
    10. Yang Lu, 2019. "Flexible (panel) regression models for bivariate count–continuous data with an insurance application," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 182(4), pages 1503-1521, October.
    11. Hengtao Zhang & Guosheng Yin, 2021. "Response‐adaptive rerandomization," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(5), pages 1281-1298, November.
    12. Prüser, Jan, 2017. "Forecasting US inflation using Markov dimension switching," Ruhr Economic Papers 710, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.
    13. Armagan, Artin & Dunson, David, 2011. "Sparse variational analysis of linear mixed models for large data sets," Statistics & Probability Letters, Elsevier, vol. 81(8), pages 1056-1062, August.
    14. Andrew Gelman & Daniel Lee & Jiqiang Guo, 2015. "Stan," Journal of Educational and Behavioral Statistics, , vol. 40(5), pages 530-543, October.
    15. Wang, Hong & Forbes, Catherine S. & Fenech, Jean-Pierre & Vaz, John, 2020. "The determinants of bank loan recovery rates in good times and bad – New evidence," Journal of Economic Behavior & Organization, Elsevier, vol. 177(C), pages 875-897.
    16. Fan, Jianqing & Jiang, Bai & Sun, Qiang, 2022. "Bayesian factor-adjusted sparse regression," Journal of Econometrics, Elsevier, vol. 230(1), pages 3-19.
    17. Blackman, Allen & Guerrero, Santiago, 2012. "What drives voluntary eco-certification in Mexico?," Journal of Comparative Economics, Elsevier, vol. 40(2), pages 256-268.
    18. Zuoxian Gan & Tao Feng & Min Yang, 2018. "Exploring the Effects of Car Ownership and Commuting on Subjective Well-Being: A Nationwide Questionnaire Study," Sustainability, MDPI, vol. 11(1), pages 1-20, December.
    19. Mao, Lu, 2022. "Identification of the outcome distribution and sensitivity analysis under weak confounder–instrument interaction," Statistics & Probability Letters, Elsevier, vol. 189(C).
    20. Francesco Agostinelli & Matthew Wiswall, 2016. "Identification of Dynamic Latent Factor Models: The Implications of Re-Normalization in a Model of Child Development," NBER Working Papers 22441, National Bureau of Economic Research, Inc.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:insuma:v:71:y:2016:i:c:p:244-252. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/inca/505554 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.