IDEAS home Printed from https://ideas.repec.org/a/inm/oropre/v56y2008i6p1382-1392.html
   My bibliography  Save this article

Algorithmic Prediction of Health-Care Costs

Author

Listed:
  • Dimitris Bertsimas

    (Operations Research Center, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139)

  • Margrét V. Bjarnadóttir

    (Stanford Graduate School of Business, Stanford, California 94305)

  • Michael A. Kane

    (Medical Department, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139)

  • J. Christian Kryder

    (D2Hawkeye, Waltham, Massachusetts 02453)

  • Rudra Pandey

    (D2Hawkeye, Waltham, Massachusetts 02453)

  • Santosh Vempala

    (ARC ThinkTank, Georgia Institute of Technology, Atlanta, Georgia 30332)

  • Grant Wang

    (Electrical Engineering and Computer Science Department, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139)

Abstract

The rising cost of health care is one of the world's most important problems. Accordingly, predicting such costs with accuracy is a significant first step in addressing this problem. Since the 1980s, there has been research on the predictive modeling of medical costs based on (health insurance) claims data using heuristic rules and regression methods. These methods, however, have not been appropriately validated using populations that the methods have not seen. We utilize modern data-mining methods, specifically classification trees and clustering algorithms, along with claims data from over 800,000 insured individuals over three years, to provide rigorously validated predictions of health-care costs in the third year, based on medical and cost data from the first two years. We quantify the accuracy of our predictions using unseen (out-of-sample) data from over 200,000 members. The key findings are: (a) our data-mining methods provide accurate predictions of medical costs and represent a powerful tool for prediction of health-care costs, (b) the pattern of past cost data is a strong predictor of future costs, and (c) medical information only contributes to accurate prediction of medical costs of high-cost members.

Suggested Citation

  • Dimitris Bertsimas & Margrét V. Bjarnadóttir & Michael A. Kane & J. Christian Kryder & Rudra Pandey & Santosh Vempala & Grant Wang, 2008. "Algorithmic Prediction of Health-Care Costs," Operations Research, INFORMS, vol. 56(6), pages 1382-1392, December.
  • Handle: RePEc:inm:oropre:v:56:y:2008:i:6:p:1382-1392
    DOI: 10.1287/opre.1080.0619
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/opre.1080.0619
    Download Restriction: no

    File URL: https://libkey.io/10.1287/opre.1080.0619?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Van de ven, Wynand P.M.M. & Ellis, Randall P., 2000. "Risk adjustment in competitive health plan markets," Handbook of Health Economics, in: A. J. Culyer & J. P. Newhouse (ed.), Handbook of Health Economics, edition 1, volume 1, chapter 14, pages 755-845, Elsevier.
    2. Kim H. & Loh W.Y., 2001. "Classification Trees With Unbiased Multiway Splits," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 589-604, June.
    3. Manning, Willard G. & Mullahy, John, 2001. "Estimating log models: to transform or not to transform?," Journal of Health Economics, Elsevier, vol. 20(4), pages 461-494, July.
    4. A. J. Culyer & J. P. Newhouse (ed.), 2000. "Handbook of Health Economics," Handbook of Health Economics, Elsevier, edition 1, volume 1, number 1.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sriubaite, I. & Harris, A. & Jones, A.M. & Gabbe, B., 2020. "Economic Consequences of Road Traffic Injuries. Application of the Super Learner algorithm," Health, Econometrics and Data Group (HEDG) Working Papers 20/20, HEDG, c/o Department of Economics, University of York.
    2. Florian Buchner & Jürgen Wasem & Sonja Schillo, 2017. "Regression Trees Identify Relevant Interactions: Can This Improve the Predictive Performance of Risk Adjustment?," Health Economics, John Wiley & Sons, Ltd., vol. 26(1), pages 74-85, January.
    3. Yeongah Choi & Jiho An & Seiyoung Ryu & Jaekyeong Kim, 2022. "Development and Evaluation of Machine Learning-Based High-Cost Prediction Model Using Health Check-Up Data by the National Health Insurance Service of Korea," IJERPH, MDPI, vol. 19(20), pages 1-16, October.
    4. Sungchul Park & Anirban Basu, 2018. "Alternative evaluation metrics for risk adjustment methods," Health Economics, John Wiley & Sons, Ltd., vol. 27(6), pages 984-1010, June.
    5. Adam Maidman & Lan Wang, 2018. "New semiparametric method for predicting high‐cost patients," Biometrics, The International Biometric Society, vol. 74(3), pages 1104-1111, September.
    6. Lida Anna Apergi & Margrét Vilborg Bjarnadóttir & John S. Baras & Bruce L. Golden, 2024. "Cost Patterns of Multiple Chronic Conditions: A Novel Modeling Approach Using a Condition Hierarchy," INFORMS Joural on Data Science, INFORMS, vol. 3(1), pages 49-67, April.
    7. D. Cattel & R. C. Kleef & R. C. J. A. Vliet, 2017. "A method to simulate incentives for cost containment under various cost sharing designs: an application to a first-euro deductible and a doughnut hole," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 18(8), pages 987-1000, November.
    8. Lennon, Conor, 2021. "Are the costs of employer-sponsored health insurance passed on to workers at the individual level?," Economics & Human Biology, Elsevier, vol. 41(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jones, A. & Lomas, J. & Rice, N., 2014. "Going Beyond the Mean in Healthcare Cost Regressions: a Comparison of Methods for Estimating the Full Conditional Distribution," Health, Econometrics and Data Group (HEDG) Working Papers 14/26, HEDG, c/o Department of Economics, University of York.
    2. R. Halbersma & M. Mikkers & E. Motchenkova & I. Seinen, 2011. "Market structure and hospital–insurer bargaining in the Netherlands," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 12(6), pages 589-603, December.
    3. Pilny, Adam & Wübker, Ansgar & Ziebarth, Nicolas R., 2017. "Introducing risk adjustment and free health plan choice in employer-based health insurance: Evidence from Germany," Journal of Health Economics, Elsevier, vol. 56(C), pages 330-351.
    4. Jacob Glazer & Thomas G. McGuire & Julie Shi, 2016. "Risk Adjustment of Health Plan Payments to Correct Inefficient Plan Choice from Adverse Selection," NBER Chapters, in: Measuring and Modeling Health Care Costs, pages 379-418, National Bureau of Economic Research, Inc.
    5. Eijkenaar, Frank & van Vliet, René C.J.A., 2017. "Improving risk equalization for individuals with persistently high costs: Experiences from the Netherlands," Health Policy, Elsevier, vol. 121(11), pages 1169-1176.
    6. Hao Yu, 2017. "China’s medical savings accounts: an analysis of the price elasticity of demand for health care," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 18(6), pages 773-785, July.
    7. Cantoni, Eva & Ronchetti, Elvezio, 2006. "A robust approach for skewed and heavy-tailed outcomes in the analysis of health care expenditures," Journal of Health Economics, Elsevier, vol. 25(2), pages 198-213, March.
    8. Lehmann, Hansjorg & Zweifel, Peter, 2004. "Innovation and risk selection in deregulated social health insurance," Journal of Health Economics, Elsevier, vol. 23(5), pages 997-1012, September.
    9. Simón Ramírez Amaya & Adolfo J. Quiroz & Álvaro José Riascos Villegas, 2019. "Regression by clustering using metropolis-hastings," Documentos de Trabajo 18180, Quantil.
    10. Keane, Michael, 2004. "Modeling Health Insurance Choice Using the Heterogeneous Logit Model," MPRA Paper 55203, University Library of Munich, Germany.
    11. Karen Eggleston & Randall P. Ellis & Mingshan Lu, 2007. "Prevention and Dynamic Risk Adjustment," Boston University - Department of Economics - Working Papers Series WP2007-023, Boston University - Department of Economics.
    12. repec:clg:wpaper:2007-06 is not listed on IDEAS
    13. Karen Eggleston & Brian K. Chen & Chih-Hung Chen & Ying Isabel Chen & Talitha Feenstra & Toshiaki Iizuka & Janet Tin Kei Lam & Gabriel M. Leung & Jui-fen Rachel Lu & Beatriz Rodriguez-Sanchez & Jeroen, 2020. "Are quality-adjusted medical prices declining for chronic disease? Evidence from diabetes care in four health systems," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 21(5), pages 689-702, July.
    14. Grossman, Michael, 2004. "The demand for health, 30 years later: a very personal retrospective and prospective reflection," Journal of Health Economics, Elsevier, vol. 23(4), pages 629-636, July.
    15. Peter Zweifel, 2013. "The present state of health economics: a critique and an agenda for the future," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 14(4), pages 569-571, August.
    16. Maurus Rischatsch, 2014. "Lead me not into temptation: drug price regulation and dispensing physicians in Switzerland," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 15(7), pages 697-708, September.
    17. Yi Yao & Joan Schmit & Julie Shi, 2019. "Promoting sustainability for micro health insurance: a risk-adjusted subsidy approach for maternal healthcare service," The Geneva Papers on Risk and Insurance - Issues and Practice, Palgrave Macmillan;The Geneva Association, vol. 44(3), pages 382-409, July.
    18. S. Veen & R. Kleef & W. Ven & R. Vliet, 2015. "Improving the prediction model used in risk equalization: cost and diagnostic information from multiple prior years," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 16(2), pages 201-218, March.
    19. Thomas Bjørner & Søren Arnberg, 2012. "Terminal costs, improved life expectancy and future public health expenditure," International Journal of Health Economics and Management, Springer, vol. 12(2), pages 129-143, June.
    20. Manos Matsaganis & Theodore Mitrakos & Panos Tsakloglou, 2009. "Modelling health expenditure at the household level in Greece," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 10(3), pages 329-336, July.
    21. Devlin, Rose Anne & Sarma, Sisira, 2008. "Do physician remuneration schemes matter? The case of Canadian family physicians," Journal of Health Economics, Elsevier, vol. 27(5), pages 1168-1181, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:oropre:v:56:y:2008:i:6:p:1382-1392. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.