IDEAS home Printed from https://ideas.repec.org/a/sae/medema/v34y2014i4p513-522.html
   My bibliography  Save this article

Sensitivity and Specificity Can Change in Opposite Directions When New Predictive Markers Are Added to Risk Models

Author

Listed:
  • Ben Van Calster
  • Ewout W. Steyerberg
  • Ralph B. D’Agostino Sr
  • Michael J. Pencina

Abstract

When comparing prediction models, it is essential to estimate the magnitude of change in performance rather than rely solely on statistical significance. In this paper we investigate measures that estimate change in classification performance, assuming 2-group classification based on a single risk threshold. We study the value of a new biomarker when added to a baseline risk prediction model. First, simulated data are used to investigate the change in sensitivity and specificity (ΔSe and ΔSp). Second, the influence of ΔSe and ΔSp on the net reclassification improvement (NRI; sum of ΔSe and ΔSp) and on decision-analytic measures (net benefit or relative utility) is studied. We assume normal distributions for the predictors and assume correctly specified models such that the extended model has a dominating receiver operating characteristic curve relative to the baseline model. Remarkably, we observe that even when a strong marker is added it is possible that either sensitivity (for thresholds below the event rate) or specificity (for thresholds above the event rate) decreases. In these cases, decision-analytic measures provide more modest support for improved classification than NRI, even though all measures confirm that adding the marker improved classification accuracy. Our results underscore the necessity of reporting ΔSe and ΔSp separately. When a single summary is desired, decision-analytic measures allow for a simple incorporation of the misclassification costs.

Suggested Citation

  • Ben Van Calster & Ewout W. Steyerberg & Ralph B. D’Agostino Sr & Michael J. Pencina, 2014. "Sensitivity and Specificity Can Change in Opposite Directions When New Predictive Markers Are Added to Risk Models," Medical Decision Making, , vol. 34(4), pages 513-522, May.
  • Handle: RePEc:sae:medema:v:34:y:2014:i:4:p:513-522
    DOI: 10.1177/0272989X13513654
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0272989X13513654
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0272989X13513654?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Stuart G. Baker & Nancy R. Cook & Andrew Vickers & Barnett S. Kramer, 2009. "Using relative utility curves to evaluate risk prediction," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 172(4), pages 729-748, October.
    2. Walter Bouwmeester & Nicolaas P A Zuithoff & Susan Mallett & Mirjam I Geerlings & Yvonne Vergouwe & Ewout W Steyerberg & Douglas G Altman & Karel G M Moons, 2012. "Reporting and Methods in Clinical Prediction Research: A Systematic Review," PLOS Medicine, Public Library of Science, vol. 9(5), pages 1-13, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ben Van Calster & Andrew J. Vickers, 2015. "Calibration of Risk Prediction Models," Medical Decision Making, , vol. 35(2), pages 162-169, February.
    2. Thomas P A Debray & Karel G M Moons & Ghada Mohammed Abdallah Abo-Zaid & Hendrik Koffijberg & Richard David Riley, 2013. "Individual Participant Data Meta-Analysis for a Binary Outcome: One-Stage or Two-Stage?," PLOS ONE, Public Library of Science, vol. 8(4), pages 1-10, April.
    3. Paul Bach & Christine Wallisch & Nadja Klein & Lorena Hafermann & Willi Sauerbrei & Ewout W Steyerberg & Georg Heinze & Geraldine Rauch & for topic group 2 of the STRATOS initiative, 2020. "Systematic review of education and practical guidance on regression modeling for medical researchers who lack a strong statistical background: Study protocol," PLOS ONE, Public Library of Science, vol. 15(12), pages 1-10, December.
    4. Tracey L. Marsh & Holly Janes & Margaret S. Pepe, 2020. "Statistical inference for net benefit measures in biomarker validation studies," Biometrics, The International Biometric Society, vol. 76(3), pages 843-852, September.
    5. Shi, Chengchun & Lu, Wenbin & Song, Rui, 2019. "A sparse random projection-based test for overall qualitative treatment effects," LSE Research Online Documents on Economics 102107, London School of Economics and Political Science, LSE Library.
    6. Michael Lebenbaum & Osvaldo Espin-Garcia & Yi Li & Laura C Rosella, 2018. "Development and validation of a population based risk algorithm for obesity: The Obesity Population Risk Tool (OPoRT)," PLOS ONE, Public Library of Science, vol. 13(1), pages 1-11, January.
    7. Mei-Cheng Wang & Shanshan Li, 2012. "Bivariate Marker Measurements and ROC Analysis," Biometrics, The International Biometric Society, vol. 68(4), pages 1207-1218, December.
    8. Holly Janes & Margaret S. Pepe & Ying Huang, 2014. "A Framework for Evaluating Markers Used to Select Patient Treatment," Medical Decision Making, , vol. 34(2), pages 159-167, February.
    9. Baker Stuart G. & Van Calster Ben & Steyerberg Ewout W., 2012. "Evaluating a New Marker for Risk Prediction Using the Test Tradeoff: An Update," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-37, March.
    10. Taro Takeshima & Yosuke Yamamoto & Yoshinori Noguchi & Nobuyuki Maki & Koichiro Gibo & Yukio Tsugihashi & Asako Doi & Shingo Fukuma & Shin Yamazaki & Eiji Kajii & Shunichi Fukuhara, 2016. "Identifying Patients with Bacteremia in Community-Hospital Emergency Rooms: A Retrospective Cohort Study," PLOS ONE, Public Library of Science, vol. 11(3), pages 1-17, March.
    11. Ying Huang & Eric Laber, 2016. "Personalized Evaluation of Biomarker Value: A Cost-Benefit Perspective," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 8(1), pages 43-65, June.
    12. Ben Van Calster & Andrew J. Vickers & Michael J. Pencina & Stuart G. Baker & Dirk Timmerman & Ewout W. Steyerberg, 2013. "Evaluation of Markers and Risk Prediction Models," Medical Decision Making, , vol. 33(4), pages 490-501, May.
    13. Helder Novais Bastos & Nuno S Osório & António Gil Castro & Angélica Ramos & Teresa Carvalho & Leonor Meira & David Araújo & Leonor Almeida & Rita Boaventura & Patrícia Fragata & Catarina Chaves & Pat, 2016. "A Prediction Rule to Stratify Mortality Risk of Patients with Pulmonary Tuberculosis," PLOS ONE, Public Library of Science, vol. 11(9), pages 1-14, September.
    14. Hormuzd A. Katki & Ionut Bebu, 2021. "A simple framework to identify optimal cost‐effective risk thresholds for a single screen: Comparison to Decision Curve Analysis," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(3), pages 887-903, July.
    15. Stuart G. Baker, 2024. "Evaluating Risk Prediction with Data Collection Costs: Novel Estimation of Test Tradeoff Curves," Medical Decision Making, , vol. 44(1), pages 53-63, January.
    16. Igor O Korolev & Laura L Symonds & Andrea C Bozoki & Alzheimer's Disease Neuroimaging Initiative, 2016. "Predicting Progression from Mild Cognitive Impairment to Alzheimer's Dementia Using Clinical, MRI, and Plasma Biomarkers via Probabilistic Pattern Classification," PLOS ONE, Public Library of Science, vol. 11(2), pages 1-25, February.
    17. Marta Morales-Puerto & María Ruiz-Díaz & Marta Aranda-Gallardo & José Miguel Morales-Asencio & Purificación Alcalá-Gutiérrez & José Antonio Rodríguez-Montalvo & Álvaro León-Campos & Silvia García-Mayo, 2022. "Development of a Clinical Prediction Rule for Adverse Events in Multimorbid Patients in Emergency and Hospitalisation," IJERPH, MDPI, vol. 19(14), pages 1-14, July.
    18. John H Wasson & Lynn Ho & Laura Soloway & L Gordon Moore, 2018. "Validation of the What Matters Index: A brief, patient-reported index that guides care for chronic conditions and can substitute for computer-generated risk models," PLOS ONE, Public Library of Science, vol. 13(2), pages 1-13, February.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:medema:v:34:y:2014:i:4:p:513-522. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.