IDEAS home Printed from https://ideas.repec.org/a/spr/stabio/v14y2022i3d10.1007_s12561-021-09325-3.html
   My bibliography  Save this article

A Weighted Sample Framework to Incorporate External Calculators for Risk Modeling

Author

Listed:
  • Debashis Ghosh

    (Colorado School of Public Health)

  • Michael S. Sabel

    (University of Michigan Health Systems)

Abstract

Personalized risk prediction calculators abound in medicine, and they carry important information about the effect of prognostic factors on outcomes of interest. How to use that information in order to analyze local datasets is a pressing question, and several recent proposals have attempted to pool information from external calculators to local datasets using parameter sharing approaches. Here, we adopt a weighting approach using convex optimization in order to transfer information. Rather than directly modeling parameters, we instead pool information on a per-sample basis. In particular, we develop prediction-guided analyses, along with an attendant inferential strategy, for incorporating information from the external risk calculator. We also supplement this analytical approach with an exploratory technique using trees to describe what we term as ‘calculator-guided observations.’ In addition, the optimization problem itself can yield insights on the potential transferability of the external calculator to the local dataset. The methodology is illustrated by simulation studies as well as an application of risk calculators to the prediction of sentinel lymph node positivity in melanoma.

Suggested Citation

  • Debashis Ghosh & Michael S. Sabel, 2022. "A Weighted Sample Framework to Incorporate External Calculators for Risk Modeling," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 14(3), pages 363-379, December.
  • Handle: RePEc:spr:stabio:v:14:y:2022:i:3:d:10.1007_s12561-021-09325-3
    DOI: 10.1007/s12561-021-09325-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-021-09325-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12561-021-09325-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wu C. & Sitter R. R, 2001. "A Model-Calibration Approach to Using Complete Auxiliary Information From Survey Data," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 185-193, March.
    2. Kwun Chuen Gary Chan & Sheung Chi Phillip Yam & Zheng Zhang, 2016. "Globally efficient non-parametric inference of average treatment effects by empirical balancing calibration weighting," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(3), pages 673-700, June.
    3. Nilanjan Chatterjee & Yi-Hau Chen & Paige Maas & Raymond J. Carroll, 2016. "Constrained Maximum Likelihood Estimation for Model Calibration Using Summary-Level Information From External Big Data Sources," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(513), pages 107-117, March.
    4. José R. Zubizarreta, 2015. "Stable Weights that Balance Covariates for Estimation With Incomplete Outcome Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 910-922, September.
    5. Margaret Pepe & Holly Janes & Gary Longton & Wendy Leisenring & Polly Newcomb, 2004. "Limitations of the Odds Ratio in Gauging the Performance of a Diagnostic or Prognostic Marker," UW Biostatistics Working Paper Series 1035, Berkeley Electronic Press.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pedro H. C. Sant'Anna & Xiaojun Song & Qi Xu, 2022. "Covariate distribution balance via propensity scores," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(6), pages 1093-1120, September.
    2. Vahe Avagyan & Stijn Vansteelandt, 2021. "Stable inverse probability weighting estimation for longitudinal studies," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(3), pages 1046-1067, September.
    3. Shixiao Zhang & Peisong Han & Changbao Wu, 2023. "Calibration Techniques Encompassing Survey Sampling, Missing Data Analysis and Causal Inference," International Statistical Review, International Statistical Institute, vol. 91(2), pages 165-192, August.
    4. Eli Ben-Michael & Avi Feller & Jesse Rothstein, 2021. "The Augmented Synthetic Control Method," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(536), pages 1789-1803, October.
    5. Ying Sheng & Yifei Sun & Chiung‐Yu Huang & Mi‐Ok Kim, 2022. "Synthesizing external aggregated information in the presence of population heterogeneity: A penalized empirical likelihood approach," Biometrics, The International Biometric Society, vol. 78(2), pages 679-690, June.
    6. Qiu, Chen & Otsu, Taisuke, 2022. "Information theoretic approach to high dimensional multiplicative models: stochastic discount factor and treatment effect," LSE Research Online Documents on Economics 110494, London School of Economics and Political Science, LSE Library.
    7. Cousineau, Martin & Verter, Vedat & Murphy, Susan A. & Pineau, Joelle, 2023. "Estimating causal effects with optimization-based methods: A review and empirical comparison," European Journal of Operational Research, Elsevier, vol. 304(2), pages 367-380.
    8. Chen Qiu & Taisuke Otsu, 2022. "Information theoretic approach to high‐dimensional multiplicative models: Stochastic discount factor and treatment effect," Quantitative Economics, Econometric Society, vol. 13(1), pages 63-94, January.
    9. Phillip Heiler, 2020. "Efficient Covariate Balancing for the Local Average Treatment Effect," Papers 2007.04346, arXiv.org.
    10. Sean Yiu & Li Su, 2022. "Joint calibrated estimation of inverse probability of treatment and censoring weights for marginal structural models," Biometrics, The International Biometric Society, vol. 78(1), pages 115-127, March.
    11. Zhang, Xiaoke & Xue, Wu & Wang, Qiyue, 2021. "Covariate balancing functional propensity score for functional treatments in cross-sectional observational studies," Computational Statistics & Data Analysis, Elsevier, vol. 163(C).
    12. Sean Yiu & Li Su, 2018. "Covariate association eliminating weights: a unified weighting framework for causal effect estimation," Biometrika, Biometrika Trust, vol. 105(3), pages 709-722.
    13. Martin Cousineau & Vedat Verter & Susan A. Murphy & Joelle Pineau, 2022. "Estimating causal effects with optimization-based methods: A review and empirical comparison," Papers 2203.00097, arXiv.org.
    14. Taisuke Otsu & Chen Qiu, 2018. "Information theoretic approach to high dimensional multiplicative models: Stochastic discount factor and treatment effect," STICERD - Econometrics Paper Series 595, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
    15. Mingfeng Zhan & Zongwu Cai & Ying Fang & Ming Lin, 2020. "Covariate Balance Weighting Methods in Estimating Treatment Effects: An Empirical Comparison," WORKING PAPERS SERIES IN THEORETICAL AND APPLIED ECONOMICS 202020, University of Kansas, Department of Economics, revised Dec 2020.
    16. Ruoqi Yu, 2021. "Evaluating and improving a matched comparison of antidepressants and bone density," Biometrics, The International Biometric Society, vol. 77(4), pages 1276-1288, December.
    17. Anna-Karin Ivert & Marie Torstensson Levander & Juan Merlo, 2013. "Adolescents' Utilisation of Psychiatric Care, Neighbourhoods and Neighbourhood Socioeconomic Deprivation: A Multilevel Analysis," PLOS ONE, Public Library of Science, vol. 8(11), pages 1-1, November.
    18. Chunrong Ai & Oliver Linton & Kaiji Motegi & Zheng Zhang, 2021. "A unified framework for efficient estimation of general treatment models," Quantitative Economics, Econometric Society, vol. 12(3), pages 779-816, July.
    19. Victor Chernozhukov & Whitney K. Newey & Victor Quintas-Martinez & Vasilis Syrgkanis, 2021. "Automatic Debiased Machine Learning via Riesz Regression," Papers 2104.14737, arXiv.org, revised Mar 2024.
    20. Margaret Sullivan Pepe & Tianxi Cai & Gary Longton, 2006. "Combining Predictors for Classification Using the Area under the Receiver Operating Characteristic Curve," Biometrics, The International Biometric Society, vol. 62(1), pages 221-229, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stabio:v:14:y:2022:i:3:d:10.1007_s12561-021-09325-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.