IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v79y2023i3p2036-2049.html
   My bibliography  Save this article

Quantile regression for nonignorable missing data with its application of analyzing electronic medical records

Author

Listed:
  • Aiai Yu
  • Yujie Zhong
  • Xingdong Feng
  • Ying Wei

Abstract

Over the past decade, there has been growing enthusiasm for using electronic medical records (EMRs) for biomedical research. Quantile regression estimates distributional associations, providing unique insights into the intricacies and heterogeneity of the EMR data. However, the widespread nonignorable missing observations in EMR often obscure the true associations and challenge its potential for robust biomedical discoveries. We propose a novel method to estimate the covariate effects in the presence of nonignorable missing responses under quantile regression. This method imposes no parametric specifications on response distributions, which subtly uses implicit distributions induced by the corresponding quantile regression models. We show that the proposed estimator is consistent and asymptotically normal. We also provide an efficient algorithm to obtain the proposed estimate and a randomly weighted bootstrap approach for statistical inferences. Numerical studies, including an empirical analysis of real‐world EMR data, are used to assess the proposed method's finite‐sample performance compared to existing literature.

Suggested Citation

  • Aiai Yu & Yujie Zhong & Xingdong Feng & Ying Wei, 2023. "Quantile regression for nonignorable missing data with its application of analyzing electronic medical records," Biometrics, The International Biometric Society, vol. 79(3), pages 2036-2049, September.
  • Handle: RePEc:bla:biomet:v:79:y:2023:i:3:p:2036-2049
    DOI: 10.1111/biom.13723
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13723
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13723?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Koenker, Roger W & Bassett, Gilbert, Jr, 1978. "Regression Quantiles," Econometrica, Econometric Society, vol. 46(1), pages 33-50, January.
    2. repec:mpr:mprres:8160 is not listed on IDEAS
    3. Jiwei Zhao & Yanyuan Ma, 2018. "Optimal pseudolikelihood estimation in the analysis of multivariate missing data with nonignorable nonresponse," Biometrika, Biometrika Trust, vol. 105(2), pages 479-486.
    4. Siamak Ghasemzadeh & Mojtaba Ganjali & Taban Baghfalaki, 2018. "Bayesian quantile regression for analyzing ordinal longitudinal responses in the presence of non-ignorable missingness," METRON, Springer;Sapienza Università di Roma, vol. 76(3), pages 321-348, December.
    5. Gong, Jie & Lu, Yi & Xie, Huihua, 2020. "The average and distributional effects of teenage adversity on long-term health," Journal of Health Economics, Elsevier, vol. 71(C).
    6. Ying Yuan & Guosheng Yin, 2010. "Bayesian Quantile Regression for Longitudinal Studies with Nonignorable Missing Data," Biometrics, The International Biometric Society, vol. 66(1), pages 105-114, March.
    7. Jun Shao & Lei Wang, 2016. "Semiparametric inverse propensity weighting for nonignorable missing data," Biometrika, Biometrika Trust, vol. 103(1), pages 175-187.
    8. Qin, Jing & Shao, Jun & Zhang, Biao, 2008. "Efficient and Doubly Robust Imputation for Covariate-Dependent Missing Responses," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 797-810, June.
    9. Gong Tang, 2003. "Analysis of multivariate missing data with nonignorable nonresponse," Biometrika, Biometrika Trust, vol. 90(4), pages 747-764, December.
    10. Stuart R. Lipsitz & Lue Ping Zhao & Geert Molenberghs, 1998. "A semiparametric method of multiple imputation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(1), pages 127-144.
    11. He, Xuming & Shao, Qi-Man, 2000. "On Parameters of Increasing Dimensions," Journal of Multivariate Analysis, Elsevier, vol. 73(1), pages 120-135, April.
    12. Xuerong Chen & Alan T. K. Wan & Yong Zhou, 2015. "Efficient Quantile Regression Analysis With Missing Observations," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 723-741, June.
    13. Zhao, Zhibiao & Xiao, Zhijie, 2014. "Efficient Regressions Via Optimally Combining Quantile Information," Econometric Theory, Cambridge University Press, vol. 30(6), pages 1272-1314, December.
    14. Kim, Jae Kwang & Yu, Cindy Long, 2011. "A Semiparametric Estimation of Mean Functionals With Nonignorable Missing Data," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 157-165.
    15. Wei, Ying & Carroll, Raymond J., 2009. "Quantile Regression With Measurement Error," Journal of the American Statistical Association, American Statistical Association, vol. 104(487), pages 1129-1143.
    16. Marc Aerts, 2002. "Local multiple imputation," Biometrika, Biometrika Trust, vol. 89(2), pages 375-388, June.
    17. Zhang, Ting & Wang, Lei, 2020. "Smoothed empirical likelihood inference and variable selection for quantile regression with nonignorable missing response," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhang, Ting & Wang, Lei, 2020. "Smoothed empirical likelihood inference and variable selection for quantile regression with nonignorable missing response," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    2. Wang, Lei & Zhao, Puying & Shao, Jun, 2021. "Dimension-reduced semiparametric estimation of distribution functions and quantiles with nonignorable nonresponse," Computational Statistics & Data Analysis, Elsevier, vol. 156(C).
    3. Shuanghua Luo & Changlin Mei & Cheng-yi Zhang, 2017. "Smoothed empirical likelihood for quantile regression models with response data missing at random," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 101(1), pages 95-116, January.
    4. Yujing Shao & Lei Wang, 2022. "Generalized partial linear models with nonignorable dropouts," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 85(2), pages 223-252, February.
    5. Xuerong Chen & Alan T. K. Wan & Yong Zhou, 2015. "Efficient Quantile Regression Analysis With Missing Observations," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 723-741, June.
    6. Li, Mengyan & Ma, Yanyuan & Zhao, Jiwei, 2022. "Efficient estimation in a partially specified nonignorable propensity score model," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    7. Xuerong Chen & Guoqing Diao & Jing Qin, 2020. "Pseudo likelihood‐based estimation and testing of missingness mechanism function in nonignorable missing data problems," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(4), pages 1377-1400, December.
    8. Lyu Ni & Jun Shao, 2023. "Estimation with multivariate outcomes having nonignorable item nonresponse," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(1), pages 1-15, February.
    9. Cheng, Hao, 2021. "Importance sampling imputation algorithms in quantile regression with their application in CGSS data," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 188(C), pages 498-508.
    10. Zhang, Jing & Wang, Qihua & Kang, Jian, 2020. "Feature screening under missing indicator imputation with non-ignorable missing response," Computational Statistics & Data Analysis, Elsevier, vol. 149(C).
    11. Firpo, Sergio & Galvao, Antonio F. & Pinto, Cristine & Poirier, Alexandre & Sanroman, Graciela, 2022. "GMM quantile regression," Journal of Econometrics, Elsevier, vol. 230(2), pages 432-452.
    12. Jingxuan Guo & Fuguo Liu & Wolfgang Karl Härdle & Xueliang Zhang & Kai Wang & Ting Zeng & Liping Yang & Maozai Tian, 2023. "Sampling Importance Resampling Algorithm with Nonignorable Missing Response Variable Based on Smoothed Quantile Regression," Mathematics, MDPI, vol. 11(24), pages 1-30, December.
    13. Georges Bresson & Guy Lacroix & Mohammad Arshad Rahman, 2021. "Bayesian panel quantile regression for binary outcomes with correlated random effects: an application on crime recidivism in Canada," Empirical Economics, Springer, vol. 60(1), pages 227-259, January.
    14. Shonosuke Sugasawa & Kosuke Morikawa & Keisuke Takahata, 2022. "Bayesian semiparametric modeling of response mechanism for nonignorable missing data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(1), pages 101-117, March.
    15. Pengfei Li & Jing Qin & Yukun Liu, 2023. "Instability of inverse probability weighting methods and a remedy for nonignorable missing data," Biometrics, The International Biometric Society, vol. 79(4), pages 3215-3226, December.
    16. Tianqing Liu & Xiaohui Yuan, 2020. "Doubly robust augmented-estimating-equations estimation with nonignorable nonresponse data," Statistical Papers, Springer, vol. 61(6), pages 2241-2270, December.
    17. Lei Wang & Wei Ma, 2021. "Improved empirical likelihood inference and variable selection for generalized linear models with longitudinal nonignorable dropouts," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(3), pages 623-647, June.
    18. Firpo, Sergio & Galvao, Antonio F. & Song, Suyong, 2017. "Measurement errors in quantile regression models," Journal of Econometrics, Elsevier, vol. 198(1), pages 146-164.
    19. Rui Duan & C. Jason Liang & Pamela Shaw & Cheng Yong Tang & Yong Chen, 2020. "Missing at Random or Not: A Semiparametric Testing Approach," Papers 2003.11181, arXiv.org.
    20. Kato, Kengo & F. Galvao, Antonio & Montes-Rojas, Gabriel V., 2012. "Asymptotics for panel quantile regression models with individual effects," Journal of Econometrics, Elsevier, vol. 170(1), pages 76-91.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:79:y:2023:i:3:p:2036-2049. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.