
A beyond multiple robust approach for missing response problem

Author

Listed:
  • Wang, Qihua
  • Su, Miaomiao
  • Wang, Ruoyu

Abstract

Imputation and inverse probability weighting are two commonly used approaches in missing data analysis. Their parametric versions are not robust to misspecification of the working models for the unknown functions involved. Nonparametric versions are robust but impractical when the number of covariates is large, owing to the “curse of dimensionality”. This paper proposes a beyond multiple robust method. The method balances the parametric and nonparametric approaches by using some of the model information contained in the outcome regression function and the selection probability function, and hence alleviates the model misspecification problem and the “curse of dimensionality” simultaneously. To illustrate the proposed method, we focus on estimating the response mean in the presence of missing responses. A beyond multiple robust estimator of the response mean is defined and proved to be consistent and asymptotically normal as long as one of the true models for the outcome regression or selection probability functions is some function of its assumed models, without requiring that any of the assumed models be correctly specified. It is also shown that the asymptotic variance of the proposed estimator equals the semiparametric efficiency bound established by Hahn (1998, Econometrica, pp 315–331) when both the selection probability function and the outcome regression function are functions of their respective assumed models. The finite sample properties of the proposed estimator are evaluated by simulation studies, and the proposed method is illustrated by a real data analysis.
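
For orientation, the classical estimators of the response mean that the abstract contrasts can be sketched as follows. This is a minimal illustration, not the beyond multiple robust estimator proposed in the paper: it computes the complete-case mean, the parametric regression imputation estimator, the inverse probability weighting (IPW) estimator, and the standard doubly robust augmented IPW estimator. The function names, the logistic and linear working models, and the simulated data are all assumptions made for this sketch; in the simulation both working models happen to be correctly specified and the true response mean is 1.

```python
import numpy as np


def expit(t):
    """Logistic function."""
    return 1.0 / (1.0 + np.exp(-t))


def fit_logistic(X, d, n_iter=25):
    """Fit P(d = 1 | X) = expit(X @ beta) by Newton-Raphson.

    X must already contain an intercept column.
    """
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = expit(X @ beta)
        grad = X.T @ (d - p)                          # score vector
        hess = X.T @ (X * (p * (1.0 - p))[:, None])   # observed information
        beta = beta + np.linalg.solve(hess, grad)
    return beta


def mean_estimators(x, y, d):
    """Estimate E[Y] when y is observed only where d == 1.

    x: (n, p) covariates, y: (n,) responses (values at d == 0 are ignored),
    d: (n,) response indicators in {0, 1}.
    """
    n = len(d)
    X = np.column_stack([np.ones(n), x])
    obs = d == 1

    # Assumed working model for the selection probability (logistic).
    pi_hat = expit(X @ fit_logistic(X, d))

    # Assumed working model for the outcome regression (linear),
    # fitted on the complete cases only.
    gamma, *_ = np.linalg.lstsq(X[obs], y[obs], rcond=None)
    m_hat = X @ gamma

    y0 = np.where(obs, y, 0.0)  # never touch unobserved responses
    return {
        "complete_case": y[obs].mean(),
        "imputation": np.mean(d * y0 + (1 - d) * m_hat),
        "ipw": np.mean(d * y0 / pi_hat),
        "aipw": np.mean(m_hat + d * (y0 - m_hat) / pi_hat),
    }


# Simulated example (hypothetical data, true response mean equal to 1).
rng = np.random.default_rng(0)
n = 5000
x = rng.normal(size=(n, 2))
y = 1.0 + x[:, 0] + 0.5 * x[:, 1] + rng.normal(size=n)
d = rng.binomial(1, expit(0.5 + x[:, 0]))  # missing at random given x

est = mean_estimators(x, y, d)
print(est)
```

In this design the complete-case mean is biased upward because responses with larger first covariate are more likely to be observed, while the other three estimators should land close to 1. The parametric imputation and IPW estimators each rely on their own working model being correct, and the augmented estimator needs at least one of the two to be correct; that is the limitation the abstract's method aims to relax.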

Suggested Citation

  • Wang, Qihua & Su, Miaomiao & Wang, Ruoyu, 2021. "A beyond multiple robust approach for missing response problem," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
  • Handle: RePEc:eee:csdana:v:155:y:2021:i:c:s0167947320302024
    DOI: 10.1016/j.csda.2020.107111

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947320302024
    Download Restriction: Full text for ScienceDirect subscribers only.


    References listed on IDEAS

    1. Bradley Efron, 2014. "Estimation and Accuracy After Model Selection," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 991-1007, September.
    2. Tan, Zhiqiang, 2006. "A Distributional Approach for Causal Inference Using Propensity Scores," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1619-1637, December.
    3. Peisong Han & Linglong Kong & Jiwei Zhao & Xingcai Zhou, 2019. "A general framework for quantile estimation with incomplete data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(2), pages 305-333, April.
    4. Weihua Cao & Anastasios A. Tsiatis & Marie Davidian, 2009. "Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data," Biometrika, Biometrika Trust, vol. 96(3), pages 723-734.
    5. Min Zhang & Anastasios A. Tsiatis & Marie Davidian, 2008. "Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates," Biometrics, The International Biometric Society, vol. 64(3), pages 707-715, September.
    6. Yanyuan Ma & Liping Zhu, 2013. "Efficiency loss and the linearity condition in dimension reduction," Biometrika, Biometrika Trust, vol. 100(2), pages 371-383.
    7. Yanyuan Ma & Liping Zhu, 2012. "A Semiparametric Approach to Dimension Reduction," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 168-179, March.
    8. Jing Qin & Biao Zhang, 2007. "Empirical‐likelihood‐based inference in missing response problems and its application in observational studies," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(1), pages 101-122, February.
    9. Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2003. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometrica, Econometric Society, vol. 71(4), pages 1161-1189, July.
    10. Peisong Han & Lu Wang, 2013. "Estimation with missing data: beyond double robustness," Biometrika, Biometrika Trust, vol. 100(2), pages 417-430.
    11. Qihua Wang & J. N. K. Rao, 2002. "Empirical Likelihood‐based Inference in Linear Models with Missing Data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 29(3), pages 563-576, September.
    12. Qin, Jing & Shao, Jun & Zhang, Biao, 2008. "Efficient and Doubly Robust Imputation for Covariate-Dependent Missing Responses," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 797-810, June.
    13. Daniel B. Rubin & Mark J. van der Laan, 2008. "Empirical Efficiency Maximization: Improved Locally Efficient Covariate Adjustment in Randomized Experiments and Survival Analysis," The International Journal of Biostatistics, De Gruyter, vol. 4(1), pages 1-42, May.
    14. Qihua Wang & Oliver Linton & Wolfgang Härdle, 2004. "Semiparametric Regression Analysis With Missing Response at Random," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 334-345, January.
    15. Ming-Yueh Huang & Kwun Chuen Gary Chan, 2017. "Joint sufficient dimension reduction and estimation of conditional and average treatment effects," Biometrika, Biometrika Trust, vol. 104(3), pages 583-596.
    16. Wei Luo & Yeying Zhu & Debashis Ghosh, 2017. "On estimating regression-based causal effects using sufficient dimension reduction," Biometrika, Biometrika Trust, vol. 104(1), pages 51-65.
    17. Michael Healy & Michael Westmacott, 1956. "Missing Values in Experiments Analysed on Automatic Computers," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 5(3), pages 203-206, November.
    18. Qihua Wang, 2002. "Empirical likelihood-based inference in linear errors-in-covariables models with validation data," Biometrika, Biometrika Trust, vol. 89(2), pages 345-358, June.
    19. Zonghui Hu & Dean A. Follmann & Jing Qin, 2012. "Semiparametric Double Balancing Score Estimation for Incomplete Data With Ignorable Missingness," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 247-257, March.
    20. Peisong Han, 2014. "Multiply Robust Estimation in Regression Analysis With Missing Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1159-1173, September.
    21. Jinyong Hahn, 1998. "On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects," Econometrica, Econometric Society, vol. 66(2), pages 315-332, March.
    22. Heejung Bang & James M. Robins, 2005. "Doubly Robust Estimation in Missing Data and Causal Inference Models," Biometrics, The International Biometric Society, vol. 61(4), pages 962-973, December.
    23. James J. Heckman & Hidehiko Ichimura & Petra Todd, 1998. "Matching As An Econometric Evaluation Estimator," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 65(2), pages 261-294.
    24. Lexin Li & Liping Zhu & Lixing Zhu, 2011. "Inference on the primary parameter of interest with the aid of dimension reduction estimation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(1), pages 59-80, January.
    25. Yongjin Li & Qihua Wang & Liping Zhu & Xiaobo Ding, 2017. "Mean response estimation with missing response in the presence of high-dimensional covariates," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 46(2), pages 628-643, January.
    26. Han, Peisong, 2012. "A note on improving the efficiency of inverse probability weighted estimator using the augmentation term," Statistics & Probability Letters, Elsevier, vol. 82(12), pages 2221-2228.
    27. Andrea Rotnitzky & Quanhong Lei & Mariela Sued & James M. Robins, 2012. "Improved double-robust estimation in missing data and causal inference models," Biometrika, Biometrika Trust, vol. 99(2), pages 439-456.
    28. Sixia Chen & David Haziza, 2017. "Multiply robust imputation procedures for the treatment of item nonresponse in surveys," Biometrika, Biometrika Trust, vol. 104(2), pages 439-453.

    Citations


    Cited by:

    1. Zhongqi Liang & Qihua Wang & Yuting Wei, 2022. "Robust model selection with covariables missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(3), pages 539-557, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shixiao Zhang & Peisong Han & Changbao Wu, 2023. "Calibration Techniques Encompassing Survey Sampling, Missing Data Analysis and Causal Inference," International Statistical Review, International Statistical Institute, vol. 91(2), pages 165-192, August.
    2. Jianxuan Liu & Yanyuan Ma & Lan Wang, 2018. "An alternative robust estimator of average treatment effect in causal inference," Biometrics, The International Biometric Society, vol. 74(3), pages 910-923, September.
    3. Peisong Han & Linglong Kong & Jiwei Zhao & Xingcai Zhou, 2019. "A general framework for quantile estimation with incomplete data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(2), pages 305-333, April.
    4. Peisong Han, 2014. "Multiply Robust Estimation in Regression Analysis With Missing Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1159-1173, September.
    5. Xiaogang Duan & Guosheng Yin, 2017. "Ensemble Approaches to Estimating the Population Mean with Missing Response," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 44(4), pages 899-917, December.
    6. Difang Huang & Jiti Gao & Tatsushi Oka, 2022. "Semiparametric Single-Index Estimation for Average Treatment Effects," Papers 2206.08503, arXiv.org, revised Apr 2024.
    7. Hamori, Shigeyuki & Motegi, Kaiji & Zhang, Zheng, 2019. "Calibration estimation of semiparametric copula models with data missing at random," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 85-109.
    8. Han, Peisong & Song, Peter X.-K. & Wang, Lu, 2015. "Achieving semiparametric efficiency bound in longitudinal data analysis with dropouts," Journal of Multivariate Analysis, Elsevier, vol. 135(C), pages 59-70.
    9. Guo, Xu & Fang, Yun & Zhu, Xuehu & Xu, Wangli & Zhu, Lixing, 2018. "Semiparametric double robust and efficient estimation for mean functionals with response missing at random," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 325-339.
    10. Y Cui & E J Tchetgen Tchetgen, 2024. "Selective machine learning of doubly robust functionals," Biometrika, Biometrika Trust, vol. 111(2), pages 517-535.
    11. Su, Miaomiao & Wang, Qihua, 2022. "A convex programming solution based debiased estimator for quantile with missing response and high-dimensional covariables," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    12. Iván Díaz & Elizabeth Colantuoni & Daniel F. Hanley & Michael Rosenblum, 2019. "Improved precision in the analysis of randomized trials with survival outcomes, without assuming proportional hazards," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 25(3), pages 439-468, July.
    13. Peisong Han, 2016. "Combining Inverse Probability Weighting and Multiple Imputation to Improve Robustness of Estimation," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 246-260, March.
    14. Shu Yang & Yunshu Zhang, 2023. "Multiply robust matching estimators of average and quantile treatment effects," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(1), pages 235-265, March.
    15. Kwun Chuen Gary Chan & Sheung Chi Phillip Yam & Zheng Zhang, 2016. "Globally efficient non-parametric inference of average treatment effects by empirical balancing calibration weighting," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(3), pages 673-700, June.
    16. Chen, Sixia & Haziza, David, 2018. "Jackknife empirical likelihood method for multiply robust estimation with missing data," Computational Statistics & Data Analysis, Elsevier, vol. 127(C), pages 258-268.
    17. Lu Li & Niwen Zhou & Lixing Zhu, 2022. "Outcome regression-based estimation of conditional average treatment effect," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(5), pages 987-1041, October.
    18. Li, Wei & Luo, Shanshan & Xu, Wangli, 2024. "Calibrated regression estimation using empirical likelihood under data fusion," Computational Statistics & Data Analysis, Elsevier, vol. 190(C).
    19. Wang, Qihua & Lai, Peng, 2011. "Empirical likelihood calibration estimation for the median treatment difference in observational studies," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1596-1609, April.
    20. Wolfgang Härdle & Oliver Linton & Qihua Wang, 2003. "Semiparametric regression analysis with missing response at random," CeMMAP working papers 11/03, Institute for Fiscal Studies.
