IDEAS home Printed from https://ideas.repec.org/a/gam/jstats/v7y2024i3p56-943d1468558.html
   My bibliography  Save this article

Doubly Robust Estimation and Semiparametric Efficiency in Generalized Partially Linear Models with Missing Outcomes

Author

Listed:
  • Lu Wang

    (Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA)

  • Zhongzhe Ouyang

    (Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA)

  • Xihong Lin

    (Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA)

Abstract

We investigate a semiparametric generalized partially linear regression model that accommodates missing outcomes, with some covariates modeled parametrically and others nonparametrically. We propose a class of augmented inverse probability weighted (AIPW) kernel–profile estimating equations. The nonparametric component is estimated using AIPW kernel estimating equations, while parametric regression coefficients are estimated using AIPW profile estimating equations. We demonstrate the doubly robust nature of the AIPW estimators for both nonparametric and parametric components. Specifically, these estimators remain consistent if either the assumed model for the probability of missing data or that for the conditional mean of the outcome, given covariates and auxiliary variables, is correctly specified, though not necessarily both simultaneously. Additionally, the AIPW profile estimator for parametric regression coefficients is consistent and asymptotically normal under the semiparametric model defined by the generalized partially linear model on complete data, assuming that the missing data mechanism is missing at random. When both working models are correctly specified, this estimator achieves semiparametric efficiency, with its asymptotic variance reaching the efficiency bound. We validate our approach through simulations to assess the finite sample performance of the proposed estimators and apply the method to a study that investigates risk factors associated with myocardial ischemia.

Suggested Citation

  • Lu Wang & Zhongzhe Ouyang & Xihong Lin, 2024. "Doubly Robust Estimation and Semiparametric Efficiency in Generalized Partially Linear Models with Missing Outcomes," Stats, MDPI, vol. 7(3), pages 1-20, August.
  • Handle: RePEc:gam:jstats:v:7:y:2024:i:3:p:56-943:d:1468558
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-905X/7/3/56/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-905X/7/3/56/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Tao Hu & Hengjian Cui, 2010. "Robust estimates in generalised varying-coefficient partially linear models," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 22(6), pages 737-754.
    2. Song Chen & Ingrid Van Keilegom, 2013. "Estimation in semiparametric models with missing data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(4), pages 785-805, August.
    3. Wang Q. & Linton O. & Hardle W., 2004. "Semiparametric Regression Analysis With Missing Response at Random," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 334-345, January.
    4. Naisyin Wang & Raymond J. Carroll & Xihong Lin, 2005. "Efficient Semiparametric Marginal Estimation for Longitudinal/Clustered Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 147-157, March.
    5. Newey, Whitney K, 1990. "Semiparametric Efficiency Bounds," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 5(2), pages 99-135, April-Jun.
    6. Qi-Hua Wang, 2009. "Statistical estimation in partial linear models with covariate data missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 61(1), pages 47-84, March.
    7. Heejung Bang & James M. Robins, 2005. "Doubly Robust Estimation in Missing Data and Causal Inference Models," Biometrics, The International Biometric Society, vol. 61(4), pages 962-973, December.
    8. Rotnitzky, Andrea & Holcroft, Christina A. & Robins, James M., 1997. "Efficiency Comparisons in Multivariate Multiple Regression with Missing Outcomes," Journal of Multivariate Analysis, Elsevier, vol. 61(1), pages 102-128, April.
    9. Hua Liang & Suojin Wang & Raymond J. Carroll, 2007. "Partially linear models with missing response variables and error-prone covariates," Biometrika, Biometrika Trust, vol. 94(1), pages 185-198.
    10. Jafer Rahman & Shihua Luo & Yawen Fan & Xiaohui Liu, 2020. "Semiparametric efficient inferences for generalised partially linear models," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 32(3), pages 704-724, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. M. Hristache & V. Patilea, 2017. "Conditional moment models with data missing at random," Biometrika, Biometrika Trust, vol. 104(3), pages 735-742.
    2. Bryan S. Graham & Cristine Campos De Xavier Pinto & Daniel Egel, 2012. "Inverse Probability Tilting for Moment Condition Models with Missing Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(3), pages 1053-1079.
    3. Yu Shen & Han-Ying Liang, 2018. "Quantile regression and its empirical likelihood with missing response at random," Statistical Papers, Springer, vol. 59(2), pages 685-707, June.
    4. Sant’Anna, Pedro H.C. & Zhao, Jun, 2020. "Doubly robust difference-in-differences estimators," Journal of Econometrics, Elsevier, vol. 219(1), pages 101-122.
    5. Majid Mojirsheibani & Timothy Reese, 2017. "Kernel regression estimation for incomplete data with applications," Statistical Papers, Springer, vol. 58(1), pages 185-209, March.
    6. Chen, Songxi, 2012. "Estimation in semiparametric models with missing data," MPRA Paper 46216, University Library of Munich, Germany.
    7. Wangli Xu & Xu Guo & Lixing Zhu, 2012. "Goodness-of-fitting for partial linear model with missing response at random," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(1), pages 103-118.
    8. Nengxiang Ling & Rui Kan & Philippe Vieu & Shuyu Meng, 2019. "Semi-functional partially linear regression model with responses missing at random," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 82(1), pages 39-70, January.
    9. Firpo, Sergio Pinheiro & Pinto, Rafael de Carvalho Cayres, 2012. "Combining Strategies for the Estimation of Treatment Effects," Brazilian Review of Econometrics, Sociedade Brasileira de Econometria - SBE, vol. 32(1), March.
    10. Xue, Liugen & Xue, Dong, 2011. "Empirical likelihood for semiparametric regression model with missing response data," Journal of Multivariate Analysis, Elsevier, vol. 102(4), pages 723-740, April.
    11. Graham, Bryan S. & Pinto, Cristine Campos de Xavier, 2022. "Semiparametrically efficient estimation of the average linear regression function," Journal of Econometrics, Elsevier, vol. 226(1), pages 115-138.
    12. Song Chen & Ingrid Van Keilegom, 2013. "Estimation in semiparametric models with missing data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(4), pages 785-805, August.
    13. Wang, Qihua & Su, Miaomiao & Wang, Ruoyu, 2021. "A beyond multiple robust approach for missing response problem," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
    14. Nengxiang Ling & Lilei Cheng & Philippe Vieu & Hui Ding, 2022. "Missing responses at random in functional single index model for time series data," Statistical Papers, Springer, vol. 63(2), pages 665-692, April.
    15. Wangli Xu & Xu Guo, 2013. "Checking the adequacy of partial linear models with missing covariates at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(3), pages 473-490, June.
    16. Ash Abebe & Huybrechts F. Bindele & Masego Otlaadisa & Boikanyo Makubate, 2021. "Robust estimation of single index models with responses missing at random," Statistical Papers, Springer, vol. 62(5), pages 2195-2225, October.
    17. Stephens Alisa & Tchetgen Tchetgen Eric & De Gruttola Victor, 2014. "Locally Efficient Estimation of Marginal Treatment Effects When Outcomes Are Correlated: Is the Prize Worth the Chase?," The International Journal of Biostatistics, De Gruyter, vol. 10(1), pages 59-75, May.
    18. Inkmann, J., 2005. "Inverse Probability Weighted Generalised Empirical Likelihood Estimators : Firm Size and R&D Revisited," Other publications TiSEM c39cff1f-16c1-4446-a83f-c, Tilburg University, School of Economics and Management.
    19. Davide Viviano & Jelena Bradic, 2020. "Fair Policy Targeting," Papers 2005.12395, arXiv.org, revised Jun 2022.
    20. Cattaneo, Matias D., 2010. "Efficient semiparametric estimation of multi-valued treatment effects under ignorability," Journal of Econometrics, Elsevier, vol. 155(2), pages 138-154, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jstats:v:7:y:2024:i:3:p:56-943:d:1468558. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.