IDEAS home Printed from https://ideas.repec.org/a/taf/gnstxx/v24y2012i3p629-646.html
   My bibliography  Save this article

Empirical likelihood-based hot deck imputation methods

Author

Listed:
  • Yijie Xue
  • Nicole Lazar

Abstract

The non-response problem commonly exists in survey data and has been investigated by various methods. We propose an empirical likelihood-based hot deck imputation method, which resamples the observed data by using the weights from the empirical likelihood ratio for missing values. We demonstrate that the estimator of the mean is unbiased and the corresponding variance estimator of the mean is asymptotically unbiased under mild conditions. Next, we extend our method for U-statistic estimators and show that the estimator converges to the real U-statistic in probability. The proposed method can also incorporate multiple imputations and/or regression imputations easily. Simulations and a real example illustrate that our method outperforms some of the existing approaches, such as simple hot deck imputation and fractional hot deck imputation. We conclude with a discussion of the advantages of the empirical likelihood-based hot deck imputation method.

Suggested Citation

  • Yijie Xue & Nicole Lazar, 2012. "Empirical likelihood-based hot deck imputation methods," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(3), pages 629-646.
  • Handle: RePEc:taf:gnstxx:v:24:y:2012:i:3:p:629-646
    DOI: 10.1080/10485252.2012.690879
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/10485252.2012.690879
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/10485252.2012.690879?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jae Kwang Kim, 2011. "Parametric fractional imputation for missing data analysis," Biometrika, Biometrika Trust, vol. 98(1), pages 119-132.
    2. Qin J. & Leung D. & Shao J., 2002. "Estimation With Survey Data Under Nonignorable Nonresponse or Informative Sampling," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 193-200, March.
    3. Jae Kwang Kim, 2004. "Fractional hot deck imputation," Biometrika, Biometrika Trust, vol. 91(3), pages 559-578, September.
    4. Grendar, Marian & Judge, George G, 2009. "Maximum Empirical Likelihood: Empty Set Problem," Department of Agricultural & Resource Economics, UC Berkeley, Working Paper Series qt71v338mh, Department of Agricultural & Resource Economics, UC Berkeley.
    5. Qin, Jing & Shao, Jun & Zhang, Biao, 2008. "Efficient and Doubly Robust Imputation for Covariate-Dependent Missing Responses," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 797-810, June.
    6. Rebecca R. Andridge & Roderick J. A. Little, 2010. "A Review of Hot Deck Imputation for Survey Non‐response," International Statistical Review, International Statistical Institute, vol. 78(1), pages 40-64, April.
    7. Marc Aerts, 2002. "Local multiple imputation," Biometrika, Biometrika Trust, vol. 89(2), pages 375-388, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ton de Waal & Wieger Coutinho, 2017. "Preserving Logical Relations while Estimating Missing Values," Romanian Statistical Review, Romanian Statistical Review, vol. 65(3), pages 47-59, September.
    2. Chen, Sixia & Haziza, David, 2023. "A unified framework of multiply robust estimation approaches for handling incomplete data," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    3. Aiai Yu & Yujie Zhong & Xingdong Feng & Ying Wei, 2023. "Quantile regression for nonignorable missing data with its application of analyzing electronic medical records," Biometrics, The International Biometric Society, vol. 79(3), pages 2036-2049, September.
    4. Nicklas Pettersson, 2013. "Bias reduction of finite population imputation by kernel methods," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 14(1), pages 139-160, March.
    5. Zhan Liu & Chun Yip Yau, 2022. "A propensity score adjustment method for longitudinal time series models under nonignorable nonresponse," Statistical Papers, Springer, vol. 63(1), pages 317-342, February.
    6. Xiaojun Mao & Zhonglei Wang & Shu Yang, 2023. "Matrix completion under complex survey sampling," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(3), pages 463-492, June.
    7. Danhyang Lee & Jae Kwang Kim, 2022. "Semiparametric imputation using conditional Gaussian mixture models under item nonresponse," Biometrics, The International Biometric Society, vol. 78(1), pages 227-237, March.
    8. Hamori, Shigeyuki & Motegi, Kaiji & Zhang, Zheng, 2019. "Calibration estimation of semiparametric copula models with data missing at random," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 85-109.
    9. Zhong, Ping-Shou & Chen, Sixia, 2014. "Jackknife empirical likelihood inference with regression imputation and survey data," Journal of Multivariate Analysis, Elsevier, vol. 129(C), pages 193-205.
    10. Olanrewaju Akande & Gabriel Madson & D. Sunshine Hillygus & Jerome P. Reiter, 2021. "Leveraging auxiliary information on marginal distributions in nonignorable models for item and unit nonresponse," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(2), pages 643-662, April.
    11. Raymundo M. Campos-Vázquez, 2013. "Efectos de los ingresos no reportados en el nivel y tendencia de la pobreza laboral en México," Ensayos Revista de Economia, Universidad Autonoma de Nuevo Leon, Facultad de Economia, vol. 0(2), pages 23-54, November.
    12. Shu Yang & Jae Kwang Kim, 2016. "Likelihood-based Inference with Missing Data Under Missing-at-Random," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(2), pages 436-454, June.
    13. Paul T. von Hippel, 2013. "Should a Normal Imputation Model be Modified to Impute Skewed Variables?," Sociological Methods & Research, , vol. 42(1), pages 105-138, February.
    14. Siedschlag Iulia & Kaitila Ville & McQuinn John & Zhang Xiaoheng, 2014. "International Investment and Firm Performance: Empirical Evidence from Small Open Economies," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 234(6), pages 662-687, December.
    15. Miller, Elizabeth A. & Paschall, Katherine W. & Azar, Sandra T., 2017. "Latent classes of older foster youth: Prospective associations with outcomes and exits from the foster care system during the transition to adulthood," Children and Youth Services Review, Elsevier, vol. 79(C), pages 495-505.
    16. Takahiro Hoshino & Yuya Shimizu, 2019. "Doubly Robust-type Estimation of Population Moments and Parameters in Biased Sampling," Keio-IES Discussion Paper Series 2019-006, Institute for Economics Studies, Keio University.
    17. Meyer, Bruce D. & Mittag, Nikolas, 2019. "Combining Administrative and Survey Data to Improve Income Measurement," IZA Discussion Papers 12266, Institute of Labor Economics (IZA).
    18. Nancy, Jane Y. & Khanna, Nehemiah H. & Arputharaj, Kannan, 2017. "Imputing missing values in unevenly spaced clinical time series data to build an effective temporal classification framework," Computational Statistics & Data Analysis, Elsevier, vol. 112(C), pages 63-79.
    19. Zhang, Ting & Wang, Lei, 2020. "Smoothed empirical likelihood inference and variable selection for quantile regression with nonignorable missing response," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    20. Zhang, Jing & Wang, Qihua & Kang, Jian, 2020. "Feature screening under missing indicator imputation with non-ignorable missing response," Computational Statistics & Data Analysis, Elsevier, vol. 149(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:gnstxx:v:24:y:2012:i:3:p:629-646. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/GNST20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.