IDEAS home Printed from https://ideas.repec.org/a/jss/jstsof/v045i02.html
   My bibliography  Save this article

Multiple Imputation with Diagnostics (mi) in R: Opening Windows into the Black Box

Author

Listed:
  • Su, Yu-Sung
  • Gelman, Andrew
  • Hill, Jennifer
  • Yajima, Masanao

Abstract

Our mi package in R has several features that allow the user to get inside the imputation process and evaluate the reasonableness of the resulting models and imputations. These features include: choice of predictors, models, and transformations for chained imputation models; standard and binned residual plots for checking the fit of the conditional distributions used for imputation; and plots for comparing the distributions of observed and imputed data. In addition, we use Bayesian models and weakly informative prior distributions to construct more stable estimates of imputation models. Our goal is to have a demonstration package that (a) avoids many of the practical problems that arise with existing multivariate imputation programs, and (b) demonstrates state-of-the-art diagnostics that can be applied more generally and can be incorporated into the software of others.

Suggested Citation

  • Su, Yu-Sung & Gelman, Andrew & Hill, Jennifer & Yajima, Masanao, 2011. "Multiple Imputation with Diagnostics (mi) in R: Opening Windows into the Black Box," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i02).
  • Handle: RePEc:jss:jstsof:v:045:i02
    DOI: http://hdl.handle.net/10.18637/jss.v045.i02
    as

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v045i02/v45i02.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v045i02/mi_0.09-13.tar.gz
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v045i02/v45i02.R
    Download Restriction: no

    File URL: https://libkey.io/http://hdl.handle.net/10.18637/jss.v045.i02?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. repec:jss:jstsof:45:i01 is not listed on IDEAS
    2. Tendeiro, Jorge N. & Meijer, Rob R. & Niessen, A. Susan M., 2016. "PerFit: An R Package for Person-Fit Analysis in IRT," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 74(i05).
    3. Joost Ginkel & Pieter Kroonenberg, 2014. "Using Generalized Procrustes Analysis for Multiple Imputation in Principal Component Analysis," Journal of Classification, Springer;The Classification Society, vol. 31(2), pages 242-269, July.
    4. Christian Seiler, 2013. "Nonresponse in Business Tendency Surveys: Theoretical Discourse and Empirical Evidence," ifo Beiträge zur Wirtschaftsforschung, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, number 52, May.
    5. Richard E Neapolitan & Xia Jiang, 2015. "Study of Integrated Heterogeneous Data Reveals Prognostic Power of Gene Expression for Breast Cancer Survival," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-16, February.
    6. Cheng, Xiaoyue & Cook, Dianne & Hofmann, Heike, 2015. "Visually Exploring Missing Values in Multivariable Data Using a Graphical User Interface," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 68(i06).
    7. Labrecque, Jeremy A. & Kaufman, Jay S. & Balzer, Laura B. & Maclehose, Richard F. & Strumpf, Erin C. & Matijasevich, Alicia & Santos, Iná S. & Schmidt, Kelen H. & Barros, Aluísio J.D., 2018. "Effect of a conditional cash transfer program on length-for-age and weight-for-age in Brazilian infants at 24 months using doubly-robust, targeted estimation," Social Science & Medicine, Elsevier, vol. 211(C), pages 9-15.
    8. Robbins Michael W., 2014. "The Utility of Nonparametric Transformations for Imputation of Survey Data," Journal of Official Statistics, Sciendo, vol. 30(4), pages 675-700, December.
    9. Humera Razzak & Christian Heumann, 2019. "Hybrid Multiple Imputation In A Large Scale Complex Survey," Statistics in Transition New Series, Polish Statistical Association, vol. 20(4), pages 33-58, December.
    10. Josse, Julie & Husson, François, 2016. "missMDA: A Package for Handling Missing Values in Multivariate Data Analysis," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 70(i01).
    11. Elizabeth Duthie & Diogo Veríssimo & Aidan Keane & Andrew T Knight, 2017. "The effectiveness of celebrities in conservation marketing," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-16, July.
    12. Florian Meinfelder, 2014. "Multiple Imputation: an attempt to retell the evolutionary process," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 8(4), pages 249-267, November.
    13. Takashi Sugimoto & Tomohiro Shinozaki & Takashi Naruse & Yuki Miyamoto, 2014. "Who Was Concerned about Radiation, Food Safety, and Natural Disasters after the Great East Japan Earthquake and Fukushima Catastrophe? A Nationwide Cross-Sectional Survey in 2012," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-8, September.
    14. Gerko Vink & Laurence E. Frank & Jeroen Pannekoek & Stef Buuren, 2014. "Predictive mean matching imputation of semicontinuous variables," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(1), pages 61-90, February.
    15. Razzak Humera & Heumann Christian, 2019. "Hybrid Multiple Imputation In A Large Scale Complex Survey," Statistics in Transition New Series, Polish Statistical Association, vol. 20(4), pages 33-58, December.
    16. Thomas R. Belin, 2017. "TRIVELLORE RAGHUNATHAN . Missing Data Analysis in Practice . Boca Raton : CRC Press," Biometrics, The International Biometric Society, vol. 73(3), pages 1059-1060, September.
    17. repec:jss:jstsof:45:i03 is not listed on IDEAS
    18. Gary K Chen & Eric C Chi & John Michael O Ranola & Kenneth Lange, 2015. "Convex Clustering: An Attractive Alternative to Hierarchical Clustering," PLOS Computational Biology, Public Library of Science, vol. 11(5), pages 1-31, May.
    19. Speidel, Matthias & Drechsler, Jörg & Jolani, Shahab, 2018. "R package hmi: a convenient tool for hierarchical multiple imputation and beyond," IAB-Discussion Paper 201816, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    20. Rashid, S. & Mitra, R. & Steele, R.J., 2015. "Using mixtures of t densities to make inferences in the presence of missing data with a small number of multiply imputed data sets," Computational Statistics & Data Analysis, Elsevier, vol. 92(C), pages 84-96.
    21. Christos T Nakas & Narayan Schütz & Marcus Werners & Alexander B Leichtle, 2016. "Accuracy and Calibration of Computational Approaches for Inpatient Mortality Predictive Modeling," PLOS ONE, Public Library of Science, vol. 11(7), pages 1-11, July.
    22. Adel Bosch & Steven F. Koch, 2021. "Individual and Household Debt: Does Imputation Choice Matter?," Working Papers 202141, University of Pretoria, Department of Economics.
    23. Oberski, Daniel, 2014. "lavaan.survey: An R Package for Complex Survey Analysis of Structural Equation Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 57(i01).
    24. G. Robin Gauthier & Patricia Wonch Hill & Julia McQuillan & Amy N. Spiegel & Judy Diamond, 2017. "The Potential Scientist’s Dilemma: How the Masculine Framing of Science Shapes Friendships and Science Job Aspirations," Social Sciences, MDPI, vol. 6(1), pages 1-21, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jss:jstsof:v:045:i02. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.jstatsoft.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.