IDEAS home Printed from https://ideas.repec.org/a/tsj/stataj/v11y2011i3p403-419.html
   My bibliography  Save this article

A closer examination of three small-sample approximations to the multiple-imputation degrees of freedom

Author

Listed:
  • David A. Wagstaff

    (Pennsylvania State University)

  • Ofer Harel

    (University of Connecticut)

Abstract

Incomplete data is a common complication in applied research. In this study, we use simulation to compare two approaches to the multiple imputation of a continuous predictor: multiple imputation through chained equations and multivariate normal imputation. This study extends earlier work by being the first to 1) compare the small-sample approximations to the multiple-imputation degrees of freedom proposed by Barnard and Rubin (1999, Biometrika 86: 948– 955); Lipsitz, Parzen, and Zhao (2002, Journal of Statistical Computation and Simulation 72: 309–318); and Reiter (2007, Biometrika 94: 502–508) and 2) ask if the sampling distribution of the t statistics is in fact a Student’s t distribution with the specified degrees of freedom. In addition to varying the imputation method, we varied the number of imputa- tions (m = 5,10,20,100) that were averaged over 500,000 replications to obtain the combined estimates and standard errors for a linear model that regressed the log price of a home on its age (years) and size (square feet) in a sample of 25 observations. Six age values were randomly set equal to missing for each replication. As assessed by the absolute percentage and relative percentage bias, the two approaches performed similarly. The absolute bias of the regression coefficients for age and size was roughly −0.1% across the levels of m for both approaches; the ab- solute bias for the constant was 0.6% for the chained-equations approach and 1.0% for the multivariate normal model. The absolute biases of the standard errors for age, size, and the constant were 0.2%, 0.3%, and 1.2%, respectively. In general, the relative percentage bias was slightly smaller for the chained-equations approach. Graphical and numerical inspection of the empirical sampling distributions for the three t statistics suggested that the area from the shoulder to the tail was reasonably well approximated by a t distribution and that the small-sample ap- proximations to the multiple-imputation degrees of freedom proposed by Barnard and Rubin and by Reiter performed satisfactorily. Copyright 2011 by StataCorp LP.

Suggested Citation

  • David A. Wagstaff & Ofer Harel, 2011. "A closer examination of three small-sample approximations to the multiple-imputation degrees of freedom," Stata Journal, StataCorp LP, vol. 11(3), pages 403-419, September.
  • Handle: RePEc:tsj:stataj:v:11:y:2011:i:3:p:403-419
    Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj11-3/st0235/
    as

    Download full text from publisher

    File URL: http://www.stata-journal.com/article.html?article=st0235
    File Function: link to article purchase
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Paul D. Allison, 2000. "Multiple Imputation for Missing Data," Sociological Methods & Research, , vol. 28(3), pages 301-309, February.
    2. Jerome P. Reiter, 2007. "Small-sample degrees of freedom for multi-component significance tests with multiple imputation for missing data," Biometrika, Biometrika Trust, vol. 94(2), pages 502-508.
    3. Patrick Royston & John B. Carlin & Ian R. White, 2009. "Multiple imputation of missing values: New features for mim," Stata Journal, StataCorp LP, vol. 9(2), pages 252-264, June.
    4. Patrick Royston, 2004. "Multiple imputation of missing values," Stata Journal, StataCorp LP, vol. 4(3), pages 227-241, September.
    5. Koehler, Elizabeth & Brown, Elizabeth & Haneuse, Sebastien J.-P. A., 2009. "On the Assessment of Monte Carlo Error in Simulation-Based Statistical Analyses," The American Statistician, American Statistical Association, vol. 63(2), pages 155-162.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Llano, Carlos & Pardo, Juan & Pérez-Balsalobre, Santiago & Pérez, Julián, 2023. "Estimating multicountry tourism flows by transport mode," Annals of Tourism Research, Elsevier, vol. 103(C).
    2. Ruochen Zha & Ofer Harel, 2021. "Power calculation in multiply imputed data," Statistical Papers, Springer, vol. 62(1), pages 533-559, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yulia V. Marchenko & Jerome P. Reiter, 2009. "Improved degrees of freedom for multivariate significance tests obtained from multiply imputed, small-sample data," Stata Journal, StataCorp LP, vol. 9(3), pages 388-397, September.
    2. Patrick Royston, 2009. "Multiple imputation of missing values: Further update of ice, with an emphasis on categorical variables," Stata Journal, StataCorp LP, vol. 9(3), pages 466-477, September.
    3. Elke Lüdemann, 2011. "Schooling and the Formation of Cognitive and Non-cognitive Outcomes," ifo Beiträge zur Wirtschaftsforschung, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, number 39.
    4. Paul Watts & Gemma Phillips & Mark Petticrew & Richard Hayes & Christian Bottomley & Ge Yu & Elena Schmidt & Patrick Tobi & Derek Moore & Caroline Frostick & Karen Lock & Adrian Renton, 2013. "Physical Activity in Deprived Communities in London: Examining Individual and Neighbourhood-Level Factors," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-8, July.
    5. Andrew F Auld & Francisco Mbofana & Ray W Shiraishi & Charity Alfredo & Mauro Sanchez & Tedd V Ellerbrock & Lisa J Nelson, 2013. "Incidence and Determinants of Tuberculosis among Adults Initiating Antiretroviral Therapy – Mozambique, 2004–2008," PLOS ONE, Public Library of Science, vol. 8(1), pages 1-11, January.
    6. Peter A. Lachenbruch, 2010. "Stata tip 89: Estimating means and percentiles following multiple imputation," Stata Journal, StataCorp LP, vol. 10(3), pages 496-499, September.
    7. Ian R. White, 2010. "simsum: Analyses of simulation studies including Monte Carlo error," Stata Journal, StataCorp LP, vol. 10(3), pages 369-385, September.
    8. White, Ian R. & Daniel, Rhian & Royston, Patrick, 2010. "Avoiding bias due to perfect prediction in multiple imputation of incomplete categorical variables," Computational Statistics & Data Analysis, Elsevier, vol. 54(10), pages 2267-2275, October.
    9. Harald Schoen, 2008. "Identity, Instrumental Self-Interest and Institutional Evaluations," European Union Politics, , vol. 9(1), pages 5-29, March.
    10. Julia S. Goldberg, 2011. "Identity Salience and Involvement among Resident and Nonresident Fathers," Working Papers 1323, Princeton University, School of Public and International Affairs, Center for Research on Child Wellbeing..
    11. Hilde Coffe & Catherine Bolzendahl, 2011. "Gender Gaps in Political Participation Across Sub-Saharan African Nations," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 102(2), pages 245-264, June.
    12. Ronald Mincy & Jennifer Hill & Marilyn Sinkewicz, 2009. "Marriage: Cause or mere indicator of future earnings growth?," Journal of Policy Analysis and Management, John Wiley & Sons, Ltd., vol. 28(3), pages 417-439.
    13. John Bound & Michael F. Lovenheim & Sarah Turner, 2010. "Why Have College Completion Rates Declined? An Analysis of Changing Student Preparation and Collegiate Resources," American Economic Journal: Applied Economics, American Economic Association, vol. 2(3), pages 129-157, July.
    14. repec:mpr:mprres:7638 is not listed on IDEAS
    15. von Hinke Kessler Scholder S, 2009. "Genetic Markers as Instrumental Variables: An Application to Child Fat Mass and Academic Achievement," Health, Econometrics and Data Group (HEDG) Working Papers 09/25, HEDG, c/o Department of Economics, University of York.
    16. Natalia Krüger & Axel McCallum & Víctor Volman, 2020. "Segregación escolar por nivel socioeconómico: disparidades entre las provincias argentinas," Asociación Argentina de Economía Política: Working Papers 4362, Asociación Argentina de Economía Política.
    17. Yongwei Chen & Dahai Fu, 2015. "Measuring income inequality using survey data: the case of China," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 13(2), pages 299-307, June.
    18. Anna K Stuck & Marie Méan & Andreas Limacher & Marc Righini & Kurt Jaeger & Hans-Jürg Beer & Joseph Osterwalder & Beat Frauchiger & Christian M Matter & Nils Kucher & Michael Egloff & Markus Aschwande, 2014. "The Adherence to Initial Processes of Care in Elderly Patients with Acute Venous Thromboembolism," PLOS ONE, Public Library of Science, vol. 9(7), pages 1-7, July.
    19. Ethan T. Hunt & Bridget Armstrong & Brie M. Turner-McGrievy & Michael W. Beets & Robert G. Weaver, 2021. "Differences by School Location in Summer and School Monthly Weight Change: Findings from a Nationally Representative Sample," IJERPH, MDPI, vol. 18(21), pages 1-12, November.
    20. repec:pri:crcwel:wp12-10-ff is not listed on IDEAS
    21. Shoham, David A. & Vupputuri, Suma & Kaufman, Jay S. & Kshirsagar, Abhijit V. & Diez Roux, Ana V. & Coresh, Josef & Heiss, Gerardo, 2008. "Kidney disease and the cumulative burden of life course socioeconomic conditions: The Atherosclerosis Risk in Communities (ARIC) Study," Social Science & Medicine, Elsevier, vol. 67(8), pages 1311-1320, October.
    22. Joost R. Ginkel, 2020. "Standardized Regression Coefficients and Newly Proposed Estimators for $${R}^{{2}}$$R2 in Multiply Imputed Data," Psychometrika, Springer;The Psychometric Society, vol. 85(1), pages 185-205, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tsj:stataj:v:11:y:2011:i:3:p:403-419. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum or Lisa Gilmore (email available below). General contact details of provider: http://www.stata-journal.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.