IDEAS home Printed from https://ideas.repec.org/p/ldr/wpaper/13.html
   My bibliography  Save this paper

Sequential Regression Multiple Imputation for Incomplete Multivariate Data using Markov Chain Monte Carlo

Author

Listed:
  • Miguel Lacerda
  • Cally Ardington

    (SALDRU, School of Economics, University of Cape TownAuthor-Email:)

  • Murray Leibbrandt

    (SALDRU, School of Economics, University of Cape Town)

Abstract

This paper discusses the theoretical background to handling missing data in a multivariate context. Earlier methods for dealing with item non-response are reviewed, followed by an examination of some of the more modern methods and, in particular, multiple imputation. One such technique, known as sequential regression multivariate imputation, which employs a Markov chain Monte Carlo algorithm is described and implemented. It is demonstrated that distributional convergence is rapid and only a few imputations are necessary in order to produce accurate point estimates and preserve multivariate relationships, whilst adequately accounting for the uncertainty introduced by the imputation procedure. It is further shown that lower fractions of missing data and the inclusion of relevant covariates in the imputation model are desirable in terms of bias reduction.

Suggested Citation

  • Miguel Lacerda & Cally Ardington & Murray Leibbrandt, 2007. "Sequential Regression Multiple Imputation for Incomplete Multivariate Data using Markov Chain Monte Carlo," SALDRU Working Papers 13, Southern Africa Labour and Development Research Unit, University of Cape Town.
  • Handle: RePEc:ldr:wpaper:13
    as

    Download full text from publisher

    File URL: https://opensaldru.uct.ac.za/bitstream/handle/11090/41/2008_13.pdf?sequence=1
    File Function: Full text
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Patrick Royston, 2004. "Multiple imputation of missing values," Stata Journal, StataCorp LP, vol. 4(3), pages 227-241, September.
    2. Javaras, Kristin N. & Van Dyk, David A., 2003. "Multiple Imputation for Incomplete Data With Semicontinuous Variables," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 703-715, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Nic Baigrie & Katherine Eyal, 2014. "An Evaluation of the Determinants and Implications of Panel Attrition in the National Income Dynamics Survey (2008-2010)," South African Journal of Economics, Economic Society of South Africa, vol. 82(1), pages 39-65, March.
    2. Derek Yu, 2013. "Some factors influencing the comparability and reliability of poverty estimates across household surveys," Working Papers 03/2013, Stellenbosch University, Department of Economics.
    3. David McLennan & Michael Noble & Gemma Wright & Helen Barnes & Faith Masekesa, 2021. "Exploring the quality of income data in two African household surveys for the purpose of tax-benefit microsimulation modelling: Imputing employment income in Tanzania and Zambia," WIDER Working Paper Series wp-2021-134, World Institute for Development Economic Research (UNU-WIDER).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Harald Schoen, 2008. "Identity, Instrumental Self-Interest and Institutional Evaluations," European Union Politics, , vol. 9(1), pages 5-29, March.
    2. Julia S. Goldberg, 2011. "Identity Salience and Involvement among Resident and Nonresident Fathers," Working Papers 1323, Princeton University, School of Public and International Affairs, Center for Research on Child Wellbeing..
    3. John Bound & Michael F. Lovenheim & Sarah Turner, 2010. "Why Have College Completion Rates Declined? An Analysis of Changing Student Preparation and Collegiate Resources," American Economic Journal: Applied Economics, American Economic Association, vol. 2(3), pages 129-157, July.
    4. von Hinke Kessler Scholder S, 2009. "Genetic Markers as Instrumental Variables: An Application to Child Fat Mass and Academic Achievement," Health, Econometrics and Data Group (HEDG) Working Papers 09/25, HEDG, c/o Department of Economics, University of York.
    5. Elizabeth Washbrook & Paul Gregg & Carol Propper, 2014. "A decomposition analysis of the relationship between parental income and multiple child outcomes," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 177(4), pages 757-782, October.
    6. Jennifer March Augustine, 2016. "Exploring New Life Course Patterns of Mother’s Continuing Secondary and College Education," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 35(6), pages 727-755, December.
    7. Gerko Vink & Laurence E. Frank & Jeroen Pannekoek & Stef Buuren, 2014. "Predictive mean matching imputation of semicontinuous variables," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(1), pages 61-90, February.
    8. Zsuzsa Blasko & Artur Pokropek & Joanna Sikora, 2018. "Science career plans of adolescents: patterns, trends and gender divides," JRC Research Reports JRC109135, Joint Research Centre.
    9. Sonia Bhalotra & Samantha Rawlings, 2013. "Gradients of the Intergenerational Transmission of Health in Developing Countries," The Review of Economics and Statistics, MIT Press, vol. 95(2), pages 660-672, May.
    10. Martine Audibert & Yong He & Jacky Mathonnat, 2013. "Two-Period Comparison of Healthcare Demand with Income Growth and Population Aging in Rural China: Implications for Adjustment of the Healthcare Supply and Development," Working Papers halshs-00846088, HAL.
    11. Stoddard, Pamela & Handley, Margaret A. & Vargas Bustamante, Arturo & Schillinger, Dean, 2011. "The influence of indigenous status and community indigenous composition on obesity and diabetes among Mexican adults," Social Science & Medicine, Elsevier, vol. 73(11), pages 1635-1643.
    12. Manuel S. González Canché, 2017. "Financial Benefits of Rapid Student Loan Repayment: An Analytic Framework Employing Two Decades of Data," The ANNALS of the American Academy of Political and Social Science, , vol. 671(1), pages 154-182, May.
    13. Layte, R. & Christopher Whelan, 2013. "GINI DP 78: Who Feels Inferior? A Test of the Status Anxiety Hypothesis of Social Inequalities in Health," GINI Discussion Papers 78, AIAS, Amsterdam Institute for Advanced Labour Studies.
    14. Layte, Richard & Whelan, Christopher T., 2014. "Who Feels Inferior? A Test of the Status Anxiety Hypothesis of Social Inequalities in Health," Papers WP476, Economic and Social Research Institute (ESRI).
    15. Lee, Se Woong & Lee, Eunjung Alice, 2020. "Teacher qualification matters: The association between cumulative teacher qualification and students’ educational attainment," International Journal of Educational Development, Elsevier, vol. 77(C).
    16. repec:pri:crcwel:wp10-03-ff is not listed on IDEAS
    17. Westermeier, Christian & Grabka, Markus M., 2016. "Longitudinal Wealth Data and Multiple Imputation: An Evaluation Study," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 10(3), pages 237-252.
    18. Brian Walitt & Richard L Nahin & Robert S Katz & Martin J Bergman & Frederick Wolfe, 2015. "The Prevalence and Characteristics of Fibromyalgia in the 2012 National Health Interview Survey," PLOS ONE, Public Library of Science, vol. 10(9), pages 1-16, September.
    19. Martine AUDIBERT & Yong HE & Jacky MATHONNAT, 2017. "What does demand heterogeneity tell us about health care provider choice in rural China?," Working Papers P193, FERDI.
    20. Zhong, Hua & Hu, Wuyang, 2015. "Farmers’ Willingness to Engage in Best Management Practices: an Application of Multiple Imputation," 2015 Annual Meeting, January 31-February 3, 2015, Atlanta, Georgia 196962, Southern Agricultural Economics Association.
    21. repec:pri:crcwel:wp06-28-ff is not listed on IDEAS
    22. Mueller, Christoph Emanuel, 2020. "Why do residents participate in high-voltage transmission line planning procedures? Findings from two power grid expansion regions in Germany," Energy Policy, Elsevier, vol. 145(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ldr:wpaper:13. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Alison Siljeur (email available below). General contact details of provider: https://edirc.repec.org/data/sauctza.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.