IDEAS home Printed from https://ideas.repec.org/a/sae/somere/v46y2017i4p864-897.html
   My bibliography  Save this article

Nonparametric Multiple Imputation for Questionnaires with Individual Skip Patterns and Constraints: The Case of Income Imputation in the National Educational Panel Study

Author

Listed:
  • Christian Aßmann
  • Ariane Würbach
  • Solange Goßmann
  • Ferdinand Geissler
  • Anika Bela

Abstract

Large-scale surveys typically exhibit data structures characterized by rich mutual dependencies between surveyed variables and individual-specific skip patterns. Despite high efforts in fieldwork and questionnaire design, missing values inevitably occur. One approach for handling missing values is to provide multiply imputed data sets, thus enhancing the analytical potential of the surveyed data. To preserve possible nonlinear relationships among variables and incorporate skip patterns that make the full conditional distributions individual specific, we adapt a full conditional multiple imputation approach based on sequential classification and regression trees. Individual-specific skip patterns and constraints are handled within imputation in a way ensuring the consistency of the sequence of full conditional distributions. The suggested approach is illustrated in the context of income imputation in the adult cohort of the National Educational Panel Study.

Suggested Citation

  • Christian Aßmann & Ariane Würbach & Solange Goßmann & Ferdinand Geissler & Anika Bela, 2017. "Nonparametric Multiple Imputation for Questionnaires with Individual Skip Patterns and Constraints: The Case of Income Imputation in the National Educational Panel Study," Sociological Methods & Research, , vol. 46(4), pages 864-897, November.
  • Handle: RePEc:sae:somere:v:46:y:2017:i:4:p:864-897
    DOI: 10.1177/0049124115610346
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0049124115610346
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0049124115610346?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Schenker, Nathaniel & Raghunathan, Trivellore E. & Chiu, Pei-Lu & Makuc, Diane M. & Zhang, Guangyu & Cohen, Alan J., 2006. "Multiple Imputation of Missing Income Data in the National Health Interview Survey," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 924-933, September.
    2. Hapfelmeier, A. & Hothorn, T. & Ulm, K., 2012. "Recursive partitioning on incomplete data using surrogate decisions and multiple imputation," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1552-1565.
    3. Rubin, Donald B., 2004. "The Design of a General and Flexible System for Handling Nonresponse in Sample Surveys," The American Statistician, American Statistical Association, vol. 58, pages 298-302, November.
    4. Little, Roderick J A, 1988. "Missing-Data Adjustments in Large Surveys: Reply," Journal of Business & Economic Statistics, American Statistical Association, vol. 6(3), pages 300-301, July.
    5. Jenkins, Stephen P., 2010. "The British Household Panel Survey and its Income Data," IZA Discussion Papers 5242, Institute of Labor Economics (IZA).
    6. Doove, L.L. & Van Buuren, S. & Dusseldorp, E., 2014. "Recursive partitioning for missing data imputation in the presence of interaction effects," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 92-104.
    7. Joachim R. Frick & Markus M. Grabka, 2007. "Item Non-response and Imputation of Annual Labor Income in Panel Surveys from a Cross-National Perspective," SOEPpapers on Multidisciplinary Panel Data Research 49, DIW Berlin, The German Socio-Economic Panel (SOEP).
    8. Regina Riphahn & Oliver Serfling, 2005. "Item non-response on income and wealth questions," Empirical Economics, Springer, vol. 30(2), pages 521-538, September.
    9. Jörg Drechsler, 2011. "Multiple imputation in practice—a case study using a complex German establishment survey," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 95(1), pages 1-26, March.
    10. Lane F. Burgette & Jerome P. Reiter, 2012. "Nonparametric Bayesian Multiple Imputation for Missing Data Due to Mid-Study Switching of Measurement Methods," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 439-449, June.
    11. Little, Roderick J A, 1988. "Missing-Data Adjustments in Large Surveys," Journal of Business & Economic Statistics, American Statistical Association, vol. 6(3), pages 287-296, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Martin, Eisele & Zhu, Junyi, 2013. "Multiple imputation in a complex household survey - the German Panel on Household Finances (PHF): challenges and solutions," MPRA Paper 57666, University Library of Munich, Germany.
    2. Juana Sanchez & Sydney Noelle Kahmann, 2017. "R&D, Attrition and Multiple Imputation in BRDIS," Working Papers 17-13, Center for Economic Studies, U.S. Census Bureau.
    3. Zachary H. Seeskin, 2016. "Evaluating the Use of Commercial Data to Improve Survey Estimates of Property Taxes," CARRA Working Papers 2016-06, Center for Economic Studies, U.S. Census Bureau.
    4. Westermeier, Christian & Grabka, Markus M., 2016. "Longitudinal Wealth Data and Multiple Imputation: An Evaluation Study," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 10(3), pages 237-252.
    5. Youngjoo Cho & Debashis Ghosh, 2021. "Quantile-Based Subgroup Identification for Randomized Clinical Trials," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 13(1), pages 90-128, April.
    6. A. R. Linero, 2017. "Bayesian nonparametric analysis of longitudinal studies in the presence of informative missingness," Biometrika, Biometrika Trust, vol. 104(2), pages 327-341.
    7. Daniel Schunk, 2006. "The German SAVE Survey: Documentation and Methodology," MEA discussion paper series 06109, Munich Center for the Economics of Aging (MEA) at the Max Planck Institute for Social Law and Social Policy.
    8. Adel Bosch & Steven F. Koch, 2021. "Individual and Household Debt: Does Imputation Choice Matter?," Working Papers 202141, University of Pretoria, Department of Economics.
    9. Joost Ginkel & Pieter Kroonenberg, 2014. "Using Generalized Procrustes Analysis for Multiple Imputation in Principal Component Analysis," Journal of Classification, Springer;The Classification Society, vol. 31(2), pages 242-269, July.
    10. Verbeek, M.J.C.M. & Nijman, T.E., 1992. "Incomplete panels and selection bias : A survey," Discussion Paper 1992-7, Tilburg University, Center for Economic Research.
    11. Hai Zhong, 2010. "The impact of missing data in the estimation of concentration index: a potential source of bias," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 11(3), pages 255-266, June.
    12. Gerko Vink & Laurence E. Frank & Jeroen Pannekoek & Stef Buuren, 2014. "Predictive mean matching imputation of semicontinuous variables," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(1), pages 61-90, February.
    13. Xiong, Ruoxuan & Pelger, Markus, 2023. "Large dimensional latent factor modeling with missing observations and applications to causal inference," Journal of Econometrics, Elsevier, vol. 233(1), pages 271-301.
    14. Dang, Hai-Anh H & Carletto, Calogero, 2022. "Recall Bias Revisited: Measure Farm Labor Using Mixed-Mode Surveys and Multiple Imputation," IZA Discussion Papers 14997, Institute of Labor Economics (IZA).
    15. Daniel Schunk, 2007. "A Markov Chain Monte Carlo Multiple Imputation Procedure for Dealing with Item Nonresponse in the German SAVE Survey," MEA discussion paper series 07121, Munich Center for the Economics of Aging (MEA) at the Max Planck Institute for Social Law and Social Policy.
    16. F. Di Lascio & Simone Giannerini & Alessandra Reale, 2015. "Exploring copulas for the imputation of complex dependent data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(1), pages 159-175, March.
    17. Ankita Patnaik & Jeffrey Hemmeter & Arif Mamun, "undated". "Promoting Readiness of Minors with Autism Spectrum Disorder: Evidence from a Randomized Controlled Trial," Mathematica Policy Research Reports a74c93d9bdce40709ad81cdbc, Mathematica Policy Research.
    18. Yanqing Sun & Li Qi & Fei Heng & Peter B. Gilbert, 2020. "A hybrid approach for the stratified mark‐specific proportional hazards model with missing covariates and missing marks, with application to vaccine efficacy trials," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(4), pages 791-814, August.
    19. Arif Mamun & David Wittenburg & Noelle Denny-Brown & Michael Levere & David Mann & Rebecca Coughlin & Sarah Croake & Heather Gordon & Denise Hoffman & Rachel Holzwart & Rosalind Keith & Brittany McGil, "undated". "Promoting Opportunity Demonstration: Interim Evaluation Report," Mathematica Policy Research Reports caa99d38a8b14f968ea3438e5, Mathematica Policy Research.
    20. Miguel Szekely & Nora Lustig & Martin Cumpa & Jose Antonio Mejia, 2004. "Do we know how much poverty there is?," Oxford Development Studies, Taylor & Francis Journals, vol. 32(4), pages 523-558.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:somere:v:46:y:2017:i:4:p:864-897. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.