Multiple Imputation for Combined-survey Estimation With Incomplete Regressors in One but Not Both Surveys

Author

Listed:
  • Michael S. Rendall
  • Bonnie Ghosh-Dastidar
  • Margaret M. Weden
  • Elizabeth H. Baker
  • Zafar Nazarov

Abstract

Within-survey multiple imputation (MI) methods are adapted to pooled-survey regression estimation where one survey has more regressors, but typically fewer observations, than the other. This adaptation is achieved through (1) larger numbers of imputations to compensate for the higher fraction of missing values, (2) model-fit statistics to check the assumption that the two surveys sample from a common universe, and (3) specifying the analysis model completely from variables present in the survey with the larger set of regressors, thereby excluding variables never jointly observed. In contrast to the typical within-survey MI context, cross-survey missingness is monotonic and easily satisfies the missing at random assumption needed for unbiased MI. Large efficiency gains and substantial reduction in omitted variable bias are demonstrated in an application to sociodemographic differences in the risk of child obesity estimated from two nationally representative cohort surveys.
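The cross-survey imputation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a linear analysis model on simulated data, whereas the paper's application concerns child obesity risk. Survey A is small and records the regressor z; survey B is large and does not. All function names, sample sizes, and coefficient values below are hypothetical. The imputation model for z is fit on survey A only, includes the outcome y, and its parameters are redrawn for each imputation before Rubin's rules combine the pooled-sample estimates. The model-fit check of the common-universe assumption, point (2) in the abstract, is not shown.

```python
import numpy as np


def ols(X, y):
    """Least-squares fit: coefficients, (X'X)^-1, residual variance, residual df."""
    xtx_inv = np.linalg.inv(X.T @ X)
    beta = xtx_inv @ X.T @ y
    resid = y - X @ beta
    df = X.shape[0] - X.shape[1]
    return beta, xtx_inv, resid @ resid / df, df


def simulate_surveys(rng, n_a=300, n_b=3000):
    """Hypothetical data: both surveys sample the same population, but only
    survey A (the smaller one) records the regressor z."""
    def draw(n):
        x = rng.normal(size=n)
        z = 0.5 * x + rng.normal(size=n)
        y = 1.0 + 0.8 * x + 0.6 * z + rng.normal(size=n)
        return y, x, z
    y_a, x_a, z_a = draw(n_a)
    y_b, x_b, _ = draw(n_b)  # z is discarded: never observed in survey B
    return (y_a, x_a, z_a), (y_b, x_b)


def cross_survey_mi(survey_a, survey_b, m=50, seed=0):
    """Impute z for survey B m times, fit y ~ 1 + x + z on the pooled data
    each time, and combine the results with Rubin's rules."""
    rng = np.random.default_rng(seed)
    y_a, x_a, z_a = survey_a
    y_b, x_b = survey_b

    # Imputation model z ~ 1 + x + y, estimated from survey A alone.
    W_a = np.column_stack([np.ones_like(x_a), x_a, y_a])
    W_b = np.column_stack([np.ones_like(x_b), x_b, y_b])
    g_hat, wtw_inv, s2_hat, df = ols(W_a, z_a)

    y = np.concatenate([y_a, y_b])
    x = np.concatenate([x_a, x_b])
    coefs, variances = [], []
    for _ in range(m):
        # Redraw the imputation-model parameters so that each imputation
        # reflects uncertainty about the model, not only residual noise.
        s2 = s2_hat * df / rng.chisquare(df)
        g = rng.multivariate_normal(g_hat, s2 * wtw_inv)
        z_b = W_b @ g + rng.normal(scale=np.sqrt(s2), size=len(x_b))

        # Analysis model on the pooled, completed data.
        X = np.column_stack([np.ones_like(y), x, np.concatenate([z_a, z_b])])
        beta, xtx_inv, sigma2, _ = ols(X, y)
        coefs.append(beta)
        variances.append(sigma2 * np.diag(xtx_inv))

    coefs, variances = np.array(coefs), np.array(variances)
    within = variances.mean(axis=0)          # average within-imputation variance
    between = coefs.var(axis=0, ddof=1)      # between-imputation variance
    total = within + (1 + 1 / m) * between   # Rubin's total variance
    return coefs.mean(axis=0), np.sqrt(total)


if __name__ == "__main__":
    surveys = simulate_surveys(np.random.default_rng(42))
    estimates, std_errors = cross_survey_mi(*surveys, m=50)
    print("coefficients (intercept, x, z):", estimates)
    print("standard errors:", std_errors)
```

Because z is missing for every survey B observation, the between-imputation variance makes up a large share of the total, which is why the sketch defaults to a larger m than the handful of imputations common in within-survey use.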

Suggested Citation

  • Michael S. Rendall & Bonnie Ghosh-Dastidar & Margaret M. Weden & Elizabeth H. Baker & Zafar Nazarov, 2013. "Multiple Imputation for Combined-survey Estimation With Incomplete Regressors in One but Not Both Surveys," Sociological Methods & Research, vol. 42(4), pages 483-530, November.
  • Handle: RePEc:sae:somere:v:42:y:2013:i:4:p:483-530
    DOI: 10.1177/0049124113502947

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0049124113502947
    Download Restriction: no


    References listed on IDEAS

    1. Judith K. Hellerstein & Guido W. Imbens, 1999. "Imposing Moment Restrictions From Auxiliary Data By Weighting," The Review of Economics and Statistics, MIT Press, vol. 81(1), pages 1-14, February.
    2. John Fitzgerald & Peter Gottschalk & Robert Moffitt, 1998. "An Analysis of Sample Attrition in Panel Data: The Michigan Panel Study of Income Dynamics," Journal of Human Resources, University of Wisconsin Press, vol. 33(2), pages 251-299.
    3. Rodgers, Willard L, 1984. "An Evaluation of Statistical Matching," Journal of Business & Economic Statistics, American Statistical Association, vol. 2(1), pages 91-102, January.
    4. Michael Rendall & Mark Handcock & Stefan Jonsson, 2009. "Bayesian estimation of hispanic fertility hazards from survey and population data," Demography, Springer;Population Association of America (PAA), vol. 46(1), pages 65-83, February.
    5. Patricia M. Anderson & Kristin F. Butcher, 2006. "Reading, Writing, and Refreshments: Are School Finances Contributing to Children’s Obesity?," Journal of Human Resources, University of Wisconsin Press, vol. 41(3).
    6. Michael Rendall & Margaret Weden & Melissa Favreault & Hilary Waldron, 2011. "The Protective Effect of Marriage for Survival: A Review and Update," Demography, Springer;Population Association of America (PAA), vol. 48(2), pages 481-506, May.
    7. Renato Assunção & Carl Schmertmann & Joseph Potter & Suzana Cavenaghi, 2005. "Empirical bayes estimation of demographic schedules for small areas," Demography, Springer;Population Association of America (PAA), vol. 42(3), pages 537-558, August.
    8. Vicki Freedman & Douglas Wolf, 1995. "A case study on the use of multiple imputation," Demography, Springer;Population Association of America (PAA), vol. 32(3), pages 459-470, August.
    9. Rendall, Michael S. & Ghosh-Dastidar, Bonnie & Weden, Margaret M. & Nazarov, Zafar, 2011. "Multiple Imputation for Combined-Survey Estimation With Incomplete Regressors In One But Not Both Surveys," Working Papers 887-1, RAND Corporation.
    10. Guido W. Imbens & Tony Lancaster, 1994. "Combining Micro and Macro Data in Microeconometric Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 61(4), pages 655-680.
    11. Classen, Timothy & Hokayem, Charles, 2005. "Childhood influences on youth obesity," Economics & Human Biology, Elsevier, vol. 3(2), pages 165-187, July.
    12. Mark Handcock & Sami Huovilainen & Michael Rendall, 2000. "Combining registration-system and survey data to estimate birth probabilities," Demography, Springer;Population Association of America (PAA), vol. 37(2), pages 187-192, May.
    13. McCloskey, Donald N, 1985. "The Loss Function Has Been Mislaid: The Rhetoric of Significance Tests," American Economic Review, American Economic Association, vol. 75(2), pages 201-205, May.
    14. Michael Rendall & Ryan Admiraal & Alessandra DeRose & Paola DiGiulio & Mark Handcock & Filomena Racioppi, 2008. "Population constraints on pooled surveys in demographic hazard modeling," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 17(4), pages 519-539, October.
    15. Moriarity, Chris & Scheuren, Fritz, 2003. "A Note on Rubin's Statistical Matching Using File Concatenation with Adjusted Weights and Multiple Imputations," Journal of Business & Economic Statistics, American Statistical Association, vol. 21(1), pages 65-73, January.
    16. Ai, Chunrong & Norton, Edward C., 2003. "Interaction terms in logit and probit models," Economics Letters, Elsevier, vol. 80(1), pages 123-129, July.
    17. Pfeffermann, Danny & Sverchkov, Michail, 2007. "Small-Area Estimation Under Informative Probability Sampling of Areas and Within the Selected Areas," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1427-1439, December.
    18. Ridder, Geert & Moffitt, Robert, 2007. "The Econometrics of Data Combination," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 75, Elsevier.
    19. Rubin, Donald B, 1986. "Statistical Matching Using File Concatenation with Adjusted Weights and Multiple Imputations," Journal of Business & Economic Statistics, American Statistical Association, vol. 4(1), pages 87-94, January.
    20. Michael S. Rendall & Bonnie Ghosh-Dastidar & Margaret M. Weden & Zafar Nazarov, 2011. "Multiple Imputation for Combined-Survey Estimation With Incomplete Regressors In One But Not Both Surveys," Working Papers WR-887-1, RAND Corporation.
    21. Kimbro, R.T. & Brooks-Gunn, J. & McLanahan, S., 2007. "Racial and ethnic differentials in overweight and obesity among 3-year-old children," American Journal of Public Health, American Public Health Association, vol. 97(2), pages 298-305.

    Citations

    Cited by:

    1. Angela Greulich & Michael Rendall, 2014. "Multiple imputation for demographic hazard models with left-censored predictor variables," Working Papers hal-01298942, HAL.
    2. Angela Greulich & Michael Rendall, 2014. "Multiple imputation for demographic hazard models with left-censored predictor variables," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-01298942, HAL.
    3. Catalina Amuedo-Dorantes & Mary J. Lopez, 2018. "Impeding or Accelerating Assimilation? Immigration Enforcement and Its Impact on Naturalization Patterns," RF Berlin - CReAM Discussion Paper Series 1814, Rockwool Foundation Berlin (RF Berlin) - Centre for Research and Analysis of Migration (CReAM).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael S. Rendall & Bonnie Ghosh-Dastidar & Margaret M. Weden & Zafar Nazarov, 2011. "Multiple Imputation for Combined-Survey Estimation With Incomplete Regressors In One But Not Both Surveys," Working Papers WR-887-1, RAND Corporation.
    2. Michael S. Rendall & Mark S. Handcock & Stefan H. Jonsson, 2007. "Bayesian Estimation of Hispanic Fertility Hazards from Survey and Population Data," Working Papers WR-496, RAND Corporation.
    3. Michael Rendall & Ryan Admiraal & Alessandra DeRose & Paola DiGiulio & Mark Handcock & Filomena Racioppi, 2008. "Population constraints on pooled surveys in demographic hazard modeling," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 17(4), pages 519-539, October.
    4. Kiesl, Hans & Rässler, Susanne, 2006. "How valid can data fusion be?," IAB-Discussion Paper 200615, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    5. F Bravo, 2008. "Efficient M-estimators with auxiliary information," Discussion Papers 08/26, Department of Economics, University of York.
    6. Igari, Ryosuke & Hoshino, Takahiro, 2018. "A Bayesian data combination approach for repeated durations under unobserved missing indicators: Application to interpurchase-timing in marketing," Computational Statistics & Data Analysis, Elsevier, vol. 126(C), pages 150-166.
    7. Devereux, Paul J. & Tripathi, Gautam, 2009. "Optimally combining censored and uncensored datasets," Journal of Econometrics, Elsevier, vol. 151(1), pages 17-32, July.
    8. Liu, Tianqing & Yuan, Xiaohui, 2012. "Combining quasi and empirical likelihoods in generalized linear models with missing responses," Journal of Multivariate Analysis, Elsevier, vol. 111(C), pages 39-58.
    9. Hirukawa, Masayuki & Prokhorov, Artem, 2018. "Consistent estimation of linear regression models using matched data," Journal of Econometrics, Elsevier, vol. 203(2), pages 344-358.
    10. Bryan S. Graham & Cristine Campos De Xavier Pinto & Daniel Egel, 2012. "Inverse Probability Tilting for Moment Condition Models with Missing Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(3), pages 1053-1079.
    11. Bayram, Deniz & Dayé, Modeste, 2014. "Asymptotic Properties of the Weighted Least Squares Estimator Under Moments Restriction," MPRA Paper 60465, University Library of Munich, Germany.
    12. Esmeralda A. Ramalho & Richard J. Smith, 2013. "Discrete Choice Non-Response," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 80(1), pages 343-364.
    13. Michael S. Rendall & Ryan Admiraal & Alessandra De Rose & Paola Di Giulio & Mark S. Handcock & Filomena Racioppi, 2006. "Population constraints on pooled surveys in demographic hazard modeling," MPIDR Working Papers WP-2006-039, Max Planck Institute for Demographic Research, Rostock, Germany.
    14. Esmeralda A. Ramalho & Joaquim J. S. Ramalho & Rui Evangelista, 2017. "Combining micro and macro data in hedonic price indexes," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 26(2), pages 317-332, June.
    15. Ahfock, Daniel & Pyne, Saumyadipta & Lee, Sharon X. & McLachlan, Geoffrey J., 2016. "Partial identification in the statistical matching problem," Computational Statistics & Data Analysis, Elsevier, vol. 104(C), pages 79-90.
    16. Michael S. Rendall & Mark S. Handcock & Stefan H. Jonsson, 2007. "Bayesian Estimation of Hispanic Fertility Hazards from Survey and Population Data," Working Papers 496, RAND Corporation.
    17. Ryosuke Igari & Takahiro Hoshino, 2018. "A Bayesian Gamma Frailty Model Using the Sum of Independent Random Variables: Application of the Estimation of an Interpurchase Timing Model," Keio-IES Discussion Paper Series 2018-021, Institute for Economics Studies, Keio University.
    18. Keisuke Hirano & Guido W. Imbens & Geert Ridder & Donald B. Rubin, 2001. "Combining Panel Data Sets with Attrition and Refreshment Samples," Econometrica, Econometric Society, vol. 69(6), pages 1645-1659, November.
    19. John Fitzgerald & Peter Gottschalk & Robert Moffitt, 1998. "An Analysis of Sample Attrition in Panel Data: The Michigan Panel Study of Income Dynamics," Journal of Human Resources, University of Wisconsin Press, vol. 33(2), pages 251-299.
    20. Baum II, Charles L. & Ruhm, Christopher J., 2009. "Age, socioeconomic status and obesity growth," Journal of Health Economics, Elsevier, vol. 28(3), pages 635-648, May.
