
Using binary paradata to correct for measurement error in survey data analysis

Authors

  • Da Silva, Damião Nóbrega
  • Skinner, Chris J.
  • Kim, Jae Kwang

Abstract

Paradata refers here to data at unit level on an observed auxiliary variable, not usually of direct scientific interest, which may be informative about the quality of the survey data for the unit. There is increasing interest among survey researchers in how to use such data. Its use to reduce bias from nonresponse has received more attention so far than its use to correct for measurement error. This article considers the latter with a focus on binary paradata indicating the presence of measurement error. A motivating application concerns inference about a regression model, where earnings is a covariate measured with error and whether a respondent refers to pay records is the paradata variable. We specify a parametric model allowing for either normally or t-distributed measurement errors and discuss the assumptions required to identify the regression coefficients. We propose two estimation approaches that take account of complex survey designs: pseudo-maximum likelihood estimation and parametric fractional imputation. These approaches are assessed in a simulation study and are applied to a regression of a measure of deprivation given earnings and other covariates using British Household Panel Survey data. It is found that the proposed approach to correcting for measurement error reduces bias and improves on the precision of a simple approach based on accurate observations. We outline briefly possible extensions to uses of this approach at earlier stages in the survey process. Supplemental materials are available online.
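
As an illustration of the setting described in the abstract, the following minimal sketch simulates a covariate that is measured with error unless a binary paradata indicator flags it as accurate. It then compares a naive regression on all observations with the "simple approach based on accurate observations". It does not implement the pseudo-maximum likelihood or parametric fractional imputation estimators proposed in the paper, and it ignores survey weights and complex designs. All names and parameter values (`simulate`, `ols_slope`, `p_accurate`, `sigma_u`, the normal distributions) are assumptions chosen for the illustration, not taken from the paper.

```python
import numpy as np

def simulate(n=20000, beta0=1.0, beta1=2.0, p_accurate=0.4, sigma_u=1.0, seed=0):
    """Simulate a regression with a covariate that is error-prone unless d is True.

    All parameter values are hypothetical and chosen only for illustration.
    """
    rng = np.random.default_rng(seed)
    x = rng.normal(0.0, 1.0, n)                      # true covariate (e.g. earnings)
    y = beta0 + beta1 * x + rng.normal(0.0, 1.0, n)  # outcome
    d = rng.random(n) < p_accurate                   # paradata: True = accurate report
    u = rng.normal(0.0, sigma_u, n)                  # normal measurement error
    w = np.where(d, x, x + u)                        # observed covariate
    return y, w, d

def ols_slope(y, w):
    """Unweighted least-squares slope of y on w (with an intercept)."""
    X = np.column_stack([np.ones_like(w), w])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef[1]

y, w, d = simulate()
naive = ols_slope(y, w)                # uses every observation, error-prone or not
accurate_only = ols_slope(y[d], w[d])  # uses only units flagged as accurate

print(f"naive slope:         {naive:.3f}")
print(f"accurate-only slope: {accurate_only:.3f}")
```

Under these assumptions the naive slope is attenuated toward zero, while the accurate-only slope is close to the true value of 2 but uses only about 40% of the sample. That loss of precision is what the model-based approaches in the paper aim to recover.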

Suggested Citation

  • Da Silva, Damião Nóbrega & Skinner, Chris J. & Kim, Jae Kwang, 2016. "Using binary paradata to correct for measurement error in survey data analysis," LSE Research Online Documents on Economics 64763, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:64763

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/64763/
    File Function: Open access version.
    Download Restriction: no

    References listed on IDEAS

    1. Erich Battistin & Raffaele Miniaci & Guglielmo Weber, 2003. "What Do We Learn from Recall Consumption Data?," Journal of Human Resources, University of Wisconsin Press, vol. 38(2).
    2. Da Silva, Damião Nóbrega & Skinner, Chris J., 2014. "The use of accuracy indicators to correct for survey measurement error," LSE Research Online Documents on Economics 51256, London School of Economics and Political Science, LSE Library.
    3. Matthew Blackwell & James Honaker & Gary King, 2017. "A Unified Approach to Measurement Error and Missing Data: Overview and Applications," Sociological Methods & Research, vol. 46(3), pages 303-341, August.
    4. Yulei He & Alan M. Zaslavsky, 2009. "Combining Information from Cancer Registry and Medical Records Data to Improve Analyses of Adjuvant Cancer Therapies," Biometrics, The International Biometric Society, vol. 65(3), pages 946-952, September.
    5. Damião N. Da Silva & Chris Skinner, 2014. "The use of accuracy indicators to correct for survey measurement error," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 63(2), pages 303-319, February.

    Citations



    Cited by:

    1. Jae Kwang Kim & J.N.K. Rao & Yonghyun Kwon, 2022. "Analysis of clustered survey data based on two‐stage informative sampling and associated two‐level models," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 1522-1540, October.
    2. Bruce D. Meyer & Nikolas Mittag, 2019. "Combining Administrative and Survey Data to Improve Income Measurement," NBER Working Papers 25738, National Bureau of Economic Research, Inc.
    3. Mengli Zhang & Yang Bai, 2021. "On the use of repeated measurement errors in linear regression models," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(5), pages 779-803, July.
    4. Meyer, Bruce D. & Mittag, Nikolas, 2019. "Combining Administrative and Survey Data to Improve Income Measurement," IZA Discussion Papers 12266, Institute of Labor Economics (IZA).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Damião Nóbrega Da Silva & Chris Skinner & Jae Kwang Kim, 2016. "Using Binary Paradata to Correct for Measurement Error in Survey Data Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 526-537, April.
    2. Matthew Blackwell & James Honaker & Gary King, 2017. "A Unified Approach to Measurement Error and Missing Data: Overview and Applications," Sociological Methods & Research, vol. 46(3), pages 303-341, August.
    3. Erich Battistin & Raffaele Miniaci & Guglielmo Weber, 2003. "What Do We Learn from Recall Consumption Data?," Journal of Human Resources, University of Wisconsin Press, vol. 38(2).
    4. Dlugosz, Stephan & Mammen, Enno & Wilke, Ralf A., 2017. "Generalized partially linear regression with misclassified data and an application to labour market transitions," Computational Statistics & Data Analysis, Elsevier, vol. 110(C), pages 145-159.
    5. Meyer, Bruce D. & Mittag, Nikolas, 2019. "Combining Administrative and Survey Data to Improve Income Measurement," IZA Discussion Papers 12266, Institute of Labor Economics (IZA).
    6. Sebastian Barfort & Nikolaj Harmon & Frederik Hjorth & Asmus Leth Olsen, 2015. "Dishonesty and Selection into Public Service in Denmark: Who Runs the World’s Least Corrupt Public Sector?," Discussion Papers 15-12, University of Copenhagen. Department of Economics.
    7. repec:ebl:ecbull:v:3:y:2004:i:9:p:1-12 is not listed on IDEAS
    8. Jonathan B Slapin, 2014. "Measurement, model testing, and legislative influence in the European Union," European Union Politics, vol. 15(1), pages 24-42, March.
    9. Erich Battistin & Mario Padula, 2016. "Survey instruments and the reports of consumption expenditures: evidence from the consumer expenditure surveys," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(2), pages 559-581, February.
    10. Campos, Rodolfo G., 2013. "Measurement error and imputation of consumption in survey data," UC3M Working papers. Economics we1219, Universidad Carlos III de Madrid. Departamento de Economía.
    11. Etheridge, Ben, 2015. "A test of the household income process using consumption and wealth data," European Economic Review, Elsevier, vol. 78(C), pages 129-157.
    12. Matteo Barigozzi & Lucia Alessi & Marco Capasso & Giorgio Fagiolo, 2008. "The Distribution of Consumption-Expenditure Budget Shares. Evidence from Italian Households," LEM Papers Series 2008/18, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
    13. Niño-Zarazúa, Miguel & Chiripanhura, Blessing, 2013. "The impacts of the food, fuel and financial crises on households in Nigeria. A retrospective approach for research enquiry," MPRA Paper 47348, University Library of Munich, Germany.
    14. Martin Browning & Thomas F. Crossley & Joachim Winter, 2014. "The Measurement of Household Consumption Expenditures," Annual Review of Economics, Annual Reviews, vol. 6(1), pages 475-501, August.
    15. Rodolphe Desbordes & Gary Koop, 2014. "The Known Unknowns of Governance," Working Paper series 38_14, Rimini Centre for Economic Analysis.
    16. Michael Gideon & Brooke Helppie-McFall & Joanne W. Hsu, 2017. "Heaping at Round Numbers on Financial Questions : The Role of Satisficing," Finance and Economics Discussion Series 2017-006, Board of Governors of the Federal Reserve System (U.S.).
    17. Ton de Waal & Arnout van Delden & Sander Scholtus, 2020. "Multi‐source Statistics: Basic Situations and Methods," International Statistical Review, International Statistical Institute, vol. 88(1), pages 203-228, April.
    18. Rob Alessie & Agar Brugiavini & Guglielmo Weber, 2006. "Saving and Cohabitation: The Economic Consequences of Living with One's Parents in Italy and the Netherlands," NBER Chapters, in: NBER International Seminar on Macroeconomics 2004, pages 413-457, National Bureau of Economic Research, Inc.
    19. Tullio Jappelli & Luigi Pistaferri & Guglielmo Weber, 2007. "Health care quality, economic inequality, and precautionary saving," Health Economics, John Wiley & Sons, Ltd., vol. 16(4), pages 327-346, April.
    20. Joscha Legewie, 2018. "Living on the Edge: Neighborhood Boundaries and the Spatial Dynamics of Violent Crime," Demography, Springer;Population Association of America (PAA), vol. 55(5), pages 1957-1977, October.
    21. Kuwayama, Yusuke & Olmstead, Sheila & Zheng, Jiameng, 2022. "A more comprehensive estimate of the value of water quality," Journal of Public Economics, Elsevier, vol. 207(C).

    More about this item

    Keywords

    auxiliary survey information; complex sampling; fractional imputation; pseudo maximum likelihood

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - Econometric and Statistical Methods and Methodology: General
