IDEAS home Printed from https://ideas.repec.org/a/bla/jorssa/v181y2018i4p1211-1230.html
   My bibliography  Save this article

Correlates of record linkage and estimating risks of non‐linkage biases in business data sets

Author

Listed:
  • Jamie C. Moore
  • Peter W. F. Smith
  • Gabriele B. Durrant

Abstract

Researchers often utilize data sets that link information from multiple sources, but non‐linkage biases caused by linked and non‐linked subject differences are little understood, especially in business data sets. We address these knowledge gaps by studying biases in linkable 2010 UK Small Business Survey data sets. We identify correlates of business linkage propensity, and also for the first time its components: consent to linkage and register identifier appendability. As well, we take a novel approach to evaluating non‐linkage bias risks, by computing data set representativeness indicators (comparable, decomposable sample subset similarity measures). We find that the main impacts on linkage propensities and bias risks are due to consenter–non‐consenter differences explicable given business survey response processes, and differences between subjects with and without identifiers caused by register undercoverage of very small businesses. We then discuss consequences for the analysis of linked business data sets, and implications of the evaluation methods we introduce for linked data set producers and users.

Suggested Citation

  • Jamie C. Moore & Peter W. F. Smith & Gabriele B. Durrant, 2018. "Correlates of record linkage and estimating risks of non‐linkage biases in business data sets," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 1211-1230, October.
  • Handle: RePEc:bla:jorssa:v:181:y:2018:i:4:p:1211-1230
    DOI: 10.1111/rssa.12342
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssa.12342
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssa.12342?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Melissa Bjelland & Bruce Fallick & John Haltiwanger & Erika McEntarfer, 2011. "Employer-to-Employer Flows in the United States: Estimates Using Linked Employer-Employee Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(4), pages 493-505, October.
    2. Schouten, Barry & Shlomo, Natalie & Skinner, Chris J., 2011. "Indicators for monitoring and improving representativeness of response," LSE Research Online Documents on Economics 39121, London School of Economics and Political Science, LSE Library.
    3. Felix Ritchie, 2008. "Secure access to confidential microdata: four years of the Virtual Microdata Laboratory," Economic & Labour Market Review, Palgrave Macmillan;Office for National Statistics, vol. 2(5), pages 29-34, May.
    4. Alexander Hijzen & Richard Upward & Peter W. Wright, 2010. "Job Creation, Job Destruction and the Role of Small Firms: Firm‐Level Evidence for the UK," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 72(5), pages 621-647, October.
    5. repec:taf:jnlbes:v:30:y:2012:i:2:p:191-201 is not listed on IDEAS
    6. John J. Abowd & John Haltiwanger & Julia Lane, 2004. "Integrated Longitudinal Employer-Employee Data for the United States," American Economic Review, American Economic Association, vol. 94(2), pages 224-229, May.
    7. Timothy Dunne & J. Bradford Jensen & Mark J. Roberts, 2009. "Introduction to "Producer Dynamics: New Evidence from Micro Data"," NBER Chapters, in: Producer Dynamics: New Evidence from Micro Data, pages 1-12, National Bureau of Economic Research, Inc.
    8. Abowd, John M. & Vilhuber, Lars, 2005. "The Sensitivity of Economic Statistics to Coding Errors in Personal Identifiers," Journal of Business & Economic Statistics, American Statistical Association, vol. 23, pages 133-152, April.
    9. Robert Hayes & Catrin Omerod & Felix Ritchie, 2007. "Earnings: summary of sources and developments," Economic & Labour Market Review, Palgrave Macmillan;Office for National Statistics, vol. 1(1), pages 42-47, January.
    10. Barry Schouten & Jelke Bethlehem & Koen Beullens & Øyvin Kleven & Geert Loosveldt & Annemieke Luiten & Katja Rutar & Natalie Shlomo & Chris Skinner, 2012. "Evaluating, Comparing, Monitoring, and Improving Representativeness of Survey Response Through R-Indicators and Partial R-Indicators," International Statistical Review, International Statistical Institute, vol. 80(3), pages 382-399, December.
    11. Barry Schouten & Natalie Shlomo, 2017. "Selecting Adaptive Survey Design Strata with Partial R-indicators," International Statistical Review, International Statistical Institute, vol. 85(1), pages 143-163, April.
    12. Florian Janik & Susanne Kohaut, 2012. "Why don’t they answer? Unit non-response in the IAB establishment panel," Quality & Quantity: International Journal of Methodology, Springer, vol. 46(3), pages 917-934, April.
    13. Timothy Dunne & J. Bradford Jensen & Mark J. Roberts, 2009. "Producer Dynamics: New Evidence from Micro Data," NBER Books, National Bureau of Economic Research, Inc, number dunn05-1.
    14. Shlomo, Natalie & Skinner, Chris J. & Schouten, Barry, 2012. "Estimation of an indicator of the representativeness of survey response," LSE Research Online Documents on Economics 39124, London School of Economics and Political Science, LSE Library.
    15. Barry Schouten & Fannie Cobben & Peter Lundquist & James Wagner, 2016. "Does more balanced survey response imply less non-response bias?," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(3), pages 727-748, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Serena Pattaro & Nick Bailey & Chris Dibben, 2020. "Using Linked Longitudinal Administrative Data to Identify Social Disadvantage," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 147(3), pages 865-895, February.
    2. Valentin Reich, 2024. "Machine Learning Based Linkage of Company Data for Economic Research: Application to the EBDC Business Panels," ifo Working Paper Series 409, ifo Institute - Leibniz Institute for Economic Research at the University of Munich.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Roberts Caroline & Vandenplas Caroline & Herzing Jessica M.E., 2020. "A Validation of R-Indicators as a Measure of the Risk of Bias using Data from a Nonresponse Follow-Up Survey," Journal of Official Statistics, Sciendo, vol. 36(3), pages 675-701, September.
    2. Ian M. Schmutte, 2015. "Job Referral Networks and the Determination of Earnings in Local Labor Markets," Journal of Labor Economics, University of Chicago Press, vol. 33(1), pages 1-32.
    3. Jamie C. Moore & Gabriele B. Durrant & Peter W. F. Smith, 2021. "Do coefficients of variation of response propensities approximate non‐response biases during survey data collection?," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(1), pages 301-323, January.
    4. Fariha Kamal & C.J. Krizan, 2012. "Decomposing Aggregate Trade Flows: New Evidence from U.S. Traders," Working Papers 12-17, Center for Economic Studies, U.S. Census Bureau.
    5. Henry Hyatt & Erika McEntarfer, 2012. "Job-to-Job Flows and the Business Cycle," Working Papers 12-04, Center for Economic Studies, U.S. Census Bureau.
    6. Barry Schouten & Natalie Shlomo, 2017. "Selecting Adaptive Survey Design Strata with Partial R-indicators," International Statistical Review, International Statistical Institute, vol. 85(1), pages 143-163, April.
    7. Abowd, John M. & Vilhuber, Lars, 2011. "National estimates of gross employment and job flows from the Quarterly Workforce Indicators with demographic and industry detail," Journal of Econometrics, Elsevier, vol. 161(1), pages 82-99, March.
    8. Thais Paiva & Jerry Reiter, 2014. "Using Imputation Techniques To Evaluate Stopping Rules In Adaptive Survey Design," Working Papers 14-40, Center for Economic Studies, U.S. Census Bureau.
    9. Ahn, JaeBin & Choi, Moon Jung, 2020. "From firm-level imports to aggregate productivity: Evidence from Korean manufacturing firm data," Japan and the World Economy, Elsevier, vol. 56(C).
    10. Barth, Erling & Davis, James C. & Freeman, Richard B. & McElheran, Kristina, 2023. "Twisting the demand curve: Digitalization and the older workforce," Journal of Econometrics, Elsevier, vol. 233(2), pages 443-467.
    11. José Fillat & Stefania Garetto & Arthur V. Smith, 2018. "What are the consequences of global banking for the international transmission of shocks?: a quantitative analysis," Working Papers 18-11, Federal Reserve Bank of Boston.
    12. Jose L. Fillat & Stefania Garetto & Arthur V. Smith, 2018. "What are the consequences of global banking for the international transmission of shocks? A quantitative analysis∗," Boston University - Department of Economics - The Institute for Economic Development Working Papers Series dp-303, Boston University - Department of Economics, revised Oct 2018.
    13. Kevin L. McKinney & John M. Abowd & John Sabelhaus, 2021. "United States Earnings Dynamics: Inequality, Mobility, and Volatility," NBER Chapters, in: Measuring Distribution and Mobility of Income and Wealth, pages 69-104, National Bureau of Economic Research, Inc.
    14. Justin Pierce & Peter Schott, 2009. "A Concordance Between Ten-Digit U.S. Harmonized System Codes and SIC/NAICS Product Classes and Industries," Working Papers 09-41, Center for Economic Studies, U.S. Census Bureau.
    15. Carl-Erik Särndal & Imbi Traat & Kaur Lumiste, 2018. "Interaction Between Data Collection And Estimation Phases In Surveys With Nonresponse," Statistics in Transition New Series, Polish Statistical Association, vol. 19(2), pages 183-200, June.
    16. David Card & Jesse Rothstein & Moises Yi, 2021. "Location, Location, Location," Working Papers 21-32, Center for Economic Studies, U.S. Census Bureau.
    17. Manova, Kalina & Yu, Zhihong, 2017. "Multi-product firms and product quality," Journal of International Economics, Elsevier, vol. 109(C), pages 116-137.
    18. Gerard Hoberg & S. Katie Moon, 2019. "The Offshoring Return Premium," Management Science, INFORMS, vol. 67(6), pages 2876-2899, June.
    19. John Haltiwanger & Henry Hyatt & Erika McEntarfer, 2015. "Cyclical Reallocation of Workers Across Employers by Firm Size and Firm Wage," NBER Working Papers 21235, National Bureau of Economic Research, Inc.
    20. Andrew B. Bernard & Stephen J. Redding & Peter K. Schott, 2011. "Multiproduct Firms and Trade Liberalization," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 126(3), pages 1271-1318.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssa:v:181:y:2018:i:4:p:1211-1230. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.