Gradient Boosting to Address Statistical Problems Arising from Non-Linkage of Census Bureau Datasets

Authors

  • Matthew Cefalu
  • John Sullivan
  • Narayan Sastry
  • Elizabeth Fussell
  • Todd Gardner

Abstract

This article introduces the twangRDC package, which contains functions to address non-linkage in US Census Bureau datasets. The Census Bureau's Person Identification Validation System facilitates data linkage by assigning unique person identifiers to federal, third-party, decennial census, and survey data. Not all records in these datasets can be linked to the reference file, so not all records are assigned an identifier. This article is a tutorial on using twangRDC to generate nonresponse weights that account for non-linkage of person records across US Census Bureau datasets.
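
As a rough illustration of what a nonresponse weight for non-linkage does, the sketch below gives each linked record a weight equal to the inverse of the linkage rate in its group, so that linked records stand in for the unlinked ones. It is not the twangRDC API and not the authors' method: the function name `linkage_weights`, the `cell` and `linked` fields, and the toy records are all hypothetical, and a simple within-cell linkage rate replaces the gradient-boosted linkage probability model the paper describes.

```python
from collections import defaultdict


def linkage_weights(records, cell_key="cell", linked_key="linked"):
    """Return one weight per record (hypothetical helper, not twangRDC).

    Linked records get 1 / (linkage rate in their cell); unlinked
    records get 0 because they drop out of any linked analysis.
    """
    totals = defaultdict(int)   # number of records per cell
    linked = defaultdict(int)   # number of linked records per cell
    for r in records:
        totals[r[cell_key]] += 1
        if r[linked_key]:
            linked[r[cell_key]] += 1

    weights = []
    for r in records:
        if not r[linked_key]:
            weights.append(0.0)
        else:
            # A linked record exists in this cell, so the rate is > 0.
            rate = linked[r[cell_key]] / totals[r[cell_key]]
            weights.append(1.0 / rate)
    return weights


# Toy data (assumed): cell A links 2 of 4 records, cell B links 2 of 2.
records = [
    {"cell": "A", "linked": True},
    {"cell": "A", "linked": True},
    {"cell": "A", "linked": False},
    {"cell": "A", "linked": False},
    {"cell": "B", "linked": True},
    {"cell": "B", "linked": True},
]
weights = linkage_weights(records)
print(weights)
```

In this toy case the weights of the linked records sum to the size of the full file, which is the property such weights are meant to restore. A model-based approach estimates the linkage probability from many covariates at once rather than from a single grouping variable.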

Suggested Citation

  • Matthew Cefalu & John Sullivan & Narayan Sastry & Elizabeth Fussell & Todd Gardner, 2024. "Gradient Boosting to Address Statistical Problems Arising from Non-Linkage of Census Bureau Datasets," Working Papers 24-27, Center for Economic Studies, U.S. Census Bureau.
  • Handle: RePEc:cen:wpaper:24-27

    Download full text from publisher

    File URL: https://www2.census.gov/library/working-papers/2024/adrm/ces/CES-WP-24-27.pdf
    File Function: First version, 2024
    Download Restriction: no

    References listed on IDEAS

    1. Deborah Wagner & Mary Layne, 2014. "The Person Identification Validation System (PVS): Applying the Center for Administrative Records Research and Applications' (CARRA) Record Linkage Software," CARRA Working Papers 2014-01, Center for Economic Studies, U.S. Census Bureau.

    Citations


    Cited by:

    1. Jennifer R. Withrow & Kendall A. Houghton & Eva Lyubich & Mary Munro & Suvy Qin & John L. Voorheis, 2024. "The Census Historical Environmental Impacts Frame," Working Papers 24-66, Center for Economic Studies, U.S. Census Bureau.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. John Carter Braxton & Kyle F. Herkenhoff & Jonathan Rothbaum & Lawrence Schmidt, 2021. "Changing Income Risk across the US Skill Distribution: Evidence from a Generalized Kalman Filter," Opportunity and Inclusive Growth Institute Working Papers 55, Federal Reserve Bank of Minneapolis.
    2. Illenin Kondo & Kevin Rinz & Natalie Gubbay & Brandon Hawkins & John Voorheis & Abigail Wozniak, 2024. "Granular Income Inequality and Mobility Using IDDA: Exploring Patterns across Race and Ethnicity," NBER Chapters, in: Race, Ethnicity, and Economic Statistics for the 21st Century, National Bureau of Economic Research, Inc.
    3. Ufuk Akcigit & Nathan Goldschlag, 2022. "Measuring the Characteristics and Employment Dynamics of U.S. Inventors," Working Papers 22-43, Center for Economic Studies, U.S. Census Bureau.
    4. Nicholas Jones & Eric Jensen & Karen Battle & Rachel Marks, 2024. "Measuring the Racial and Ethnic Composition and Diversity of the United States Population: Historical Challenges and Contemporary Opportunities," NBER Chapters, in: Race, Ethnicity, and Economic Statistics for the 21st Century, National Bureau of Economic Research, Inc.
    5. Robert Collinson & John Eric Humphries & Nicholas Mader & Davin Reed & Daniel Tannenbaum & Winnie van Dijk, 2024. "Eviction and Poverty in American Cities," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 139(1), pages 57-120.
    6. Kevin Rinz, 2022. "Did Timing Matter? Life Cycle Differences in Effects of Exposure to the Great Recession," Journal of Labor Economics, University of Chicago Press, vol. 40(3), pages 703-735.
    7. Richard Blundell & Christopher R. Bollinger & Charles Hokayem & James P. Ziliak, 2024. "Interpreting Cohort Profiles of Lifecycle Earnings Volatility," Working Papers 24-21, Center for Economic Studies, U.S. Census Bureau.
    8. Kevin L. McKinney & John M. Abowd, 2024. "Estimating the Potential Impact of Combined Race and Ethnicity Reporting on Long-Term Earnings Statistics," NBER Chapters, in: Race, Ethnicity, and Economic Statistics for the 21st Century, National Bureau of Economic Research, Inc.
    9. Bruce D. Meyer & Derek Wu & Victoria R. Mooers & Carla Medalia, 2019. "The Use and Misuse of Income Data and Extreme Poverty in the United States," NBER Working Papers 25907, National Bureau of Economic Research, Inc.
    10. Jonathan M. Colmer & John L. Voorheis, 2024. "Microdata and the Valuation of Natural Capital," NBER Chapters, in: Measuring and Accounting for Environmental Public Goods: A National Accounts Perspective, National Bureau of Economic Research, Inc.
    11. Mary Layne & Deborah Wagner & Cynthia Rothhaas, 2014. "Estimating Record Linkage False Match Rate for the Person Identification Validation System," CARRA Working Papers 2014-02, Center for Economic Studies, U.S. Census Bureau.
    12. Jonathan Colmer & John Voorheis, 2020. "The Grandkids Aren't Alright: The Intergenerational Effects of Prenatal Pollution Exposure," Working Papers 20-36, Center for Economic Studies, U.S. Census Bureau.
    13. Javier Miranda & Nikolas Zolas, 2017. "Measuring the Impact of Household Innovation Using Administrative Data," NBER Chapters, in: Measuring and Accounting for Innovation in the Twenty-First Century, pages 61-102, National Bureau of Economic Research, Inc.
    14. Meyer, Bruce D. & Mittag, Nikolas, 2019. "An Empirical Total Survey Error Decomposition Using Data Combination," IZA Discussion Papers 12151, Institute of Labor Economics (IZA).
    15. Meyer, Bruce D. & Mittag, Nikolas, 2021. "An empirical total survey error decomposition using data combination," Journal of Econometrics, Elsevier, vol. 224(2), pages 286-305.
    16. Ariel J. Binder & Amanda Eng & Kendall Houghton & Andrew Foote, 2023. "The Gender Pay Gap and Its Determinants Across the Human Capital Distribution," Working Papers 23-31, Center for Economic Studies, U.S. Census Bureau.
    17. Mark A. Leach & Jennifer Van Hook & James D. Bachmeier, 2018. "Using Linked Data to Investigate True Intergenerational Change: Three Generations Over Seven Decades," CARRA Working Papers 2018-09, Center for Economic Studies, U.S. Census Bureau.
    18. Hellerstein, Judith K. & Kutzbach, Mark J. & Neumark, David, 2019. "Labor market networks and recovery from mass layoffs: Evidence from the Great Recession period," Journal of Urban Economics, Elsevier, vol. 113(C).
    19. Michael Mueller-Smith & Benjamin Pyle & Caroline Walker, 2023. "Estimating the Impact of the Age of Criminal Majority: Decomposing Multiple Treatments in a Regression Discontinuity Framework," Working Papers 23-01, Center for Economic Studies, U.S. Census Bureau.
    20. Joshua D. Gottlieb & Maria Polyakova & Kevin Rinz & Hugh Shiplett & Victoria Udalova, 2020. "Who Values Human Capitalists' Human Capital? Healthcare Spending and Physician Earnings," Working Papers 20-23, Center for Economic Studies, U.S. Census Bureau.
