Linked samples and measurement error in historical US census data

Authors

  • Hwang, Sam Il Myoung
  • Squires, Munir

Abstract

The quality of historical US census data is critical to the performance of linking algorithms. We use genealogical profiles to correct measurement error in census names and ages. Our findings suggest that one in every two records has an error in name or age, and human capital is correlated with lower error rates. While errors in age decline across subsequent census rounds from 1850 to 1930, errors in names do not exhibit such trends. Fixing all transcription errors, hence leaving only those errors made at the time of enumeration, would reduce error rates in names by 41 percent. Correcting all names and ages using genealogical profiles leads to 20%–36% more links and fewer false positives. Reassuringly, we find that reducing such errors has a negligible effect on estimates of intergenerational mobility.

Suggested Citation

  • Hwang, Sam Il Myoung & Squires, Munir, 2024. "Linked samples and measurement error in historical US census data," Explorations in Economic History, Elsevier, vol. 93(C).
  • Handle: RePEc:eee:exehis:v:93:y:2024:i:c:s0014498324000093
    DOI: 10.1016/j.eeh.2024.101579

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0014498324000093
    Download Restriction: Full text for ScienceDirect subscribers only


    More about this item

    Keywords

    Intergenerational mobility; Measurement error; Data linkage

    JEL classification:

    • J62 - Labor and Demographic Economics - Mobility, Unemployment, Vacancies, and Immigrant Workers - Job, Occupational and Intergenerational Mobility; Promotion
    • C55 - Mathematical and Quantitative Methods - Econometric Modeling - Large Data Sets: Modeling and Analysis
    • N00 - Economic History - General - General
