IDEAS home Printed from https://ideas.repec.org/p/cge/wacage/568.html
   My bibliography  Save this paper

The Problem of False Positives in Automated Census Linking: Evidence from Nineteenth-Century New York's Irish Immigrants

Author

Listed:
  • Anbinder, Tyler

    (George Washington University)

  • Connor, Dylan

    (Arizona State University)

  • O Grada, Cormac

    (University College, Dublin)

  • Wegge, Simone

    (College of Staten Island and The Graduate Center—CUNY)

Abstract

Automated census linkage algorithms have become popular for generating longitudinal data on social mobility, especially for immigrants and their children. But what if these algorithms are particularly bad at tracking immigrants? Using nineteenth-century Irish immigrants as a test case, we examine the most popular of these algorithms—that created by Abramitzky, Boustan, Eriksson (ABE), and their collaborators. Our findings raise serious questions about the quality of automated census links. False positives range from about one-third to one-half of all links depending on the ABE variant used. These bad links lead to sizeable estimation errors when measuring Irish immigrant social mobility.

Suggested Citation

  • Anbinder, Tyler & Connor, Dylan & O Grada, Cormac & Wegge, Simone, 2021. "The Problem of False Positives in Automated Census Linking: Evidence from Nineteenth-Century New York's Irish Immigrants," CAGE Online Working Paper Series 568, Competitive Advantage in the Global Economy (CAGE).
  • Handle: RePEc:cge:wacage:568
    as

    Download full text from publisher

    File URL: https://warwick.ac.uk/fac/soc/economics/research/centres/cage/manage/publications/wp568.2021.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Catherine G. Massey, 2017. "Playing with matches: An assessment of accuracy in linked historical data," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 50(3), pages 129-143, July.
    2. Kosack, Edward & Ward, Zachary, 2020. "El Sueño Americano? The Generational Progress of Mexican Americans Prior to World War II," The Journal of Economic History, Cambridge University Press, vol. 80(4), pages 961-995, December.
    3. Herscovici, Steven, 1997. "Progress Amid Poverty: Economic Opportunity in Antebellum Newburyport," The Journal of Economic History, Cambridge University Press, vol. 57(2), pages 484-488, June.
    4. Ran Abramitzky & Leah Platt Boustan & Katherine Eriksson, 2012. "Europe's Tired, Poor, Huddled Masses: Self-Selection and Economic Outcomes in the Age of Mass Migration," American Economic Review, American Economic Association, vol. 102(5), pages 1832-1856, August.
    5. Simone A. Wegge & Tyler Anbinder & Cormac Ó Gráda, 2017. "Immigrants and savers: A rich new database on the Irish in 1850s New York," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 50(3), pages 144-155, July.
    6. A'Hearn, Brian & Baten, Jörg & Crayen, Dorothee, 2009. "Quantifying Quantitative Literacy: Age Heaping and the History of Human Capital," The Journal of Economic History, Cambridge University Press, vol. 69(3), pages 783-808, September.
    7. Philipp Ager & Leah Boustan & Katherine Eriksson, 2021. "The Intergenerational Effects of a Large Wealth Shock: White Southerners after the Civil War," American Economic Review, American Economic Association, vol. 111(11), pages 3767-3794, November.
    8. Zachary Ward, 2023. "Intergenerational Mobility in American History: Accounting for Race and Measurement Error," American Economic Review, American Economic Association, vol. 113(12), pages 3213-3248, December.
    9. Matthias Blum & Christopher L. Colvin & Laura McAtackney & Eoin McLaughlin, 2017. "Women of an uncertain age: quantifying human capital accumulation in rural Ireland in the nineteenth century," Economic History Review, Economic History Society, vol. 70(1), pages 187-223, February.
    10. Martin Dribe & J. David Hacker & Francesco Scalone, 2014. "The impact of socio-economic status on net fertility during the historical fertility decline: A comparative analysis of Canada, Iceland, Sweden, Norway, and the USA," Population Studies, Taylor & Francis Journals, vol. 68(2), pages 135-149, July.
    11. Chris Vickers & Nicolas L. Ziebarth, 2016. "Economic Development and the Demographics of Criminals in Victorian England," Journal of Law and Economics, University of Chicago Press, vol. 59(1), pages 191-223.
    12. Cirenza, Peter, 2015. "Geography and assimilation: a case study of Irish immigrants in late nineteenth century America," Economic History Working Papers 60964, London School of Economics and Political Science, Department of Economic History.
    13. Mokyr, Joel & Grada, Cormac O, 1982. "Emigration and poverty in prefamine Ireland," Explorations in Economic History, Elsevier, vol. 19(4), pages 360-384, October.
    14. Ran Abramitzky & Roy Mill & Santiago Pérez, 2020. "Linking individuals across historical sources: A fully automated approach," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 53(2), pages 94-111, April.
    15. Alter, George & Goldin, Claudia & Rotella, Elyce, 1994. "The Savings of Ordinary Americans: The Philadelphia Saving Fund Society in the Mid-Nineteenth Century," The Journal of Economic History, Cambridge University Press, vol. 54(4), pages 735-767, December.
    16. Martha J. Bailey & Connor Cole & Morgan Henderson & Catherine Massey, 2020. "How Well Do Automated Linking Methods Perform? Lessons from US Historical Data," Journal of Economic Literature, American Economic Association, vol. 58(4), pages 997-1044, December.
    17. Price, Joseph & Buckles, Kasey & Van Leeuwen, Jacob & Riley, Isaac, 2021. "Combining family history and machine learning to link historical records: The Census Tree data set," Explorations in Economic History, Elsevier, vol. 80(C).
    18. Herscovici, Steven, 1998. "Migration and Economic Mobility: Wealth Accumulation and Occupational Change Among Antebellum Migrants and Persisters," The Journal of Economic History, Cambridge University Press, vol. 58(4), pages 927-956, December.
    19. Cormac Ó Gráda, 2005. "The New York Irish in the 1850s : locked in by poverty?," Open Access publications 10197/489, School of Economics, University College Dublin.
    20. Eugene N. White & Cormac Ó Gráda, 2003. "The panics of 1854 and 1857 : a view from the Emigration Industrial Savings Bank," Open Access publications 10197/438, School of Economics, University College Dublin.
    21. Marco Breschi & Massimo Esposito & Stanislao Mazzoni & Lucia Pozzi, 2014. "Fertility transition and social stratification in the town of Alghero, Sardinia (1866-1935)," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 30(28), pages 823-852.
    22. Ó Gráda, Cormac & White, Eugene N., 2003. "The Panics of 1854 and 1857: A View from the Emigrant Industrial Savings Bank," The Journal of Economic History, Cambridge University Press, vol. 63(1), pages 213-240, March.
    23. Dorothee Crayen & Joerg Baten, 2010. "New evidence and new methods to measure human capital inequality before and during the industrial revolution: France and the US in the seventeenth to nineteenth centuries," Economic History Review, Economic History Society, vol. 63(2), pages 452-478, May.
    24. Jason Long & Joseph Ferrie, 2018. "Grandfathers Matter(ed): Occupational Mobility Across Three Generations in the US and Britain, 1850–1911," Economic Journal, Royal Economic Society, vol. 128(612), pages 422-445, July.
    25. Cormac O Grada & Morgan Kelly, 2000. "Market Contagion: Evidence from the Panics of 1854 and 1857," American Economic Review, American Economic Association, vol. 90(5), pages 1110-1124, December.
    26. Connor, Dylan Shane, 2019. "The Cream of the Crop? Geography, Networks, and Irish Migrant Selection in the Age of Mass Migration," The Journal of Economic History, Cambridge University Press, vol. 79(1), pages 139-175, March.
    27. Aaronson, Daniel & Davis, Jonathan & Schulze, Karl, 2020. "Internal immigrant mobility in the early 20th century: evidence from Galveston, Texas," Explorations in Economic History, Elsevier, vol. 76(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zimran, Ariell, 2022. "US immigrants’ secondary migration and geographic assimilation during the Age of Mass Migration," Explorations in Economic History, Elsevier, vol. 85(C).
    2. Dora Costa & CoraLee Lewis & Noelle Yetter, 2022. "Children and Grandchildren of Union Army Veterans: New Data Collections to Study the Persistence of Longevity and Socioeconomic Status Across Generations," NBER Working Papers 30747, National Bureau of Economic Research, Inc.
    3. Zhu, Ziming, 2022. "Like father like son? Intergenerational immobility in England, 1851-1911," Economic History Working Papers 117588, London School of Economics and Political Science, Department of Economic History.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Collins, William J. & Zimran, Ariell, 2019. "The economic assimilation of Irish Famine migrants to the United States," Explorations in Economic History, Elsevier, vol. 74(C).
    2. Hwang, Sam Il Myoung & Squires, Munir, 2024. "Linked samples and measurement error in historical US census data," Explorations in Economic History, Elsevier, vol. 93(C).
    3. Dahl, Christian M. & Johansen, Torben S.D. & Sørensen, Emil N. & Wittrock, Simon, 2023. "HANA: A handwritten name database for offline handwritten text recognition," Explorations in Economic History, Elsevier, vol. 87(C).
    4. Zachary Ward, 2023. "Intergenerational Mobility in American History: Accounting for Race and Measurement Error," American Economic Review, American Economic Association, vol. 113(12), pages 3213-3248, December.
    5. Simone A. Wegge & Tyler Anbinder & Cormac Ó Gráda, 2017. "Immigrants and savers: A rich new database on the Irish in 1850s New York," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 50(3), pages 144-155, July.
    6. Lehmann-Hasemeyer, Sibylle & Neumayer, Andreas & Streb, Jochen, 2023. "Heterogeneous inflation and deflation experiences and savings decisions during German industrialization," Journal of Banking & Finance, Elsevier, vol. 154(C).
    7. Ran Abramitzky & Roy Mill & Santiago Pérez, 2020. "Linking individuals across historical sources: A fully automated approach," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 53(2), pages 94-111, April.
    8. Mark Egan & Ali Hortaçsu & Gregor Matvos, 2017. "Deposit Competition and Financial Fragility: Evidence from the US Banking Sector," American Economic Review, American Economic Association, vol. 107(1), pages 169-216, January.
    9. Kiss, Hubert J. & Rodriguez-Lara, Ismael & Rosa-Garcia, Alfonso, 2014. "Do women panic more than men? An experimental study of financial decisions," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 52(C), pages 40-51.
    10. Bennett, Robert J. & Montebruno, Piero & Van Lieshout, Carry & Smith, Harry, 2022. "Business entry and exit: career changes of proprietors in England and Wales (1851-81) using record-linkage," LSE Research Online Documents on Economics 113867, London School of Economics and Political Science, LSE Library.
    11. Lehmann-Hasemeyer, Sibylle H. & Neumayer, Andreas & Streb, Jochen, 2022. "Heterogeneous savers and their inflation expectation during German industrialization: Social class, wealth, and gender," Working Papers 33, German Research Foundation's Priority Programme 1859 "Experience and Expectation. Historical Foundations of Economic Behaviour", Humboldt University Berlin.
    12. Elisa Jácome & Ilyana Kuziemko & Suresh Naidu, 2021. "Mobility for All: Representative Intergenerational Mobility Estimates over the 20th Century," Working Papers 302, Princeton University, Department of Economics, Center for Economic Policy Studies..
    13. Hubert J. Kiss & Ismael Rodriguez-Lara & Alfonso Rosa-Garcia, 2022. "Experimental bank runs," Chapters, in: Sascha Füllbrunn & Ernan Haruvy (ed.), Handbook of Experimental Finance, chapter 25, pages 347-361, Edward Elgar Publishing.
      • Hubert J. Kiss & Ismael Rodriguez-Lara & Alfonso Rosa-Garcia, 2021. "Experimental Bank Runs," ThE Papers 21/03, Department of Economic Theory and Economic History of the University of Granada..
    14. Beltrán Tapia, Francisco J. & Díez-Minguela, Alfonso & Martinez-Galarraga, Julio & Tirado-Fabregat, Daniel A., 2022. "Two Stories, One Fate: Age-Heaping And Literacy In Spain, 1877-1930," Revista de Historia Económica / Journal of Iberian and Latin American Economic History, Cambridge University Press, vol. 40(3), pages 405-438, December.
    15. Howard Bodenhorn, 2017. "Finance and Growth: Household Savings, Public Investment, and Public Health in Late Nineteenth-Century New Jersey," NBER Working Papers 23430, National Bureau of Economic Research, Inc.
    16. Hubert Janos Kiss & Ismael Rodriguez-Lara & Alfonso Rosa-Garcia, 2018. "Who runs first to the bank?," CERS-IE WORKING PAPERS 1826, Institute of Economics, Centre for Economic and Regional Studies.
    17. Martin Dribe & Björn Eriksson & Jonas Helgertz, 2023. "From Sweden to America: migrant selection in the transatlantic migration, 1890–1910," European Review of Economic History, European Historical Economics Society, vol. 27(1), pages 24-44.
    18. Julián Costas-Fernández & José-Alberto Guerra & Myra Mohnen, 2020. "Train to Opportunity: the Effect of Infrastructure on Intergenerational Mobility," Documentos CEDE 18591, Universidad de los Andes, Facultad de Economía, CEDE.
    19. Florian Schaffner, 2015. "Predicting US bank failures with internet search volume data," ECON - Working Papers 214, Department of Economics - University of Zurich.
    20. Escamilla-Guerrero, David & Kosack, Edward & Ward, Zachary, 2021. "Life after crossing the border: Assimilation during the first Mexican mass migration," Explorations in Economic History, Elsevier, vol. 82(C).

    More about this item

    JEL classification:

    • N21 - Economic History - - Financial Markets and Institutions - - - U.S.; Canada: Pre-1913
    • J61 - Labor and Demographic Economics - - Mobility, Unemployment, Vacancies, and Immigrant Workers - - - Geographic Labor Mobility; Immigrant Workers
    • R23 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - Household Analysis - - - Regional Migration; Regional Labor Markets; Population

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cge:wacage:568. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Jane Snape (email available below). General contact details of provider: https://edirc.repec.org/data/dewaruk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.