IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v17y2020i18p6937-d417575.html
   My bibliography  Save this article

An Introduction to Probabilistic Record Linkage with a Focus on Linkage Processing for WTC Registries

Author

Listed:
  • Jana Asher

    (Department of Mathematics and Statistics, Slippery Rock University, Slippery Rock, PA 16057, USA)

  • Dean Resnick

    (National Opinion Research Center, Boston, MA 02114, USA)

  • Jennifer Brite

    (Division of Epidemiology, New York City Department of Health and Mental Hygiene, World Trade Center Health Registry, New York, NY 11101, USA)

  • Robert Brackbill

    (Division of Epidemiology, New York City Department of Health and Mental Hygiene, World Trade Center Health Registry, New York, NY 11101, USA)

  • James Cone

    (Division of Epidemiology, New York City Department of Health and Mental Hygiene, World Trade Center Health Registry, New York, NY 11101, USA)

Abstract

Since its post-World War II inception, the science of record linkage has grown exponentially and is used across industrial, governmental, and academic agencies. The academic fields that rely on record linkage are diverse, ranging from history to public health to demography. In this paper, we introduce the different types of data linkage and give a historical context to their development. We then introduce the three types of underlying models for probabilistic record linkage: Fellegi-Sunter-based methods, machine learning methods, and Bayesian methods. Practical considerations, such as data standardization and privacy concerns, are then discussed. Finally, recommendations are given for organizations developing or maintaining record linkage programs, with an emphasis on organizations measuring long-term complications of disasters, such as 9/11.

Suggested Citation

  • Jana Asher & Dean Resnick & Jennifer Brite & Robert Brackbill & James Cone, 2020. "An Introduction to Probabilistic Record Linkage with a Focus on Linkage Processing for WTC Registries," IJERPH, MDPI, vol. 17(18), pages 1-16, September.
  • Handle: RePEc:gam:jijerp:v:17:y:2020:i:18:p:6937-:d:417575
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/17/18/6937/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/17/18/6937/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Rebecca C. Steorts & Rob Hall & Stephen E. Fienberg, 2016. "A Bayesian Approach to Graphical Record Linkage and Deduplication," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1660-1672, October.
    2. Lariscy, Joseph, 2011. "Differential Record Linkage by Hispanic Ethnicity and Age in Linked Mortality Studies: Implications for the Epidemiologic Paradox," SocArXiv tw9a4, Center for Open Science.
    3. Alan F Karr & Matthew T Taylor & Suzanne L West & Soko Setoguchi & Tzuyung D Kou & Tobias Gerhard & Daniel B Horton, 2019. "Comparing record linkage software programs and algorithms using real-world data," PLOS ONE, Public Library of Science, vol. 14(9), pages 1-16, September.
    4. Joseph T. Lariscy, 2017. "Black–White Disparities in Adult Mortality: Implications of Differential Record Linkage for Understanding the Mortality Crossover," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 36(1), pages 137-156, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. James E. Cone & Albeliz Santiago-Colón & Roberto Lucchini, 2021. "9/11 Health Update," IJERPH, MDPI, vol. 18(12), pages 1-5, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mara Sheftel & Frank W. Heiland, 2018. "Disability crossover: Is there a Hispanic immigrant health advantage that reverses from working to old age?," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 39(7), pages 209-250.
    2. Brown, Dustin C, 2019. "Comparability of Mortality Estimates from Social Surveys and Vital Statistics Data in the United States," SocArXiv x9f5y, Center for Open Science.
    3. Dustin C. Brown & Joseph T. Lariscy & Lucie Kalousová, 2019. "Comparability of Mortality Estimates from Social Surveys and Vital Statistics Data in the United States," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 38(3), pages 371-401, June.
    4. Jennifer Karas Montez & Kaitlyn Barnes, 2016. "The Benefits of Educational Attainment for U.S. Adult Mortality: Are they Contingent on the Broader Environment?," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 35(1), pages 73-100, February.
    5. Mark Hayward & Robert Hummer & Chi-Tsun Chiu & César González-González & Rebeca Wong, 2014. "Does the Hispanic Paradox in U.S. Adult Mortality Extend to Disability?," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 33(1), pages 81-96, February.
    6. Dan A. Black & Yu-Chieh Hsu & Seth G. Sanders & Lynne Steuerle Schofield & Lowell J. Taylor, 2017. "The Methuselah Effect: The Pernicious Impact of Unreported Deaths on Old-Age Mortality Estimates," Demography, Springer;Population Association of America (PAA), vol. 54(6), pages 2001-2024, December.
    7. Andrew Fenelon, 2017. "Rethinking the Hispanic Paradox: The Mortality Experience of Mexican Immigrants in Traditional Gateways and New Destinations," International Migration Review, Wiley Blackwell, vol. 51(3), pages 567-599, September.
    8. Vo, Thanh Huan & Chauvet, Guillaume & Happe, André & Oger, Emmanuel & Paquelet, Stéphane & Garès, Valérie, 2023. "Extending the Fellegi-Sunter record linkage model for mixed-type data with application to the French national health data system," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    9. Angelo Moretti & Natalie Shlomo, 2023. "Improving Probabilistic Record Linkage Using Statistical Prediction Models," International Statistical Review, International Statistical Institute, vol. 91(3), pages 368-394, December.
    10. Betancourt, Brenda & Sosa, Juan & Rodríguez, Abel, 2022. "A prior for record linkage based on allelic partitions," Computational Statistics & Data Analysis, Elsevier, vol. 172(C).
    11. Daniel H. Weinberg & John M. Abowd & Robert F. Belli & Noel Cressie & David C. Folch & Scott H. Holan & Margaret C. Levenstein & Kristen M. Olson & Jerome P. Reiter & Matthew D. Shapiro & Jolene Smyth, 2017. "Effects of a Government-Academic Partnership: Has the NSF-Census Bureau Research Network Helped Improve the U.S. Statistical System?," Working Papers 17-59r, Center for Economic Studies, U.S. Census Bureau.
    12. Beręsewicz Maciej, 2019. "Correlates of Representation Errors in Internet Data Sources for Real Estate Market," Journal of Official Statistics, Sciendo, vol. 35(3), pages 509-529, September.
    13. Julia Holmes & Anne Driscoll & Melonie Heron, 2015. "Mortality among US-born and immigrant Hispanics in the US: effects of nativity, duration of residence, and age at immigration," International Journal of Public Health, Springer;Swiss School of Public Health (SSPH+), vol. 60(5), pages 609-617, July.
    14. Bera Sabyasachi & Chatterjee Snigdhansu, 2020. "High dimensional, robust, unsupervised record linkage," Statistics in Transition New Series, Statistics Poland, vol. 21(4), pages 123-143, August.
    15. John M. Abowd & Joelle Abramowitz & Margaret C. Levenstein & Kristin McCue & Dhiren Patki & Trivellore Raghunathan & Ann M. Rodgers & Matthew D. Shapiro & Nada Wasi & Dawn Zinsser, 2021. "Finding Needles in Haystacks: Multiple-Imputation Record Linkage Using Machine Learning," Working Papers 21-35, Center for Economic Studies, U.S. Census Bureau.
    16. Joseph T. Lariscy, 2017. "Black–White Disparities in Adult Mortality: Implications of Differential Record Linkage for Understanding the Mortality Crossover," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 36(1), pages 137-156, February.
    17. Duncan Smith, 2020. "Re‐identification in the Absence of Common Variables for Matching," International Statistical Review, International Statistical Institute, vol. 88(2), pages 354-379, August.
    18. Justin T. Denney & Jarron M. Saint Onge & Jeff A. Dennis, 2018. "Neighborhood Concentrated Disadvantage and Adult Mortality: Insights for Racial and Ethnic Differences," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 37(2), pages 301-321, April.
    19. Sabyasachi Bera & Snigdhansu Chatterjee, 2020. "High dimensional, robust, unsupervised record linkage," Statistics in Transition New Series, Polish Statistical Association, vol. 21(4), pages 123-143, August.
    20. Donnelly, Rachel & Umberson, Debra & Hummer, Robert A. & Garcia, Michael A., 2020. "Race, death of a child, and mortality risk among aging parents in the United States," Social Science & Medicine, Elsevier, vol. 249(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:17:y:2020:i:18:p:6937-:d:417575. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.