IDEAS home Printed from https://ideas.repec.org/a/dem/demres/v51y2024i41.html
   My bibliography  Save this article

Using online genealogical data for demographic research: An empirical examination of the FamiLinx database

Author

Listed:
  • Andrea Colasurdo

    (Max-Planck-Institut für Demografische Forschung)

  • Riccardo Omenti

    (Università di Bologna (UNIBO))

Abstract

Background: Online genealogies are promising data sources for demographic research, but their limitations are understudied. This paper takes a critical approach to evaluating the potential strengths and weaknesses of using online genealogical data for population studies. We focus on the FamiLinx dataset, which contains demographic information and kinship ties across multiple countries and centuries. Objective: We propose novel measures to assess the completeness and the quality of demographic variables in the FamiLinx data at both the individual and the familial level over the 1600–1900 period. Utilizing Sweden as a test country, we investigate how the age–sex distribution and the mortality levels of the digital population extracted from FamiLinx diverge from the registered population. Methods: We employ descriptive statistics, negative binomial regression modeling, and standard life table techniques for our measures of completeness and quality. Results: Missing values and accuracy in demographic information from FamiLinx are selective. When one demographic variable is available, researchers can effectively anticipate the availability of other demographic information. The completeness and quality of demographic variables within kinship networks are markedly higher for individuals with more complete and accurate demographic information. Populations from FamiLinx display lower mortality levels than the registered population and their representativeness improves towards the end of the 19th century. Contribution: This study sheds new light on the opportunities and challenges of harnessing online genealogies for demographic research. Although this data source offers much promise, its usability in population studies is dependent on the quality and completeness of its recorded demographic information and their selectivity.

Suggested Citation

  • Andrea Colasurdo & Riccardo Omenti, 2024. "Using online genealogical data for demographic research: An empirical examination of the FamiLinx database," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 51(41), pages 1299-1350.
  • Handle: RePEc:dem:demres:v:51:y:2024:i:41
    DOI: 10.4054/DemRes.2024.51.41
    as

    Download full text from publisher

    File URL: https://www.demographic-research.org/volumes/vol51/41/51-41.pdf
    Download Restriction: no

    File URL: https://libkey.io/10.4054/DemRes.2024.51.41?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Cummins, Neil, 2017. "Lifespans of the European Elite, 800–1800," The Journal of Economic History, Cambridge University Press, vol. 77(2), pages 406-439, June.
    2. repec:cai:poeine:pope_704_0729 is not listed on IDEAS
    3. Chen-Hao Hsu & Oliver Posegga & Kai Fischbach & Henriette Engelhardt, 2021. "Examining the trade-offs between human fertility and longevity over three centuries using crowdsourced genealogy data," PLOS ONE, Public Library of Science, vol. 16(8), pages 1-20, August.
    4. Robert Mare, 2011. "A Multigenerational View of Inequality," Demography, Springer;Population Association of America (PAA), vol. 48(1), pages 1-23, February.
    5. Marco Cozzani & Saverio Minardi & Giulia Corti & Nicola Barban, 2023. "Birth month and adult lifespan: A within-family, cohort, and spatial examination using FamiLinx data in the United States (1700–1899)," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 49(9), pages 201-218.
    6. Nina Cesare & Hedwig Lee & Tyler McCormick & Emma Spiro & Emilio Zagheni, 2018. "Promises and Pitfalls of Using Digital Traces for Demographic Research," Demography, Springer;Population Association of America (PAA), vol. 55(5), pages 1979-1999, October.
    7. Camarda, Carlo G., 2012. "MortalitySmooth: An R Package for Smoothing Poisson Counts with P-Splines," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 50(i01).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Colasurdo, Andrea & Omenti, Riccardo, 2024. "Using Online Genealogical Data for Demographic Research: An Empirical Examination of the FamiLinx Database," SocArXiv 62yxm, Center for Open Science.
    2. Robert Stelter & David De la Croix & Mikko Myrskylä, 2020. "Leaders and laggards in life expectancy among European scholars from the sixteenth to the early twentieth century," MPIDR Working Papers WP-2020-030, Max Planck Institute for Demographic Research, Rostock, Germany.
    3. Sirois, Catherine, 2020. "The strain of sons' incarceration on mothers’ health," Social Science & Medicine, Elsevier, vol. 264(C).
    4. Michelle Maroto, 2018. "Saving, Sharing, or Spending? The Wealth Consequences of Raising Children," Demography, Springer;Population Association of America (PAA), vol. 55(6), pages 2257-2282, December.
    5. Kieron Barclay & Torkild Lyngstad & Dalton Conley, 2018. "The Production of Inequalities within Families and Across Generations: The Intergenerational Effects of Birth Order and Family Size on Educational Attainment," NBER Working Papers 24530, National Bureau of Economic Research, Inc.
    6. Ahbab Mohammad Fazle Rabbi & Stefano Mazzuco, 2021. "Mortality Forecasting with the Lee–Carter Method: Adjusting for Smoothing and Lifespan Disparity," European Journal of Population, Springer;European Association for Population Studies, vol. 37(1), pages 97-120, March.
    7. John Robert Warren, 2015. "Potential Data Sources for a New Study of Social Mobility in the United States," The ANNALS of the American Academy of Political and Social Science, , vol. 657(1), pages 208-246, January.
    8. van den Berg, Gerard J. & Pinger, Pia R., 2016. "Transgenerational effects of childhood conditions on third generation health and education outcomes," Economics & Human Biology, Elsevier, vol. 23(C), pages 103-120.
    9. Kroeger, Sarah & Thompson, Owen, 2016. "Educational mobility across three generations of American women," Economics of Education Review, Elsevier, vol. 53(C), pages 72-86.
    10. José M. Aburto & Alyson A. van Raalte, 2017. "Lifespan dispersion in times of life expectancy fluctuation: the case of Central and Eastern Europe," MPIDR Working Papers WP-2017-018, Max Planck Institute for Demographic Research, Rostock, Germany.
    11. Michael Hout, 2015. "A Summary of What We Know about Social Mobility," The ANNALS of the American Academy of Political and Social Science, , vol. 657(1), pages 27-36, January.
    12. André Grow & Daniela Perrotta & Emanuele Del Fava & Jorge Cimentada & Francesco Rampazzo & B. Sofia Gil-Clavel & Emilio Zagheni & René D. Flores & Ilana Ventura & Ingmar G. Weber, 2021. "How reliable is Facebook’s advertising data for use in social science research? Insights from a cross-national online survey," MPIDR Working Papers WP-2021-006, Max Planck Institute for Demographic Research, Rostock, Germany.
    13. Juan Manuel Pérez-Salamero González & Marta Regúlez-Castillo & Carlos Vidal-Meliá, 2021. "Differences in Life Expectancy Between Self-Employed Workers and Paid Employees when Retirement Pensioners: Evidence from Spanish Social Security Records," European Journal of Population, Springer;European Association for Population Studies, vol. 37(3), pages 697-725, July.
    14. Deirdre Bloome, 2017. "Childhood Family Structure and Intergenerational Income Mobility in the United States," Demography, Springer;Population Association of America (PAA), vol. 54(2), pages 541-569, April.
    15. Alex Luscombe & Kevin Dick & Kevin Walby, 2022. "Algorithmic thinking in the public interest: navigating technical, legal, and ethical hurdles to web scraping in the social sciences," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(3), pages 1023-1044, June.
    16. Marie-Pier Bergeron-Boucher & Vladimir Canudas-Romo & James E. Oeppen & James W. Vaupel, 2017. "Coherent forecasts of mortality with compositional data analysis," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 37(17), pages 527-566.
    17. Jurić, Tado, 2022. "Forecasting Migration and Integration Trends Using Digital Demography – A Case Study of Emigration Flows from Croatia to Austria and Germany," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 70(1), pages 125-152.
    18. Bacolod, Marigee & Blum, Bernardo S. & Rangel, Marcos A. & Strange, William C., 2023. "Learners in cities: Agglomeration and the spatial division of cognition," Regional Science and Urban Economics, Elsevier, vol. 98(C).
    19. Timothy M. Smeeding, 2018. "The PSID in Research and Policy," The ANNALS of the American Academy of Political and Social Science, , vol. 680(1), pages 29-47, November.
    20. Alice Kasakoff & Andrew Lawson & Purbasha Dasgupta & Michael DuBois & Stephen Feetham, 2018. "The effects of family and location on wealth: A longitudinal study of the US North, 1850–1870," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 38(59), pages 1815-1842.

    More about this item

    Keywords

    genealogies; kinship network; data quality; FamiLinx; completeness; digital data;
    All these keywords.

    JEL classification:

    • J1 - Labor and Demographic Economics - - Demographic Economics
    • Z0 - Other Special Topics - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:dem:demres:v:51:y:2024:i:41. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Editorial Office (email available below). General contact details of provider: https://www.demogr.mpg.de/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.