IDEAS home Printed from https://ideas.repec.org/p/nbr/nberwo/30135.html
   My bibliography  Save this paper

Historical Newspaper Data: A Researcher’s Guide and Toolkit

Author

Listed:
  • Brian Beach
  • W. Walker Hanlon

Abstract

Digitized historical newspaper databases offer a valuable research tool. A rapidly expanding set of studies use these databases to address a wide range of topics. We review this literature and provide a toolkit for researchers interested in working with historical newspaper data. We provide a brief description of the evolution of historical newspapers, focusing on aspects that are likely to have implications for the design of empirical studies. We then review the main databases in use. We also discuss some key challenges in using these data, most importantly the fact that even the most extensive datasets contain only a selected sample of the universe of historical newspaper articles. We offer tools for evaluating the comprehensiveness of available newspaper datasets, show how to assess potential identification concerns, and suggest some solutions.

Suggested Citation

  • Brian Beach & W. Walker Hanlon, 2022. "Historical Newspaper Data: A Researcher’s Guide and Toolkit," NBER Working Papers 30135, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:30135
    Note: DAE
    as

    Download full text from publisher

    File URL: http://www.nber.org/papers/w30135.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Melissa Dell & Jacob Carlson & Tom Bryan & Emily Silcock & Abhishek Arora & Zejiang Shen & Luca D'Amico-Wong & Quan Le & Pablo Querubin & Leander Heldring, 2023. "American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers," Papers 2308.12477, arXiv.org.
    2. Alex Armand & Paul Atwell & Joseph F. Gomes & Yannik Schenk, 2023. "It’s a Bird, it’s a Plane, it’s Superman! Using Mass Media to fight Intolerance," LIDAM Discussion Papers IRES 2023012, Université catholique de Louvain, Institut de Recherches Economiques et Sociales (IRES).
    3. Jacob Carlson & Tom Bryan & Melissa Dell, 2023. "Efficient OCR for Building a Diverse Digital History," Papers 2304.02737, arXiv.org, revised Jul 2024.
    4. Melissa Dell, 2024. "Deep Learning for Economists," Papers 2407.15339, arXiv.org, revised Nov 2024.
    5. Quentin Lippmann & Khushboo Surana, 2022. "The Hierarchy of Partner Preferences," Discussion Papers 22/08, Department of Economics, University of York.
    6. Daniel J. Smith & Macy Scheck, 2023. "Examining the public interest rationale for regulating whiskey with the pure food and drugs act," Public Choice, Springer, vol. 196(1), pages 85-122, July.

    More about this item

    JEL classification:

    • N0 - Economic History - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberwo:30135. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.