IDEAS home Printed from https://ideas.repec.org/a/taf/vhimxx/v53y2020i4p226-243.html
   My bibliography  Save this article

Digital begriffsgeschichte: Tracing semantic change using word embeddings

Author

Listed:
  • Melvin Wevers
  • Marijn Koolen

Abstract

Recently, the use of word embedding models (WEM) has received ample attention in the natural language processing community. These models can capture semantic information in large corpora of text by learning distributional properties of words, that is how often particular words appear in specific contexts. Scholars have pointed out the potential of WEMs for historical research. In particular, their ability to capture semantic change might assist historians studying conceptual change or specific discursive formations over time. Concurrently, others voiced their criticism and pointed out that WEMs require large amounts of training data, that they are challenging to evaluate, and they lack the specificity looked for by historians. The ability to examine semantic change resonates with the goals of historians such as Reinhart Koselleck, whose research focused on the formation of concepts and the transformation of semantic fields. However, word embeddings can only be used to study particular types of semantic change, and the model’s use is dependent on the size, quality, and bias in training data. In this article, we examine what is required of historical data to produce reliable WEMs, and we describe the types of questions that can be answered using WEMs.

Suggested Citation

  • Melvin Wevers & Marijn Koolen, 2020. "Digital begriffsgeschichte: Tracing semantic change using word embeddings," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 53(4), pages 226-243, October.
  • Handle: RePEc:taf:vhimxx:v:53:y:2020:i:4:p:226-243
    DOI: 10.1080/01615440.2020.1760157
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/01615440.2020.1760157
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/01615440.2020.1760157?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sarah Oberbichler & Emanuela Boroş & Antoine Doucet & Jani Marjanen & Eva Pfanzelter & Juha Rautiainen & Hannu Toivonen & Mikko Tolonen, 2022. "Integrated interdisciplinary workflows for research on historical newspapers: Perspectives from humanities scholars, computer scientists, and librarians," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(2), pages 225-239, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:vhimxx:v:53:y:2020:i:4:p:226-243. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/vhim20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.