IDEAS home Printed from https://ideas.repec.org/a/spr/jcsosc/v7y2024i1d10.1007_s42001-023-00243-6.html
   My bibliography  Save this article

Using word embeddings for immigrant and refugee stereotype quantification in a diachronic and multilingual setting

Author

Listed:
  • Danielly Sorato

    (Universitat Pompeu Fabra)

  • Martin Lundsteen

    (Universitat de Barcelona)

  • Carme Colominas Ventura

    (Universitat Pompeu Fabra)

  • Diana Zavala-Rojas

    (Universitat Pompeu Fabra
    European Social Survey ERIC)

Abstract

Word embeddings are efficient machine-learning-based representations of human language used in many Natural Language Processing tasks nowadays. Due to their ability to learn underlying word association patterns present in large volumes of data, it is possible to observe various sociolinguistic phenomena in the embedding semantic space, such as social stereotypes. The use of stereotypical framing in discourse can be detrimental and induce misconceptions about certain groups, such as immigrants and refugees, especially when used by media and politicians in public discourse. In this paper, we use word embeddings to investigate immigrant and refugee stereotypes in a multilingual and diachronic setting. We analyze the Danish, Dutch, English, and Spanish portions of four different multilingual corpora of political discourse, covering the 1997–2018 period. Then, we measure the effect of sociopolitical variables such as the number of offences committed and the size of the refugee and immigrant groups in the host country over our measurements of stereotypical association using the Bayesian multilevel framework. Our results indicate the presence of stereotypical associations towards both immigrants and refugees for all 4 languages, and that the immigrants are overall more strongly associated with the stereotypical frames than refugees.

Suggested Citation

  • Danielly Sorato & Martin Lundsteen & Carme Colominas Ventura & Diana Zavala-Rojas, 2024. "Using word embeddings for immigrant and refugee stereotype quantification in a diachronic and multilingual setting," Journal of Computational Social Science, Springer, vol. 7(1), pages 469-521, April.
  • Handle: RePEc:spr:jcsosc:v:7:y:2024:i:1:d:10.1007_s42001-023-00243-6
    DOI: 10.1007/s42001-023-00243-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s42001-023-00243-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s42001-023-00243-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jcsosc:v:7:y:2024:i:1:d:10.1007_s42001-023-00243-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.