IDEAS home Printed from https://ideas.repec.org/a/taf/raagxx/v112y2022i7p2045-2063.html
   My bibliography  Save this article

Sleeping Lion or Sick Man? Machine Learning Approaches to Deciphering Heterogeneous Images of Chinese in North America

Author

Listed:
  • Qiang Fu
  • Yufan Zhuang
  • Yushu Zhu
  • Xin Guo

Abstract

Based on more than 280,000 newspaper articles published in North America, this study proposes an integrative machine learning framework to explore heterogeneous social sentiments over time. After retrieving and preprocessing articles containing the term “Chinese” from six mainstream newspapers, we identified major discussion topics and assigned articles to their corresponding topics via posterior probabilities estimated by using a novel Bayesian nonparametric model, the hierarchical Dirichlet process. We also employed a groundbreaking deep learning technique, bidirectional encoder representations from transformers, to assign a negative or positive sentiment score to each newspaper article, which was trained on binary-labeled movie reviews from the Internet Movie Database (IMDb). By combining state-of-the-art tools for topic modeling and sentiment analysis, we found an overall lack of consensus on whether sentiments in North America since 1978 were pro- or anti-Chinese. Moreover, the images of Chinese are highly topic specific: (1) sentiments across different topics show distinct trajectories over the period of study; (2) discussion topics explain much more of the variation in sentiments than do the publisher, year of publication, or country of publisher; (3) less positive sentiments appear to be more relevant to material concerns than to ethnic considerations, whereas more positive sentiments are associated with an appreciation of culture; and (4) sentiments on the same or similar topic might exhibit different temporal patterns in the United States and Canada. These new findings not only suggest a multifaceted and dynamic view of social sentiments in a transnational context but also call for a paradigm shift in understanding intertwined sociodiscursive interactions over time.

Suggested Citation

  • Qiang Fu & Yufan Zhuang & Yushu Zhu & Xin Guo, 2022. "Sleeping Lion or Sick Man? Machine Learning Approaches to Deciphering Heterogeneous Images of Chinese in North America," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 112(7), pages 2045-2063, October.
  • Handle: RePEc:taf:raagxx:v:112:y:2022:i:7:p:2045-2063
    DOI: 10.1080/24694452.2022.2042180
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/24694452.2022.2042180
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/24694452.2022.2042180?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:raagxx:v:112:y:2022:i:7:p:2045-2063. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/raag .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.