IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v18y2021i22p11759-d675355.html

Looking for Razors and Needles in a Haystack: Multifaceted Analysis of Suicidal Declarations on Social Media—A Pragmalinguistic Approach

Author

Listed:
  • Michal Ptaszynski

    (Department of Computer Science, Kitami Institute of Technology, Kitami 090-8507, Japan
    All authors contributed equally to this work.
    Current address: 165 Koen-cho, Kitami 090-8507, Japan.)

  • Monika Zasko-Zielinska

    (Department of Contemporary Polish Language, Faculty of Philology, University of Wrocław, 50-140 Wrocław, Poland
    All authors contributed equally to this work.)

  • Michal Marcinczuk

    (Samurai Labs, 81-824 Sopot, Poland
    Department of Computational Intelligence, Faculty of Computer Science and Management, Wrocław University of Science and Technology, 50-370 Wrocław, Poland
    All authors contributed equally to this work.)

  • Gniewosz Leliwa

    (Samurai Labs, 81-824 Sopot, Poland
    All authors contributed equally to this work.)

  • Marcin Fortuna

    (Samurai Labs, 81-824 Sopot, Poland
    Institute of English and American Studies, Glottodidactics and Natural Language Processing Division, University of Gdańsk, 80-308 Gdańsk, Poland
    All authors contributed equally to this work.)

  • Kamil Soliwoda

    (Samurai Labs, 81-824 Sopot, Poland
    All authors contributed equally to this work.)

  • Ida Dziublewska

    (Samurai Labs, 81-824 Sopot, Poland
    All authors contributed equally to this work.)

  • Olimpia Hubert

    (Samurai Labs, 81-824 Sopot, Poland
    All authors contributed equally to this work.)

  • Pawel Skrzek

    (Samurai Labs, 81-824 Sopot, Poland
    All authors contributed equally to this work.)

  • Jan Piesiewicz

    (Samurai Labs, 81-824 Sopot, Poland
    All authors contributed equally to this work.)

  • Paula Karbowska

    (Samurai Labs, 81-824 Sopot, Poland
    All authors contributed equally to this work.)

  • Maria Dowgiallo

    (Samurai Labs, 81-824 Sopot, Poland
    Institute of Clinical Psychology, SWPS University of Social Sciences and Humanities, 03-815 Warsaw, Poland
    All authors contributed equally to this work.)

  • Juuso Eronen

    (Department of Computer Science, Kitami Institute of Technology, Kitami 090-8507, Japan
    All authors contributed equally to this work.)

  • Patrycja Tempska

    (Samurai Labs, 81-824 Sopot, Poland
    All authors contributed equally to this work.)

  • Maciej Brochocki

    (Samurai Labs, 81-824 Sopot, Poland
    All authors contributed equally to this work.)

  • Marek Godny

    (Samurai Labs, 81-824 Sopot, Poland
    All authors contributed equally to this work.)

  • Michal Wroczynski

    (Samurai Labs, 81-824 Sopot, Poland
    All authors contributed equally to this work.)

Abstract

In this paper, we study the language used by suicidal users on the Reddit social media platform. To do that, we first collect a large-scale dataset of Reddit posts and have it annotated by highly trained expert annotators under a rigorous annotation scheme. Next, we perform a multifaceted analysis of the dataset, including: (1) an analysis of user activity before and after posting a suicidal message, and (2) a pragmalinguistic study of the vocabulary used by suicidal users. In the second part of the analysis, we apply LIWC, a dictionary-based toolset widely used in psychological and linguistic research, which provides a wide range of linguistic category annotations on text. However, since raw LIWC scores are not sufficiently reliable or informative, we propose a procedure that reduces the risk of unreliable LIWC scores leading to misleading conclusions by analyzing each category not separately, but in pairs with other categories. The results supported the validity of the proposed approach by revealing a number of valuable insights into the vocabulary used by suicidal users, and helped to pinpoint false predictors. For example, we were able to specify that death-related words, typically associated with suicidal posts in the majority of the literature, become false predictors when they co-occur with apostrophes, even in high-risk subreddits. On the other hand, the category-pair-based disambiguation helped to specify that death becomes a predictor only when co-occurring with future-focused language, informal language, discrepancy, or 1st person pronouns. We additionally analyzed the limitations of the approach and found that, although LIWC is a useful and easily applicable tool, its lack of any contextual processing makes it unsuitable for application in psychological and linguistic studies. We conclude that the disadvantages of LIWC can be easily overcome by creating a number of high-performance AI-based classifiers trained to annotate categories similar to those of LIWC, which we plan to pursue in future work.
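
The category-pair idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure and does not use the real LIWC dictionary. The toy lexicon, the category names, the example posts, and the helper functions `categories_in`, `pair_counts`, and `pair_rates` are all assumptions made for illustration. The sketch only shows how counting co-occurring category pairs, rather than single categories, can separate two groups of posts that both contain death-related words.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy lexicon for illustration only; NOT the real LIWC dictionary.
TOY_LEXICON = {
    "death": {"die", "dead", "death"},
    "i": {"i", "me", "my"},
    "focusfuture": {"will", "going", "soon"},
}


def categories_in(text):
    """Return the set of toy categories with at least one matching token."""
    tokens = text.lower().split()
    return {
        category
        for category, words in TOY_LEXICON.items()
        if any(token in words for token in tokens)
    }


def pair_counts(posts):
    """Count, over all posts, how many posts contain each pair of categories."""
    counts = Counter()
    for post in posts:
        present = sorted(categories_in(post))
        counts.update(combinations(present, 2))
    return counts


def pair_rates(posts):
    """Share of posts in which each category pair co-occurs."""
    total = len(posts)
    return {pair: count / total for pair, count in pair_counts(posts).items()}


# Made-up example posts (assumption): both groups mention death-related words,
# but only the first group combines them with 1st person or future-focused words.
flagged_posts = ["i will die soon", "i want to die"]
control_posts = ["that movie was about death", "dead battery again"]

flagged_rates = pair_rates(flagged_posts)
control_rates = pair_rates(control_posts)
```

In this toy setup the single category "death" fires in every post of both groups, so it cannot tell them apart, while the pair ("death", "i") appears only in the first group. A real analysis would need the licensed LIWC lexicon, proper tokenization, and statistical testing, none of which is shown here.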

Suggested Citation

  • Michal Ptaszynski & Monika Zasko-Zielinska & Michal Marcinczuk & Gniewosz Leliwa & Marcin Fortuna & Kamil Soliwoda & Ida Dziublewska & Olimpia Hubert & Pawel Skrzek & Jan Piesiewicz & Paula Karbowska, 2021. "Looking for Razors and Needles in a Haystack: Multifaceted Analysis of Suicidal Declarations on Social Media—A Pragmalinguistic Approach," IJERPH, MDPI, vol. 18(22), pages 1-49, November.
  • Handle: RePEc:gam:jijerp:v:18:y:2021:i:22:p:11759-:d:675355

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/18/22/11759/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/18/22/11759/
    Download Restriction: no

    References listed on IDEAS

    1. Takanao Tanaka & Shohei Okamoto, 2021. "Increase in suicide following an initial decline during the COVID-19 pandemic in Japan," Nature Human Behaviour, Nature, vol. 5(2), pages 229-238, February.
    2. Škare, Marinko & Soriano, Domingo Riberio & Porada-Rochoń, Małgorzata, 2021. "Impact of COVID-19 on the travel and tourism industry," Technological Forecasting and Social Change, Elsevier, vol. 163(C).
    3. Shaoxiong Ji & Celina Ping Yu & Sai-fu Fung & Shirui Pan & Guodong Long, 2018. "Supervised Learning for Suicidal Ideation Detection in Online User Content," Complexity, Hindawi, vol. 2018, pages 1-10, September.

    Citations


    Cited by:

    1. Michal Ptaszynski & Agata Pieciukiewicz & Pawel Dybala & Pawel Skrzek & Kamil Soliwoda & Marcin Fortuna & Gniewosz Leliwa & Michal Wroczynski, 2023. "Expert-Annotated Dataset to Study Cyberbullying in Polish Language," Data, MDPI, vol. 9(1), pages 1-26, December.
    2. Yun Gu & Deyuan Chen & Xiaoqian Liu, 2022. "Suicide Possibility Scale Detection via Sina Weibo Analytics: Preliminary Results," IJERPH, MDPI, vol. 20(1), pages 1-11, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Iva Gregurec & Martina Tomičić Furjan & Katarina Tomičić-Pupek, 2021. "The Impact of COVID-19 on Sustainable Business Models in SMEs," Sustainability, MDPI, vol. 13(3), pages 1-24, January.
    2. Kusa, Rafał & Suder, Marcin & Duda, Joanna, 2023. "Impact of greening on performance in the hospitality industry: Moderating effect of flexibility and inter-organizational cooperation," Technological Forecasting and Social Change, Elsevier, vol. 190(C).
    3. de Palma, André & Vosough, Shaghayegh & Liao, Feixiong, 2022. "An overview of effects of COVID-19 on mobility and lifestyle: 18 months since the outbreak," Transportation Research Part A: Policy and Practice, Elsevier, vol. 159(C), pages 372-397.
    4. Chopdar, Prasanta Kr & Paul, Justin & Prodanova, Jana, 2022. "Mobile shoppers’ response to Covid-19 phobia, pessimism and smartphone addiction: Does social influence matter?," Technological Forecasting and Social Change, Elsevier, vol. 174(C).
    5. M. A. Hannan & M. S. Abd Rahman & Ali Q. Al-Shetwi & R. A. Begum & Pin Jern Ker & M. Mansor & M. S. Mia & M. J. Hossain & Z. Y. Dong & T. M. I. Mahlia, 2022. "Impact Assessment of COVID-19 Severity on Environment, Economy and Society towards Affecting Sustainable Development Goals," Sustainability, MDPI, vol. 14(23), pages 1-23, November.
    6. Svaleryd, Helena & Vlachos, Jonas, 2022. "COVID-19 and School Closures," GLO Discussion Paper Series 1008, Global Labor Organization (GLO).
    7. Yun Gu & Deyuan Chen & Xiaoqian Liu, 2022. "Suicide Possibility Scale Detection via Sina Weibo Analytics: Preliminary Results," IJERPH, MDPI, vol. 20(1), pages 1-11, December.
    8. Sugiyama, Yuri, 2022. "Can Soft Law Improve the Welfare of Sexual Minorities? The Case of Same-sex Partnership Policy in Japan," CEI Working Paper Series 2022-06, Center for Economic Institutions, Institute of Economic Research, Hitotsubashi University.
    9. Derya Demirdelen Alrawadieh, 2021. "Does Employability Anxiety Trigger Psychological Distress and Academic Major Dissatisfaction? A Study on Tour Guiding Students," Journal of Tourismology, Istanbul University, Faculty of Economics, vol. 7(1), pages 55-71, June.
    10. Ruri Okubo & Ryusuke Matsumoto & Eishi Motomura & Motohiro Okada, 2024. "Uncertainties of Economic Policy and Government Management Stability Played Important Roles in Increasing Suicides in Japan from 2009 to 2023," IJERPH, MDPI, vol. 21(10), pages 1-18, October.
    11. Chung-Wei Kuo, 2021. "Can We Return to Our Normal Life When the Pandemic Is under Control? A Preliminary Study on the Influence of COVID-19 on the Tourism Characteristics of Taiwan," Sustainability, MDPI, vol. 13(17), pages 1-17, August.
    12. Kamer-Ainur Aivaz & Adrian Micu, 2021. "An analysis of the impact of the COVID-19 pandemic on the number of tourists arriving in Romania using the correspondence factor analysis," Technium Social Sciences Journal, Technium Science, vol. 24(1), pages 324-335, October.
    13. Kristina Gligorić & Arnaud Chiolero & Emre Kıcıman & Ryen W. White & Robert West, 2022. "Population-scale dietary interests during the COVID-19 pandemic," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    14. Jose Antonio Cava Jimenez & Mª Genoveva Millán Vázquez de la Torre & Mª Genoveva Dancausa Millán, 2022. "Enotourism in Southern Spain: The Montilla-Moriles PDO," IJERPH, MDPI, vol. 19(6), pages 1-21, March.
    15. Huiyue Liu & Qiancai Tan & Huiping Mai, 2023. "Stress-Buffering Effects of Social Support on Tourism Employees during the COVID-19 Pandemic: A Moderated Mediation Model," IJERPH, MDPI, vol. 20(3), pages 1-20, January.
    16. Zhenyu Qi & Yuezhou You, 2024. "The Impact of the Integration of the Culture Industry and Tourism on Regional Green Development: Empirical Evidence from China," Sustainability, MDPI, vol. 16(8), pages 1-21, April.
    17. Segarra-Blasco, Agustí & Teruel, Mercedes & Cattaruzzo, Sebastiano, 2021. "The economic reaction to non-pharmaceutical interventions during Covid-19," Economic Analysis and Policy, Elsevier, vol. 72(C), pages 592-608.
    18. Dorn, Florian & Lange, Berit & Braml, Martin & Gstrein, David & Nyirenda, John L.Z. & Vanella, Patrizio & Winter, Joachim & Fuest, Clemens & Krause, Gérard, 2023. "The challenge of estimating the direct and indirect effects of COVID-19 interventions – Toward an integrated economic and epidemiological approach," Economics & Human Biology, Elsevier, vol. 49(C).
    19. Schiopu, Andreea Fortuna & Hornoiu, Remus Ion & Padurean, Ana Mihaela & Nica, Ana-Maria, 2022. "Constrained and virtually traveling? Exploring the effect of travel constraints on intention to use virtual reality in tourism," Technology in Society, Elsevier, vol. 71(C).
    20. Erika Cantor & Rodrigo Salas & Romina Torres, 2022. "Femicide and Attempted Femicide before and during the COVID-19 Pandemic in Chile," IJERPH, MDPI, vol. 19(13), pages 1-13, June.
