IDEAS home Printed from https://ideas.repec.org/a/gam/jforec/v6y2024i1p13-238d1354410.html
   My bibliography  Save this article

Effective Natural Language Processing Algorithms for Early Alerts of Gout Flares from Chief Complaints

Author

Listed:
  • Lucas Lopes Oliveira

    (School of Computing, Mathematics and Data Sciences, Coventry University, Coventry CV1 5FB, UK
    These authors contributed equally to this work.)

  • Xiaorui Jiang

    (Centre for Computational Sciences and Mathematical Modelling, Coventry University, Coventry CV1 2TT, UK
    These authors contributed equally to this work.)

  • Aryalakshmi Nellippillipathil Babu

    (School of Computing, Mathematics and Data Sciences, Coventry University, Coventry CV1 5FB, UK
    These authors contributed equally to this work.)

  • Poonam Karajagi

    (School of Computing, Mathematics and Data Sciences, Coventry University, Coventry CV1 5FB, UK)

  • Alireza Daneshkhah

    (School of Computing, Mathematics and Data Sciences, Coventry University, Coventry CV1 5FB, UK
    Centre for Computational Sciences and Mathematical Modelling, Coventry University, Coventry CV1 2TT, UK)

Abstract

Early identification of acute gout is crucial, enabling healthcare professionals to implement targeted interventions for rapid pain relief and preventing disease progression, ensuring improved long-term joint function. In this study, we comprehensively explored the potential early detection of gout flares (GFs) based on nurses’ chief complaint notes in the Emergency Department (ED). Addressing the challenge of identifying GFs prospectively during an ED visit, where documentation is typically minimal, our research focused on employing alternative Natural Language Processing (NLP) techniques to enhance detection accuracy. We investigated GF detection algorithms using both sparse representations by traditional NLP methods and dense encodings by medical domain-specific Large Language Models (LLMs), distinguishing between generative and discriminative models. Three methods were used to alleviate the issue of severe data imbalances, including oversampling, class weights, and focal loss. Extensive empirical studies were performed on the Gout Emergency Department Chief Complaint Corpora. Sparse text representations like tf-idf proved to produce strong performances, achieving F1 scores higher than 0.75. The best deep learning models were RoBERTa-large-PM-M3-Voc and BioGPT, which had the best F1 scores for each dataset, with a 0.8 on the 2019 dataset and a 0.85 F1 score on the 2020 dataset, respectively. We concluded that although discriminative LLMs performed better for this classification task when compared to generative LLMs, a combination of using generative models as feature extractors and employing a support vector machine for classification yielded promising results comparable to those obtained with discriminative models.

Suggested Citation

  • Lucas Lopes Oliveira & Xiaorui Jiang & Aryalakshmi Nellippillipathil Babu & Poonam Karajagi & Alireza Daneshkhah, 2024. "Effective Natural Language Processing Algorithms for Early Alerts of Gout Flares from Chief Complaints," Forecasting, MDPI, vol. 6(1), pages 1-15, March.
  • Handle: RePEc:gam:jforec:v:6:y:2024:i:1:p:13-238:d:1354410
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-9394/6/1/13/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-9394/6/1/13/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jforec:v:6:y:2024:i:1:p:13-238:d:1354410. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.