Combining BERT with numerical variables to classify injury leave based on accident description

Author

Listed:
  • Plínio MS Ramos
  • July B Macedo
  • Caio BS Maior
  • Márcio C Moura
  • Isis D Lins

Abstract

Work accidents threaten workers’ health and also burden organizations, for example through work restructuring and the direct and indirect costs of a worker's absence. Accident investigation reports contain information that can help companies identify the causes and consequences of injury events and propose preventive and mitigative measures. However, this information is frequently complex, redundant, and/or incomplete, and a complete human review of the entire database is arduous given the number of reports a company produces. Natural Language Processing (NLP) techniques are well suited to analyzing such large volumes of text. In this paper, we adopt NLP techniques to determine whether an injury leave would be expected from a given accident report. The methodology was applied to accident reports collected from an actual hydroelectric power company using Bidirectional Encoder Representations from Transformers (BERT), a state-of-the-art NLP method. The text representations provided by the BERT model were combined with numerical and binary variables extracted from the accident reports. These combined variables are the input to a Multilayer Perceptron (MLP) that predicts whether a given accident results in an injury leave. After cross-validation, the results showed a median accuracy of 73.5%. Additionally, we discuss several reports for which the tested models achieved high and low proportions of correct classifications, along with the possible reasons. Overall, accident investigation reports provide useful knowledge to support decisions in the safety context.
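
The pipeline described in the abstract (text representation combined with numerical and binary variables, fed to an MLP, scored by median cross-validation accuracy) can be illustrated with a minimal sketch. This is not the authors' implementation: the short embedding vector stands in for a real BERT output, and every variable name, weight, feature value, and fold accuracy below is a hypothetical placeholder chosen only to make the example run.

```python
import math
import statistics
from typing import List, Sequence


def fuse_features(text_embedding: Sequence[float],
                  numeric: Sequence[float],
                  binary: Sequence[int]) -> List[float]:
    """Concatenate a text embedding with numerical and binary report variables."""
    return (list(text_embedding)
            + [float(v) for v in numeric]
            + [float(b) for b in binary])


def mlp_forward(x: Sequence[float],
                w_hidden: Sequence[Sequence[float]],
                b_hidden: Sequence[float],
                w_out: Sequence[float],
                b_out: float) -> float:
    """Forward pass: one ReLU hidden layer, then a sigmoid output in (0, 1)."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(w_hidden, b_hidden)]
    logit = sum(w * h for w, h in zip(w_out, hidden)) + b_out
    return 1.0 / (1.0 + math.exp(-logit))


def median_accuracy(fold_accuracies: Sequence[float]) -> float:
    """Summarize cross-validation folds by their median accuracy."""
    return statistics.median(fold_accuracies)


# Hypothetical inputs: a 3-value stand-in for a BERT vector (real ones are
# far longer), two numerical variables, and one binary flag.
embedding = [0.12, -0.40, 0.33]
numeric = [34.0, 2.5]
binary = [1]

x = fuse_features(embedding, numeric, binary)

# Hypothetical, untrained weights: 2 hidden units over the 6 fused inputs.
w_hidden = [[0.01] * len(x), [-0.01] * len(x)]
b_hidden = [0.0, 0.0]
w_out = [1.0, 0.5]
b_out = -0.2

p_leave = mlp_forward(x, w_hidden, b_hidden, w_out, b_out)
predicted_leave = p_leave >= 0.5

# Hypothetical per-fold accuracies, not results from the paper.
summary = median_accuracy([0.6, 0.8, 0.7])
```

In practice the numerical variables would normally be scaled before concatenation so that they do not dominate the embedding dimensions, and the MLP weights would be learned from labeled reports rather than fixed by hand.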

Suggested Citation

  • Plínio MS Ramos & July B Macedo & Caio BS Maior & Márcio C Moura & Isis D Lins, 2024. "Combining BERT with numerical variables to classify injury leave based on accident description," Journal of Risk and Reliability, vol. 238(5), pages 945-956, October.
  • Handle: RePEc:sae:risrel:v:238:y:2024:i:5:p:945-956
    DOI: 10.1177/1748006X221140194

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/1748006X221140194
    Download Restriction: no

