IDEAS home Printed from https://ideas.repec.org/a/spr/drugsa/v48y2025i5d10.1007_s40264-025-01515-y.html
   My bibliography  Save this article

Network Analysis and Machine Learning for Signal Detection and Prioritization Using Electronic Healthcare Records and Administrative Databases: A Proof of Concept in Drug-Induced Acute Myocardial Infarction

Author

Listed:
  • Maria Antonietta Barbieri

    (University of Messina
    University of Copenhagen)

  • Andrea Abate

    (University of Messina
    University of Copenhagen)

  • Olivér M. Balogh

    (Semmelweis University
    Semmelweis University)

  • Mátyás Pétervári

    (Semmelweis University
    Semmelweis University
    Sanovigado Kft)

  • Péter Ferdinandy

    (Semmelweis University
    Semmelweis University
    Pharmahungary Group)

  • Bence Ágg

    (Semmelweis University
    Semmelweis University
    Pharmahungary Group)

  • Vera Battini

    (Università Degli Studi di Milano)

  • Marianna Cocco

    (Università Degli Studi di Milano)

  • Andrea Rossi

    (University of Milan
    IRCCS MultiMedica)

  • Carla Carnovale

    (Università Degli Studi di Milano)

  • Manuela Casula

    (University of Milan
    IRCCS MultiMedica)

  • Edoardo Spina

    (University of Messina)

  • Maurizio Sessa

    (University of Copenhagen)

Abstract

Background Safety signals for potential drug-induced adverse events (AEs) typically emerge from multiple data sources, primarily spontaneous reporting systems, despite known limitations. Increasingly, real-world data from sources such as electronic health records (EHRs) and administrative databases are leveraged for signal detection. Although network analysis has shown promise in mapping relationships between clinical attributes for signal detection in spontaneous reporting system databases, its application in real-world data from EHRs and administrative databases remains limited. Objective This study aimed to evaluate the performance of network analysis in detecting safety signals within Italian administrative databases, using drug-induced acute myocardial infarction (AMI) as a proof of concept. Methods We employed a case–crossover design to explore the association between drug exposure and AMI using the Healthcare Administrative Database of Mantova, Italy, from 2014 to 2018. Patients with their first AMI hospitalization were identified after a 365-day washout period to exclude prior hospitalizations. We constructed a network to analyse the relationships between prescribed drugs and diagnoses, represented as nodes, with undirected edges illustrating their interactions. For each patient with AMI, we identified all diagnoses and drugs recorded or redeemed within 365 days of the first AMI episode and generated various drug–diagnosis, drug–drug, and diagnosis–diagnosis pairs. We calculated the frequency of these pairs, and three types of edge weights quantified the strength of connections. We identified outlier drug–AMI pairs using a predictive score (F) based on frequency (C) and full edge weights (WF), with validation for known AMI associations. We prioritized signals using the F score, C of AMI, and WF, analysed through k-means clustering to identify patterns in the data. Results From 2014 to 2018, a total of 3918 patients had an AMI, with 4686 AMI diagnoses. Of those, 2866 had prescriptions in the previous year, totalling 498,591 prescriptions. A network analysis identified 2968 unique nodes, revealing 529,935 diagnosis–diagnosis connections, 235,380 drug–diagnosis connections, and 102,831 drug–drug connections. The median number of connections (C) was 404 (Q1–Q3: 194–671) for drug nodes and 380 (Q1–Q3: 216–664) for diagnosis nodes. The median WF was 11.8 (Q1–Q3: 9–14), and the median F score across pairs was 0.1 (Q1–Q3: 0.1–0.3). A total of 249 potential safety signals were detected, with 63.4% aligning with known AEs. Among the remaining signals, 80 were prioritized, and five emerged as the highest priority: terazosin, tamsulosin, allopurinol, esomeprazole, and omeprazole. Conclusions Overall, our novel method demonstrates that network analysis is a valuable tool for signal detection and prioritization in drug-induced AEs based on EHRs and administrative databases.

Suggested Citation

  • Maria Antonietta Barbieri & Andrea Abate & Olivér M. Balogh & Mátyás Pétervári & Péter Ferdinandy & Bence Ágg & Vera Battini & Marianna Cocco & Andrea Rossi & Carla Carnovale & Manuela Casula & Edoard, 2025. "Network Analysis and Machine Learning for Signal Detection and Prioritization Using Electronic Healthcare Records and Administrative Databases: A Proof of Concept in Drug-Induced Acute Myocardial Infa," Drug Safety, Springer, vol. 48(5), pages 513-526, May.
  • Handle: RePEc:spr:drugsa:v:48:y:2025:i:5:d:10.1007_s40264-025-01515-y
    DOI: 10.1007/s40264-025-01515-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40264-025-01515-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s40264-025-01515-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:drugsa:v:48:y:2025:i:5:d:10.1007_s40264-025-01515-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com/economics/journal/40264 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.