IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0180830.html
   My bibliography  Save this article

Adaptive Swarm Balancing Algorithms for rare-event prediction in imbalanced healthcare data

Author

Listed:
  • Jinyan Li
  • Lian-sheng Liu
  • Simon Fong
  • Raymond K Wong
  • Sabah Mohammed
  • Jinan Fiaidhi
  • Yunsick Sung
  • Kelvin K L Wong

Abstract

Clinical data analysis and forecasting have made substantial contributions to disease control, prevention and detection. However, such data usually suffer from highly imbalanced samples in class distributions. In this paper, we aim to formulate effective methods to rebalance binary imbalanced dataset, where the positive samples take up only the minority. We investigate two different meta-heuristic algorithms, particle swarm optimization and bat algorithm, and apply them to empower the effects of synthetic minority over-sampling technique (SMOTE) for pre-processing the datasets. One approach is to process the full dataset as a whole. The other is to split up the dataset and adaptively process it one segment at a time. The experimental results reported in this paper reveal that the performance improvements obtained by the former methods are not scalable to larger data scales. The latter methods, which we call Adaptive Swarm Balancing Algorithms, lead to significant efficiency and effectiveness improvements on large datasets while the first method is invalid. We also find it more consistent with the practice of the typical large imbalanced medical datasets. We further use the meta-heuristic algorithms to optimize two key parameters of SMOTE. The proposed methods lead to more credible performances of the classifier, and shortening the run time compared to brute-force method.

Suggested Citation

  • Jinyan Li & Lian-sheng Liu & Simon Fong & Raymond K Wong & Sabah Mohammed & Jinan Fiaidhi & Yunsick Sung & Kelvin K L Wong, 2017. "Adaptive Swarm Balancing Algorithms for rare-event prediction in imbalanced healthcare data," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-25, July.
  • Handle: RePEc:plo:pone00:0180830
    DOI: 10.1371/journal.pone.0180830
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0180830
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0180830&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0180830?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yitong Guo & Jie Mei & Zhiting Pan & Haonan Liu & Weiwei Li, 2022. "Adaptively Promoting Diversity in a Novel Ensemble Method for Imbalanced Credit-Risk Evaluation," Mathematics, MDPI, vol. 10(11), pages 1-21, May.
    2. Xiaoshun Xie & Wanni Xu & Xiaobo Lian & You-Lei Fu, 2022. "Sustainable Restoration of Ancient Architectural Patterns in Fujian Using Improved Algorithms Based on Criminisi," Sustainability, MDPI, vol. 14(21), pages 1-25, October.
    3. Treena Basu & Olaf Menzer & Joshua Ward & Indranil SenGupta, 2022. "A Novel Implementation of Siamese Type Neural Networks in Predicting Rare Fluctuations in Financial Time Series," Risks, MDPI, vol. 10(2), pages 1-16, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0180830. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.