IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i16p2586-d1461048.html
   My bibliography  Save this article

Predicting the Performance of Ensemble Classification Using Conditional Joint Probability

Author

Listed:
  • Iqbal Murtza

    (Education and Research Center for IoT Convergence Intelligent City Safety Platform, Chonnam National University, Gwangju 61186, Republic of Korea
    Department of Creative Technologies, Faculty of Computing & AI, Air University, Islamabad 44230, Pakistan)

  • Jin-Young Kim

    (Department of Intelligent Electronics and Computer Engineering, Chonnam National University, Gwangju 61186, Republic of Korea)

  • Muhammad Adnan

    (Department of Technology and Safety, UiT the Arctic University of Norway, 9019 Tromsø, Norway)

Abstract

In many machine learning applications, there are many scenarios when performance is not satisfactory by single classifiers. In this case, an ensemble classification is constructed using several weak base learners to achieve satisfactory performance. Unluckily, the construction of the ensemble classification is empirical, i.e., to try an ensemble classification and if performance is not satisfactory then discard it. In this paper, a challenging analytical problem of the estimation of ensemble classification using the prediction performance of the base learners is considered. The proposed formulation is aimed at estimating the performance of ensemble classification without physically developing it, and it is derived from the perspective of probability theory by manipulating the decision probabilities of the base learners. For this purpose, the output of a base learner (which is either true positive, true negative, false positive, or false negative) is considered as a random variable. Then, the effects of logical disjunction-based and majority voting-based decision combination strategies are analyzed from the perspective of conditional joint probability. To evaluate the forecasted performance of ensemble classifier by the proposed methodology, publicly available standard datasets have been employed. The results show the effectiveness of the derived formulations to estimate the performance of ensemble classification. In addition to this, the theoretical and experimental results show that the logical disjunction-based decision outperforms majority voting in imbalanced datasets and cost-sensitive scenarios.

Suggested Citation

  • Iqbal Murtza & Jin-Young Kim & Muhammad Adnan, 2024. "Predicting the Performance of Ensemble Classification Using Conditional Joint Probability," Mathematics, MDPI, vol. 12(16), pages 1-16, August.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:16:p:2586-:d:1461048
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/16/2586/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/16/2586/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Chakraborty, Tanujit & Chakraborty, Ashis Kumar & Murthy, C.A., 2019. "A nonparametric ensemble binary classifier and its statistical properties," Statistics & Probability Letters, Elsevier, vol. 149(C), pages 16-23.
    2. Yong Zhang & Dapeng Wang, 2013. "A Cost-Sensitive Ensemble Method for Class-Imbalanced Datasets," Abstract and Applied Analysis, Hindawi, vol. 2013, pages 1-6, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    2. Tanujit Chakraborty & Ashis Kumar Chakraborty & Zubia Mansoor, 2019. "A hybrid regression model for water quality prediction," OPSEARCH, Springer;Operational Research Society of India, vol. 56(4), pages 1167-1178, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:16:p:2586-:d:1461048. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.