IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i5p828-d764745.html
   My bibliography  Save this article

Bayes in Wonderland! Predictive Supervised Classification Inference Hits Unpredictability

Author

Listed:
  • Ali Amiryousefi

    (Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, 00014 Helsinki, Finland)

  • Ville Kinnula

    (Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, 00014 Helsinki, Finland)

  • Jing Tang

    (Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, 00014 Helsinki, Finland)

Abstract

The marginal Bayesian predictive classifiers (mBpc), as opposed to the simultaneous Bayesian predictive classifiers (sBpc), handle each data separately and, hence, tacitly assume the independence of the observations. Due to saturation in learning of generative model parameters, the adverse effect of this false assumption on the accuracy of mBpc tends to wear out in the face of an increasing amount of training data, guaranteeing the convergence of these two classifiers under the de Finetti type of exchangeability. This result, however, is far from trivial for the sequences generated under Partition Exchangeability (PE), where even umpteen amount of training data does not rule out the possibility of an unobserved outcome (Wonderland!). We provide a computational scheme that allows the generation of the sequences under PE. Based on that, with controlled increase of the training data, we show the convergence of the sBpc and mBpc. This underlies the use of simpler yet computationally more efficient marginal classifiers instead of simultaneous. We also provide a parameter estimation of the generative model giving rise to the partition exchangeable sequence as well as a testing paradigm for the equality of this parameter across different samples. The package for Bayesian predictive supervised classifications, parameter estimation and hypothesis testing of the Ewens sampling formula generative model is deposited on CRAN as PEkit package.

Suggested Citation

  • Ali Amiryousefi & Ville Kinnula & Jing Tang, 2022. "Bayes in Wonderland! Predictive Supervised Classification Inference Hits Unpredictability," Mathematics, MDPI, vol. 10(5), pages 1-11, March.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:5:p:828-:d:764745
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/5/828/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/5/828/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Federico Bassetti & Lucia Ladelli, 2021. "Mixture of Species Sampling Models," Mathematics, MDPI, vol. 9(23), pages 1-27, December.
    2. Bassetti, Federico & Ladelli, Lucia, 2020. "Asymptotic number of clusters for species sampling sequences with non-diffuse base measure," Statistics & Probability Letters, Elsevier, vol. 162(C).
    3. Hansen, Ben & Pitman, Jim, 2000. "Prediction rules for exchangeable sequences related to species sampling," Statistics & Probability Letters, Elsevier, vol. 46(3), pages 251-256, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Federico Bassetti & Lucia Ladelli, 2021. "Mixture of Species Sampling Models," Mathematics, MDPI, vol. 9(23), pages 1-27, December.
    2. U. Garibaldi & D. Costantini & P. Viarengo, 2007. "The two-parameter Ewens distribution: a finitary approach," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 2(2), pages 147-161, December.
    3. Canale, Antonio & Lijoi, Antonio & Nipoti, Bernardo & Prünster, Igor, 2023. "Inner spike and slab Bayesian nonparametric models," Econometrics and Statistics, Elsevier, vol. 27(C), pages 120-135.
    4. Andrea Collevecchio & Codina Cotar & Marco LiCalzi, 2011. "On a preferential attachment and generalized Pólya's urn model," Working Papers 8, Venice School of Management - Department of Management, Università Ca' Foscari Venezia, revised Oct 2012.
    5. Emanuele Dolera, 2022. "Preface to the Special Issue on “Bayesian Predictive Inference and Related Asymptotics—Festschrift for Eugenio Regazzini’s 75th Birthday”," Mathematics, MDPI, vol. 10(15), pages 1-4, July.
    6. Masanao Aoki & Hiroshi Yoshikawa, 2012. "Non-self-averaging in macroeconomic models: a criticism of modern micro-founded macroeconomics," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 7(1), pages 1-22, May.
    7. Martínez-Ovando Juan Carlos & Olivares-Guzmán Sergio I. & Roldán-Rodríguez Adriana, 2014. "Predictive Inference on Finite Populations Segmented in Planned and Unplanned Domains," Working Papers 2014-04, Banco de México.
    8. Wenpin Tang, 2022. "Stability of shares in the Proof of Stake Protocol -- Concentration and Phase Transitions," Papers 2206.02227, arXiv.org.
    9. Bissiri, Pier Giovanni, 2010. "Characterization of the law of a finite exchangeable sequence through the finite-dimensional distributions of the empirical measure," Statistics & Probability Letters, Elsevier, vol. 80(17-18), pages 1306-1312, September.
    10. Cerquetti, Annalisa, 2007. "A note on Bayesian nonparametric priors derived from exponentially tilted Poisson-Kingman models," Statistics & Probability Letters, Elsevier, vol. 77(18), pages 1705-1711, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:5:p:828-:d:764745. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.