IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i21p3392-d1510043.html
   My bibliography  Save this article

Yet Another Discriminant Analysis (YADA): A Probabilistic Model for Machine Learning Applications

Author

Listed:
  • Richard V. Field

    (Sandia National Laboratories, Albuquerque, NM 87185, USA)

  • Michael R. Smith

    (Sandia National Laboratories, Albuquerque, NM 87185, USA)

  • Ellery J. Wuest

    (Klipsch School of Electrical and Computer Engineering, New Mexico State University, Las Cruces, NM 88003, USA)

  • Joe B. Ingram

    (Sandia National Laboratories, Albuquerque, NM 87185, USA)

Abstract

This paper presents a probabilistic model for various machine learning (ML) applications. While deep learning (DL) has produced state-of-the-art results in many domains, DL models are complex and over-parameterized, which leads to high uncertainty about what the model has learned, as well as its decision process. Further, DL models are not probabilistic, making reasoning about their output challenging. In contrast, the proposed model, referred to as Yet Another Discriminate Analysis(YADA), is less complex than other methods, is based on a mathematically rigorous foundation, and can be utilized for a wide variety of ML tasks including classification, explainability, and uncertainty quantification. YADA is thus competitive in most cases with many state-of-the-art DL models. Ideally, a probabilistic model would represent the full joint probability distribution of its features, but doing so is often computationally expensive and intractable. Hence, many probabilistic models assume that the features are either normally distributed, mutually independent, or both, which can severely limit their performance. YADA is an intermediate model that (1) captures the marginal distributions of each variable and the pairwise correlations between variables and (2) explicitly maps features to the space of multivariate Gaussian variables. Numerous mathematical properties of the YADA model can be derived, thereby improving the theoretic underpinnings of ML. Validation of the model can be statistically verified on new or held-out data using native properties of YADA. However, there are some engineering and practical challenges that we enumerate to make YADA more useful.

Suggested Citation

  • Richard V. Field & Michael R. Smith & Ellery J. Wuest & Joe B. Ingram, 2024. "Yet Another Discriminant Analysis (YADA): A Probabilistic Model for Machine Learning Applications," Mathematics, MDPI, vol. 12(21), pages 1-25, October.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:21:p:3392-:d:1510043
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/21/3392/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/21/3392/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Targino, Rodrigo S. & Peters, Gareth W. & Shevchenko, Pavel V., 2015. "Sequential Monte Carlo Samplers for capital allocation under copula-dependent risk models," Insurance: Mathematics and Economics, Elsevier, vol. 61(C), pages 206-226.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Huang, Zhenzhen & Kwok, Yue Kuen & Xu, Ziqing, 2024. "Efficient algorithms for calculating risk measures and risk contributions in copula credit risk models," Insurance: Mathematics and Economics, Elsevier, vol. 115(C), pages 132-150.
    2. Man Chung Fung & Gareth W. Peters & Pavel V. Shevchenko, 2016. "A unified approach to mortality modelling using state-space framework: characterisation, identification, estimation and forecasting," Papers 1605.09484, arXiv.org.
    3. Sunoh Kim & Jin Hur, 2020. "Probabilistic Approaches to the Security Analysis of Smart Grid with High Wind Penetration: The Case of Jeju Island’s Power Grids," Energies, MDPI, vol. 13(21), pages 1-13, November.
    4. Ji, Liuyan & Tan, Ken Seng & Yang, Fan, 2021. "Tail dependence and heavy tailedness in extreme risks," Insurance: Mathematics and Economics, Elsevier, vol. 99(C), pages 282-293.
    5. Gareth W. Peters & Rodrigo S. Targino & Mario V. Wüthrich, 2017. "Bayesian Modelling, Monte Carlo Sampling and Capital Allocation of Insurance Risks," Risks, MDPI, vol. 5(4), pages 1-51, September.
    6. Gareth W. Peters & Efstathios Panayi & Francois Septier, 2015. "SMC-ABC methods for the estimation of stochastic simulation models of the limit order book," Papers 1504.05806, arXiv.org.
    7. Koike, Takaaki & Saporito, Yuri & Targino, Rodrigo, 2022. "Avoiding zero probability events when computing Value at Risk contributions," Insurance: Mathematics and Economics, Elsevier, vol. 106(C), pages 173-192.
    8. Takaaki Koike & Marius Hofert, 2020. "Markov Chain Monte Carlo Methods for Estimating Systemic Risk Allocations," Risks, MDPI, vol. 8(1), pages 1-33, January.
    9. Tong Pu & Yifei Zhang & Yiying Zhang, 2024. "On Joint Marginal Expected Shortfall and Associated Contribution Risk Measures," Papers 2405.07549, arXiv.org.
    10. Ignatieva, Katja & Landsman, Zinoviy, 2019. "Conditional tail risk measures for the skewed generalised hyperbolic family," Insurance: Mathematics and Economics, Elsevier, vol. 86(C), pages 98-114.
    11. Jaume Belles-Sampera & Montserrat Guillen & Miguel Santolino, 2023. "Haircut Capital Allocation as the Solution of a Quadratic Optimisation Problem," Mathematics, MDPI, vol. 11(18), pages 1-17, September.
    12. Takaaki Koike & Marius Hofert, 2019. "Markov Chain Monte Carlo Methods for Estimating Systemic Risk Allocations," Papers 1909.11794, arXiv.org, revised May 2020.
    13. Mélina Mailhot & Mhamed Mesfioui, 2016. "Multivariate TVaR-Based Risk Decomposition for Vector-Valued Portfolios," Risks, MDPI, vol. 4(4), pages 1-16, September.
    14. Paulusch, Joachim & Schlütter, Sebastian, 2022. "Sensitivity-implied tail-correlation matrices," Journal of Banking & Finance, Elsevier, vol. 134(C).
    15. Yuhao Liu & Petar M. Djurić & Young Shin Kim & Svetlozar T. Rachev & James Glimm, 2021. "Systemic Risk Modeling with Lévy Copulas," JRFM, MDPI, vol. 14(6), pages 1-20, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:21:p:3392-:d:1510043. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.