IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i6p788-d1353151.html
   My bibliography  Save this article

Sparse Bayesian Neural Networks: Bridging Model and Parameter Uncertainty through Scalable Variational Inference

Author

Listed:
  • Aliaksandr Hubin

    (Bioinformatics and Applied Statistics, Norwegian University of Life Sciences, 1433 Ås, Norway
    Department of Mathematics, University of Oslo, 0316 Oslo, Norway
    Research Administration, Ostfold University College, 1757 Halden, Norway
    Norwegian Computing Center, 0373 Oslo, Norway)

  • Geir Storvik

    (Department of Mathematics, University of Oslo, 0316 Oslo, Norway
    Norwegian Computing Center, 0373 Oslo, Norway)

Abstract

Bayesian neural networks (BNNs) have recently regained a significant amount of attention in the deep learning community due to the development of scalable approximate Bayesian inference techniques. There are several advantages of using a Bayesian approach: parameter and prediction uncertainties become easily available, facilitating more rigorous statistical analysis. Furthermore, prior knowledge can be incorporated. However, the construction of scalable techniques that combine both structural and parameter uncertainty remains a challenge. In this paper, we apply the concept of model uncertainty as a framework for structural learning in BNNs and, hence, make inferences in the joint space of structures/models and parameters. Moreover, we suggest an adaptation of a scalable variational inference approach with reparametrization of marginal inclusion probabilities to incorporate the model space constraints. Experimental results on a range of benchmark datasets show that we obtain comparable accuracy results with the competing models, but based on methods that are much more sparse than ordinary BNNs.

Suggested Citation

  • Aliaksandr Hubin & Geir Storvik, 2024. "Sparse Bayesian Neural Networks: Bridging Model and Parameter Uncertainty through Scalable Variational Inference," Mathematics, MDPI, vol. 12(6), pages 1-28, March.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:6:p:788-:d:1353151
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/6/788/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/6/788/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ludwig Fahrmeir & Stefan Lang, 2001. "Bayesian inference for generalized additive mixed models based on Markov random field priors," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 50(2), pages 201-220.
    2. Dobra, Adrian & Hans, Chris & Jones, Beatrix & Nevins, J.R.Joseph R. & Yao, Guang & West, Mike, 2004. "Sparse graphical models for exploring gene expression data," Journal of Multivariate Analysis, Elsevier, vol. 90(1), pages 196-212, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Giraud Christophe & Huet Sylvie & Verzelen Nicolas, 2012. "Graph Selection with GGMselect," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(3), pages 1-52, February.
    2. Paci, Lucia & Consonni, Guido, 2020. "Structural learning of contemporaneous dependencies in graphical VAR models," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    3. Simon N. Wood & Natalya Pya & Benjamin Säfken, 2016. "Smoothing Parameter and Model Selection for General Smooth Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1548-1563, October.
    4. Thomas A. Murray & Brian P. Hobbs & Theodore C. Lystig & Bradley P. Carlin, 2014. "Semiparametric Bayesian commensurate survival model for post-market medical device surveillance with non-exchangeable historical data," Biometrics, The International Biometric Society, vol. 70(1), pages 185-191, March.
    5. Lawrence N Kazembe, 2013. "A Bayesian Two Part Model Applied to Analyze Risk Factors of Adult Mortality with Application to Data from Namibia," PLOS ONE, Public Library of Science, vol. 8(9), pages 1-10, September.
    6. Wessel N. van Wieringen & Carel F. W. Peeters & Renee X. de Menezes & Mark A. van de Wiel, 2018. "Testing for pathway (in)activation by using Gaussian graphical models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 67(5), pages 1419-1436, November.
    7. Byol Kim & Song Liu & Mladen Kolar, 2021. "Two‐sample inference for high‐dimensional Markov networks," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(5), pages 939-962, November.
    8. Rouven E. Haschka & Helmut Herwartz, 2022. "Endogeneity in pharmaceutical knowledge generation: An instrument‐free copula approach for Poisson frontier models," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 31(4), pages 942-960, November.
    9. Agius Phaedra & Ying Yiming & Campbell Colin, 2009. "Bayesian Unsupervised Learning with Multiple Data Types," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-29, June.
    10. Shuxi Zeng & Elizabeth C. Lange & Elizabeth A. Archie & Fernando A. Campos & Susan C. Alberts & Fan Li, 2023. "A Causal Mediation Model for Longitudinal Mediators and Survival Outcomes with an Application to Animal Behavior," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 28(2), pages 197-218, June.
    11. Xin Fang & Bo Fang & Chunfang Wang & Tian Xia & Matteo Bottai & Fang Fang & Yang Cao, 2019. "Comparison of Frequentist and Bayesian Generalized Additive Models for Assessing the Association between Daily Exposure to Fine Particles and Respiratory Mortality: A Simulation Study," IJERPH, MDPI, vol. 16(5), pages 1-20, March.
    12. Wang, Hao, 2010. "Sparse seemingly unrelated regression modelling: Applications in finance and econometrics," Computational Statistics & Data Analysis, Elsevier, vol. 54(11), pages 2866-2877, November.
    13. Eric Gilman & Milani Chaloupka & Brett Wiedoff & Jeremy Willson, 2014. "Mitigating Seabird Bycatch during Hauling by Pelagic Longline Vessels," PLOS ONE, Public Library of Science, vol. 9(1), pages 1-12, January.
    14. Daniel Felix Ahelegbey & Luis Carvalho & Eric D. Kolaczyk, 2020. "A Bayesian Covariance Graph And Latent Position Model For Multivariate Financial Time Series," DEM Working Papers Series 181, University of Pavia, Department of Economics and Management.
    15. Christine Peterson & Francesco C. Stingo & Marina Vannucci, 2015. "Bayesian Inference of Multiple Gaussian Graphical Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 159-174, March.
    16. Serinaldi, Francesco, 2011. "Distributional modeling and short-term forecasting of electricity prices by Generalized Additive Models for Location, Scale and Shape," Energy Economics, Elsevier, vol. 33(6), pages 1216-1226.
    17. Belitz, Christiane & Lang, Stefan, 2008. "Simultaneous selection of variables and smoothing parameters in structured additive regression models," Computational Statistics & Data Analysis, Elsevier, vol. 53(1), pages 61-81, September.
    18. Volker Schmid & Leonhard Held, 2004. "Bayesian Extrapolation of Space–Time Trends in Cancer Registry Data," Biometrics, The International Biometric Society, vol. 60(4), pages 1034-1042, December.
    19. Birgit Schrödle & Leonhard Held, 2011. "A primer on disease mapping and ecological regression using $${\texttt{INLA}}$$," Computational Statistics, Springer, vol. 26(2), pages 241-258, June.
    20. Klein, Nadja & Herwartz, Helmut & Kneib, Thomas, 2020. "Modelling regional patterns of inefficiency: A Bayesian approach to geoadditive panel stochastic frontier analysis with an application to cereal production in England and Wales," Journal of Econometrics, Elsevier, vol. 214(2), pages 513-539.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:6:p:788-:d:1353151. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.