IDEAS home Printed from https://ideas.repec.org/a/spr/testjl/v31y2022i2d10.1007_s11749-022-00812-3.html
   My bibliography  Save this article

Hybrid semiparametric Bayesian networks

Author

Listed:
  • David Atienza

    (Universidad Politécnica de Madrid)

  • Pedro Larrañaga

    (Universidad Politécnica de Madrid)

  • Concha Bielza

    (Universidad Politécnica de Madrid)

Abstract

This paper presents a new class of Bayesian networks called hybrid semiparametric Bayesian networks, which can model hybrid data (discrete and continuous data) by mixing parametric and nonparametric estimation models. The parametric estimation models can represent a conditional linear Gaussian relationship between variables, while the nonparametric estimation model can represent other types of relationships, such as non-Gaussian and nonlinear relationships. This new class of Bayesian networks generalizes the conditional linear Gaussian Bayesian networks, including them as a special case. In addition, we describe a learning procedure for the structure and the parameters of our proposed type of Bayesian network. This learning procedure finds the best combination of parametric and nonparametric models automatically from data. This requires the definition of a cross-validated score. We also detail how new data can be sampled from a hybrid semiparametric Bayesian network, which in turn can be useful to solve other related tasks, such as inference. Furthermore, we intuitively relate our proposal with adaptive kernel density estimation models. The experimental results show that hybrid semiparametric Bayesian networks are a valuable contribution when dealing with data that do not meet the parametric assumptions that are expected for other models, such as conditional linear Gaussian Bayesian networks. We include experiments with synthetic data and real-world data from the UCI repository which demonstrate the good performance and the ability to extract useful information about the relationship between the variables in the model.

Suggested Citation

  • David Atienza & Pedro Larrañaga & Concha Bielza, 2022. "Hybrid semiparametric Bayesian networks," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(2), pages 299-327, June.
  • Handle: RePEc:spr:testjl:v:31:y:2022:i:2:d:10.1007_s11749-022-00812-3
    DOI: 10.1007/s11749-022-00812-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11749-022-00812-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11749-022-00812-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. David Atienza & Pedro Larrañaga & Concha Bielza, 2022. "Rejoinder on: Hybrid semiparametric Bayesian networks," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(2), pages 344-347, June.
    2. Carpenter, Bob & Gelman, Andrew & Hoffman, Matthew D. & Lee, Daniel & Goodrich, Ben & Betancourt, Michael & Brubaker, Marcus & Guo, Jiqiang & Li, Peter & Riddell, Allen, 2017. "Stan: A Probabilistic Programming Language," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 76(i01).
    3. Peter Spirtes & Clark Glymour & Richard Scheines, 2001. "Causation, Prediction, and Search, 2nd Edition," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262194406, April.
    4. Cao, Ricardo & Cuevas, Antonio & Gonzalez Manteiga, Wensceslao, 1994. "A comparative study of several smoothing methods in density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 17(2), pages 153-176, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. David Atienza & Pedro Larrañaga & Concha Bielza, 2022. "Rejoinder on: Hybrid semiparametric Bayesian networks," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(2), pages 344-347, June.
    2. Yu, Yaocheng & Shuai, Bin & Huang, Wencheng, 2024. "Resilience evaluation of train control on-board system considering common cause failure: Based on a beta-factor and continuous-time bayesian network model," Reliability Engineering and System Safety, Elsevier, vol. 246(C).
    3. Rong Li & Qing Liu & Lei Wang, 2024. "An Index Model for the Evaluation of the Performance of Lock Navigation Scheduling Rules Considering the Perspective of Stakeholders," Sustainability, MDPI, vol. 16(5), pages 1-20, March.
    4. Stefan Sperlich, 2022. "Comments on: hybrid semiparametric Bayesian networks," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(2), pages 335-339, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Francis,David C. & Kubinec ,Robert, 2022. "Beyond Political Connections : A Measurement Model Approach to Estimating Firm-levelPolitical Influence in 41 Economies," Policy Research Working Paper Series 10119, The World Bank.
    2. Yongping Bao & Ludwig Danwitz & Fabian Dvorak & Sebastian Fehrler & Lars Hornuf & Hsuan Yu Lin & Bettina von Helversen, 2022. "Similarity and Consistency in Algorithm-Guided Exploration," CESifo Working Paper Series 10188, CESifo.
    3. Heinrich, Torsten & Yang, Jangho & Dai, Shuanping, 2020. "Growth, development, and structural change at the firm-level: The example of the PR China," MPRA Paper 105011, University Library of Munich, Germany.
    4. Xin Xu & Yang Lu & Yupeng Zhou & Zhiguo Fu & Yanjie Fu & Minghao Yin, 2021. "An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks," Mathematics, MDPI, vol. 9(15), pages 1-14, July.
    5. Xiaoyue Xi & Simon E. F. Spencer & Matthew Hall & M. Kate Grabowski & Joseph Kagaayi & Oliver Ratmann & Rakai Health Sciences Program and PANGEA‐HIV, 2022. "Inferring the sources of HIV infection in Africa from deep‐sequence data with semi‐parametric Bayesian Poisson flow models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(3), pages 517-540, June.
    6. Bettendorf, Timo & Heinlein, Reinhold, 2019. "Connectedness between G10 currencies: Searching for the causal structure," Discussion Papers 06/2019, Deutsche Bundesbank.
    7. Joseph B. Bak-Coleman & Ian Kennedy & Morgan Wack & Andrew Beers & Joseph S. Schafer & Emma S. Spiro & Kate Starbird & Jevin D. West, 2022. "Combining interventions to reduce the spread of viral misinformation," Nature Human Behaviour, Nature, vol. 6(10), pages 1372-1380, October.
    8. David M. Phillippo & Sofia Dias & A. E. Ades & Mark Belger & Alan Brnabic & Alexander Schacht & Daniel Saure & Zbigniew Kadziola & Nicky J. Welton, 2020. "Multilevel network meta‐regression for population‐adjusted treatment comparisons," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(3), pages 1189-1210, June.
    9. Alina Ferecatu & Arnaud Bruyn & Prithwiraj Mukherjee, 2024. "Silently killing your panelists one email at a time: The true cost of email solicitations," Journal of the Academy of Marketing Science, Springer, vol. 52(4), pages 1216-1239, July.
    10. Maarten J. Bijlsma & Rhian M. Daniel & Fanny Janssen & Bianca L. De Stavola, 2017. "An Assessment and Extension of the Mechanism-Based Approach to the Identification of Age-Period-Cohort Models," Demography, Springer;Population Association of America (PAA), vol. 54(2), pages 721-743, April.
    11. Burbano, Vanessa & Padilla, Nicolas & Meier, Stephan, 2020. "Gender Differences in Preferences for Meaning at Work," IZA Discussion Papers 13053, Institute of Labor Economics (IZA).
    12. Robert Kubinec & Haillie Na‐Kyung Lee & Andrey Tomashevskiy, 2021. "Politically connected companies are less likely to shutdown due to COVID‐19 restrictions," Social Science Quarterly, Southwestern Social Science Association, vol. 102(5), pages 2155-2169, September.
    13. Barrington-Leigh, C.P., 2024. "The econometrics of happiness: Are we underestimating the returns to education and income?," Journal of Public Economics, Elsevier, vol. 230(C).
    14. Ichimura, Hidehiko & Todd, Petra E., 2007. "Implementing Nonparametric and Semiparametric Estimators," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 74, Elsevier.
    15. Matthew A. Masten & Alexandre Poirier, 2020. "Inference on breakdown frontiers," Quantitative Economics, Econometric Society, vol. 11(1), pages 41-111, January.
    16. del Rio, Alejandro Quintela, 1996. "Comparison of bandwidth selectors in nonparametric regression under dependence," Computational Statistics & Data Analysis, Elsevier, vol. 21(5), pages 563-580, May.
    17. Salvatore Nunnari & Massimiliano Pozzi, 2022. "Meta-Analysis of Inequality Aversion Estimates," CESifo Working Paper Series 9851, CESifo.
    18. Andreas Kryger Jensen & Claus Thorn Ekstrøm, 2021. "Quantifying the trendiness of trends," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(1), pages 98-121, January.
    19. Lauderdale, Benjamin E. & Bailey, Delia & Blumenau, Jack & Rivers, Douglas, 2020. "Model-based pre-election polling for national and sub-national outcomes in the US and UK," International Journal of Forecasting, Elsevier, vol. 36(2), pages 399-413.
    20. Tamara Broderick & Ryan Giordano & Rachael Meager, 2020. "An Automatic Finite-Sample Robustness Metric: When Can Dropping a Little Data Make a Big Difference?," Papers 2011.14999, arXiv.org, revised Jul 2023.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:testjl:v:31:y:2022:i:2:d:10.1007_s11749-022-00812-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.