IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v37y2022i2d10.1007_s00180-021-01131-1.html
   My bibliography  Save this article

Limitations and performance of three approaches to Bayesian inference for Gaussian copula regression models of discrete data

Author

Listed:
  • L. L. Henn

    (Arbor Research Collaborative for Health)

Abstract

Gaussian copula regression models provide a flexible, intuitive framework in which to model dependent responses with a variety of marginal distributions. With non-continuous outcomes, the time required to compute the likelihood directly grows exponentially with sample size. What alternatives exist rarely have been considered in a Bayesian framework. We conduct inference for Gaussian copula regression models of non-continuous outcomes using three distinct approaches in a Bayesian setting: the continuous extension, the distributional transform, and the composite likelihood. The latter two include curvature correction. We consider the posterior distributional shapes and computational performance as well. We consider both simulations of several types of non-continuous data and analyses of real data. Data sets and types were chosen to challenge the performance of these approaches. Using frequentist methods, we evaluate the inference resulting from these three approaches. The distributional transform with curvature correction has good to excellent coverage for discrete variables with numerous levels. It also offers considerably faster performance than the other options considered, making it attractive for evaluating models of mutually dependent non-continuous responses. For responses with fewer levels, composite likelihood may be the only viable option.

Suggested Citation

  • L. L. Henn, 2022. "Limitations and performance of three approaches to Bayesian inference for Gaussian copula regression models of discrete data," Computational Statistics, Springer, vol. 37(2), pages 909-946, April.
  • Handle: RePEc:spr:compst:v:37:y:2022:i:2:d:10.1007_s00180-021-01131-1
    DOI: 10.1007/s00180-021-01131-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-021-01131-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-021-01131-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Matthieu Marbac & Christophe Biernacki & Vincent Vandewalle, 2017. "Model-based clustering of Gaussian copulas for mixed data," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 46(23), pages 11635-11656, December.
    2. Richard E. Chandler & Steven Bate, 2007. "Inference for clustered data using the independence loglikelihood," Biometrika, Biometrika Trust, vol. 94(1), pages 167-183.
    3. Peter Xue‐Kun Song, 2000. "Multivariate Dispersion Models Generated From Gaussian Copula," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 27(2), pages 305-320, June.
    4. Annie Qu, 2004. "Assessing robustness of generalised estimating equations and quadratic inference functions," Biometrika, Biometrika Trust, vol. 91(2), pages 447-459, June.
    5. Michael Pitt & David Chan & Robert Kohn, 2006. "Efficient Bayesian inference for Gaussian copula regression models," Biometrika, Biometrika Trust, vol. 93(3), pages 537-554, September.
    6. Genest, Christian & Nešlehová, Johanna, 2007. "A Primer on Copulas for Count Data," ASTIN Bulletin, Cambridge University Press, vol. 37(2), pages 475-515, November.
    7. Higgs, Megan Dailey & Hoeting, Jennifer A., 2010. "A clipped latent variable model for spatially correlated ordered categorical data," Computational Statistics & Data Analysis, Elsevier, vol. 54(8), pages 1999-2011, August.
    8. Michael S. Smith & Mohamad A. Khaled, 2012. "Estimation of Copula Models With Discrete Margins via Bayesian Data Augmentation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 290-303, March.
    9. Peter X.-K. Song & Mingyao Li & Ying Yuan, 2009. "Joint Regression Analysis of Correlated Data Using Gaussian Copulas," Biometrics, The International Biometric Society, vol. 65(1), pages 60-68, March.
    10. Denuit, Michel & Lambert, Philippe, 2005. "Constraints on concordance measures in bivariate discrete data," Journal of Multivariate Analysis, Elsevier, vol. 93(1), pages 40-57, March.
    11. Cristiano Varin, 2008. "On composite marginal likelihoods," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 92(1), pages 1-28, February.
    12. Yun Bai & Jian Kang & Peter X.-K. Song, 2014. "Efficient pairwise composite likelihood estimation for spatial-clustered data," Biometrics, The International Biometric Society, vol. 70(3), pages 661-670, September.
    13. Lu Yang & Edward W. Frees & Zhengjun Zhang, 2020. "Nonparametric Estimation of Copula Regression Models With Discrete Outcomes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 707-720, April.
    14. L. Madsen & Y. Fang, 2011. "Joint Regression Analysis for Discrete Longitudinal Data," Biometrics, The International Biometric Society, vol. 67(3), pages 1171-1175, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Nikoloulopoulos, Aristidis K., 2023. "Efficient and feasible inference for high-dimensional normal copula regression models," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Smith, Michael Stanley, 2023. "Implicit Copulas: An Overview," Econometrics and Statistics, Elsevier, vol. 28(C), pages 81-104.
    2. Michael Stanley Smith, 2021. "Implicit Copulas: An Overview," Papers 2109.04718, arXiv.org.
    3. Fokianos, Konstantinos, 2024. "Multivariate Count Time Series Modelling," Econometrics and Statistics, Elsevier, vol. 31(C), pages 100-116.
    4. Xiaotian Zheng & Athanasios Kottas & Bruno Sansó, 2023. "Bayesian geostatistical modeling for discrete‐valued processes," Environmetrics, John Wiley & Sons, Ltd., vol. 34(7), November.
    5. George Karabatsos, 2024. "Copula Approximate Bayesian Computation Using Distribution Random Forests," Stats, MDPI, vol. 7(3), pages 1-49, September.
    6. Lu Yang & Claudia Czado, 2022. "Two‐part D‐vine copula models for longitudinal insurance claim data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(4), pages 1534-1561, December.
    7. Fokianos, Konstantinos & Fried, Roland & Kharin, Yuriy & Voloshko, Valeriy, 2022. "Statistical analysis of multivariate discrete-valued time series," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    8. Jong-Min Kim & Hyunsu Ju & Yoonsung Jung, 2020. "Copula Approach for Developing a Biomarker Panel for Prediction of Dengue Hemorrhagic Fever," Annals of Data Science, Springer, vol. 7(4), pages 697-712, December.
    9. Stöber, Jakob & Hong, Hyokyoung Grace & Czado, Claudia & Ghosh, Pulak, 2015. "Comorbidity of chronic diseases in the elderly: Patterns identified by a copula design for mixed responses," Computational Statistics & Data Analysis, Elsevier, vol. 88(C), pages 28-39.
    10. Zilko, Aurelius A. & Kurowicka, Dorota, 2016. "Copula in a multivariate mixed discrete–continuous model," Computational Statistics & Data Analysis, Elsevier, vol. 103(C), pages 28-55.
    11. Craiu, V. Radu & Sabeti, Avideh, 2012. "In mixed company: Bayesian inference for bivariate conditional copula models with discrete and continuous outcomes," Journal of Multivariate Analysis, Elsevier, vol. 110(C), pages 106-120.
    12. Azam, Kazim & Pitt, Michael, 2014. "Bayesian Inference for a Semi-Parametric Copula-based Markov Chain," The Warwick Economics Research Paper Series (TWERPS) 1051, University of Warwick, Department of Economics.
    13. Aristidis Nikoloulopoulos & Dimitris Karlis, 2010. "Regression in a copula model for bivariate count data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 37(9), pages 1555-1568.
    14. Zichen Ma & Shannon W. Davis & Yen‐Yi Ho, 2023. "Flexible copula model for integrating correlated multi‐omics data from single‐cell experiments," Biometrics, The International Biometric Society, vol. 79(2), pages 1559-1572, June.
    15. Azam, Kazim & Pitt, Michael, 2014. "Bayesian Inference for a Semi-Parametric Copula-based Markov Chain," Economic Research Papers 270232, University of Warwick - Department of Economics.
    16. Siem Jan Koopman & Rutger Lit & André Lucas & Anne Opschoor, 2018. "Dynamic discrete copula models for high‐frequency stock price changes," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(7), pages 966-985, November.
    17. Pravin Trivedi & David Zimmer, 2017. "A Note on Identification of Bivariate Copulas for Discrete Count Data," Econometrics, MDPI, vol. 5(1), pages 1-11, February.
    18. Elizabeth D. Schifano & Himchan Jeong & Ved Deshpande & Dipak K. Dey, 2021. "Fully and empirical Bayes approaches to estimating copula-based models for bivariate mixed outcomes using Hamiltonian Monte Carlo," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(1), pages 133-152, March.
    19. Ruben Loaiza-Maya & Michael Stanley Smith, 2017. "Variational Bayes Estimation of Discrete-Margined Copula Models with Application to Time Series," Papers 1712.09150, arXiv.org, revised Jul 2018.
    20. Amjad, Muhammad & Akbar, Muhammad & Ullah, Hamd, 2022. "A copula-based approach for creating an index of micronutrient intakes at household level in Pakistan," Economics & Human Biology, Elsevier, vol. 46(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:37:y:2022:i:2:d:10.1007_s00180-021-01131-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.