Printed from https://ideas.repec.org/a/spr/sankhb/v84y2022i1d10.1007_s13571-021-00260-3.html

Fixed versus Mixed Effects Based Marginal Models for Clustered Correlated Binary Data: an Overview on Advances and Challenges

Author

Listed:
  • Brajendra C. Sutradhar

    (Memorial University)

Abstract

In a cross-sectional cluster setup, the binary responses from the individuals in a cluster become correlated because they share a common cluster effect, whereas the longitudinal responses from an individual, which together form a cluster, become correlated because the present and past responses are likely to maintain a suitable dynamic relationship. In both cluster and longitudinal setups, the marginal means may or may not be specifiable as a function of the regression effects/parameters only. In a cluster setup, this depends on the distributional assumption made for the random cluster effects; in a longitudinal setup, it depends on the form (linear or non-linear) of the dynamic relationship used to construct the conditional model. However, over the last four decades, many studies have arbitrarily pre-specified the marginal means as a function of the regression effects only, under both cluster and longitudinal setups, and have also accommodated correlations using arbitrarily selected ‘working’ correlation structures. This paper provides a thorough review of these decades-long binary correlation models with respect to consistent and efficient estimation of the regression effects. Both the progress and the drawbacks of these works are presented, showing clearly how inconsistency can arise if a pre-specified marginal fixed effects model is used when in fact no such marginal fixed effects model exists. This happens because some of the conditional random effects models in a cluster setup produce mixed effects models for the marginal means, and conditional non-linear dynamic models in a longitudinal setup produce history-based marginal recursive/dynamic models. As practitioners in both cluster and longitudinal setups deal with large data sets, it is demonstrated for their benefit how one can use the GQL (generalized quasi-likelihood) estimation approach in both setups.
Furthermore, there exist many studies using the Bayesian approach in which, unlike the aforementioned inferences based on parametric correlation structures, marginal mixed effects models have been used for inference on correlated binary data without specifying their correlation structures, under both cluster and longitudinal setups. We also provide a brief review of this alternative approach.
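
The abstract's point that the marginal mean in a cluster setup depends on the distribution assumed for the random cluster effects can be illustrated numerically. The sketch below is not taken from the paper: it assumes a normal random intercept with standard deviation `sigma`, a single linear predictor value `eta`, and hypothetical helper names (`expit`, `std_normal_cdf`, `marginal_mean`). It averages the conditional success probability over the random effect by Gauss-Hermite quadrature. Under a conditional logit model the resulting marginal mean is no longer the logistic function of `eta`, whereas under a conditional probit model the standard closed form Phi(eta / sqrt(1 + sigma^2)) is recovered.

```python
import numpy as np
from math import erf, sqrt


def expit(x):
    """Inverse logit link."""
    return 1.0 / (1.0 + np.exp(-x))


def std_normal_cdf(x):
    """Standard normal CDF (inverse probit link)."""
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))


def marginal_mean(link_inverse, eta, sigma, n_nodes=40):
    """Approximate E[link_inverse(eta + sigma * Z)] for Z ~ N(0, 1).

    Uses Gauss-Hermite quadrature: hermgauss returns nodes and weights for
    the weight function exp(-x^2), so we substitute z = sqrt(2) * x and
    divide by sqrt(pi).
    """
    nodes, weights = np.polynomial.hermite.hermgauss(n_nodes)
    z = sqrt(2.0) * nodes
    values = np.array([link_inverse(eta + sigma * zi) for zi in z])
    return float(np.sum(weights * values) / sqrt(np.pi))


if __name__ == "__main__":
    eta = 1.0  # assumed value of the linear predictor x'beta

    # Conditional logit: the marginal mean changes with sigma, so it is not
    # a function of the regression effects alone.
    for sigma in (0.0, 1.0, 2.0):
        print("logit  sigma =", sigma,
              "marginal mean =", marginal_mean(expit, eta, sigma),
              "naive fixed-effects mean =", float(expit(eta)))

    # Conditional probit: the marginal mean stays probit, with the
    # coefficient scaled by 1 / sqrt(1 + sigma^2).
    sigma = 1.0
    print("probit sigma =", sigma,
          "quadrature =", marginal_mean(std_normal_cdf, eta, sigma),
          "closed form =", std_normal_cdf(eta / sqrt(1.0 + sigma ** 2)))
```

With `sigma = 0` the quadrature reduces to the fixed effects mean; for `sigma > 0` the logit marginal mean is pulled towards 0.5, which is one way to see why a pre-specified fixed effects marginal model can be misspecified in this setting.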

Suggested Citation

  • Brajendra C. Sutradhar, 2022. "Fixed versus Mixed Effects Based Marginal Models for Clustered Correlated Binary Data: an Overview on Advances and Challenges," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(1), pages 259-302, May.
  • Handle: RePEc:spr:sankhb:v:84:y:2022:i:1:d:10.1007_s13571-021-00260-3
    DOI: 10.1007/s13571-021-00260-3

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13571-021-00260-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    Citations

    Cited by:

    1. Brajendra C. Sutradhar, 2023. "Prediction Theory for Multinomial Proportions Using Two-stage Cluster Samples," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(2), pages 1452-1488, August.

