IDEAS home Printed from https://ideas.repec.org/a/spr/sankha/v86y2024i2d10.1007_s13171-024-00362-w.html
   My bibliography  Save this article

Inferences for Fixed Effects Based Regression Parameters in a Finite Population Setup Using Two-stage Cluster Sample

Author

Listed:
  • Brajendra C. Sutradhar

    (Memorial University)

Abstract

In a clusters based infinite/super population (SP) setup, the inference complexity for regression parameters depend on the cluster correlations structure oriented marginal fixed or mixed effects based mean models, where as opposed to the fixed effects based models the mixed models exhibit both regression and cluster variance/correlation parameters in the mean structure especially for discrete such as clustered counts and binary data. On the other hand, as recently discussed by Sutradhar (2022, Sankhya B, 84, 259-302), in the fixed models setup the modelling of cluster correlations can be complex. In the finite population (FP) setup where FP data follow a SP model but the regression effects estimation is done using a two-stage cluster sample, the inferences become much more complex. As opposed to the recent mixed models based FP inferences (Sutradhar 2023b, Annals of the Institute of Statistical Mathematics, 75, 425-462), there does not appear any studies for fixed models based analysis for exponential family data where true correlation structures play important role in defining FP regression parameters. To resolve this inference issue, in this paper we make following specific contributions. First, the cluster correlations for linear, counts and binary data are developed in a SP setup such that the marginal mean models depend only on the fixed regression effects. Second the FP data which are hypothetical or unobserved until a sample is taken to observe a part, are utilized to develop hypothetical estimating equations for the SP regression parameters, which subsequently define the FP regression and correlation parameters. Third, the design weighted unbiased estimators are obtained for the FP regression and correlation parameters, where the specific formulas for correlation estimators depend on the nature of the response data whether linear, counts or binary. Also, as an additional application of the regression effects estimation, the FP total prediction is discussed. Next, the design consistency of the regression estimators is developed in details.

Suggested Citation

  • Brajendra C. Sutradhar, 2024. "Inferences for Fixed Effects Based Regression Parameters in a Finite Population Setup Using Two-stage Cluster Sample," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 86(2), pages 951-991, August.
  • Handle: RePEc:spr:sankha:v:86:y:2024:i:2:d:10.1007_s13171-024-00362-w
    DOI: 10.1007/s13171-024-00362-w
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13171-024-00362-w
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13171-024-00362-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zengri Wang, 2003. "Matching conditional and marginal shapes in binary random intercept models using a bridge distribution function," Biometrika, Biometrika Trust, vol. 90(4), pages 765-775, December.
    2. Sutradhar, Brajendra C. & Rao, R. Prabhakar, 2001. "On Marginal Quasi-Likelihood Inference in Generalized Linear Mixed Models," Journal of Multivariate Analysis, Elsevier, vol. 76(1), pages 1-34, January.
    3. Sutradhar, Brajendra C. & Jowaheer, Vandna, 2003. "On familial longitudinal Poisson mixed models with gamma random effects," Journal of Multivariate Analysis, Elsevier, vol. 87(2), pages 398-412, November.
    4. Brajendra C. Sutradhar, 2023. "Regression analysis for exponential family data in a finite population setup using two-stage cluster sample," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(3), pages 425-462, June.
    5. Brajendra C. Sutradhar, 2023. "Cluster Correlations and Complexity in Binary Regression Analysis Using Two-stage Cluster Samples," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(1), pages 829-884, February.
    6. Brajendra C. Sutradhar, 2022. "Fixed versus Mixed Effects Based Marginal Models for Clustered Correlated Binary Data: an Overview on Advances and Challenges," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(1), pages 259-302, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lanjia Lin & Dipankar Bandyopadhyay & Stuart R. Lipsitz & Debajyoti Sinha, 2010. "Association Models for Clustered Data with Binary and Continuous Responses," Biometrics, The International Biometric Society, vol. 66(1), pages 287-293, March.
    2. Jason Roy & Michael J. Daniels, 2008. "A General Class of Pattern Mixture Models for Nonignorable Dropout with Many Possible Dropout Times," Biometrics, The International Biometric Society, vol. 64(2), pages 538-545, June.
    3. Shun Yu & Xianzheng Huang, 2019. "Link misspecification in generalized linear mixed models with a random intercept for binary responses," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 827-843, September.
    4. Jonathan S. Schildcrout & Patrick J. Heagerty, 2007. "Marginalized Models for Moderate to Long Series of Longitudinal Binary Response Data," Biometrics, The International Biometric Society, vol. 63(2), pages 322-331, June.
    5. William Dunsmuir & Jieyi He, 2017. "Marginal Estimation of Parameter Driven Binomial Time Series Models," Journal of Time Series Analysis, Wiley Blackwell, vol. 38(1), pages 120-144, January.
    6. Brajendra C. Sutradhar, 2022. "Fixed versus Mixed Effects Based Marginal Models for Clustered Correlated Binary Data: an Overview on Advances and Challenges," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(1), pages 259-302, May.
    7. Youn Ahn, Jae & Jeong, Himchan & Lu, Yang, 2021. "On the ordering of credibility factors," Insurance: Mathematics and Economics, Elsevier, vol. 101(PB), pages 626-638.
    8. Caffo, Brian & An, Ming-Wen & Rohde, Charles, 2007. "Flexible random intercept models for binary outcomes using mixtures of normals," Computational Statistics & Data Analysis, Elsevier, vol. 51(11), pages 5220-5235, July.
    9. Iddi, Samuel & Molenberghs, Geert, 2012. "A combined overdispersed and marginalized multilevel model," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1944-1951.
    10. Iraj Kazemi & Fatemeh Hassanzadeh, 2021. "Marginalized random-effects models for clustered binomial data through innovative link functions," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 105(2), pages 197-228, June.
    11. Bruce J. Swihart & Brian S. Caffo & Ciprian M. Crainiceanu, 2014. "A Unifying Framework for Marginalised Random-Intercept Models of Correlated Binary Outcomes," International Statistical Review, International Statistical Institute, vol. 82(2), pages 275-295, August.
    12. Stephens Alisa & Tchetgen Tchetgen Eric & De Gruttola Victor, 2014. "Locally Efficient Estimation of Marginal Treatment Effects When Outcomes Are Correlated: Is the Prize Worth the Chase?," The International Journal of Biostatistics, De Gruyter, vol. 10(1), pages 59-75, May.
    13. Victor De Oliveira, 2017. "Geostatistical Binary Data: Models, Properties And Connections," Working Papers 0151mss, College of Business, University of Texas at San Antonio.
    14. Brajendra C. Sutradhar, 2023. "Regression analysis for exponential family data in a finite population setup using two-stage cluster sample," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(3), pages 425-462, June.
    15. Brajendra C. Sutradhar, 2023. "Prediction Theory for Multinomial Proportions Using Two-stage Cluster Samples," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(2), pages 1452-1488, August.
    16. Laura Boehm & Brian J. Reich & Dipankar Bandyopadhyay, 2013. "Bridging Conditional and Marginal Inference for Spatially Referenced Binary Data," Biometrics, The International Biometric Society, vol. 69(2), pages 545-554, June.
    17. Shaun R. Seaman & Menelaos Pavlou & Andrew J. Copas, 2014. "Methods for observed-cluster inference when cluster size is informative: A review and clarifications," Biometrics, The International Biometric Society, vol. 70(2), pages 449-456, June.
    18. Huang, Youjun & Pan, Jianxin, 2021. "Joint generalized estimating equations for longitudinal binary data," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
    19. Pan Zhao & Yifan Cui, 2023. "A Semiparametric Instrumented Difference-in-Differences Approach to Policy Learning," Papers 2310.09545, arXiv.org.
    20. Nooraee, Nazanin & Molenberghs, Geert & van den Heuvel, Edwin R., 2014. "GEE for longitudinal ordinal data: Comparing R-geepack, R-multgee, R-repolr, SAS-GENMOD, SPSS-GENLIN," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 70-83.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:sankha:v:86:y:2024:i:2:d:10.1007_s13171-024-00362-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.