
Prediction Theory for Multinomial Proportions Using Two-stage Cluster Samples

Author

Listed:
  • Brajendra C. Sutradhar

    (Memorial University)

Abstract

In a two-stage cluster sampling setup for categorical data, it is well known that the so-called best prediction of the category-based proportions involves computing the conditional means of the non-sampled multinomial variables given the sampled multinomial responses. This computation is, however, not easy, mainly because of the complex correlations among multinomial responses within a cluster. The independence-assumption-based approach and the linear model approaches for cluster-correlated data used so far in existing studies are not valid for computing such conditional means in the prediction function for multinomial data. As opposed to these ‘working’ independence or linear-model-based approaches, in this paper we first develop a cluster correlation structure for multinomial data and exploit this structure to derive theoretically valid formulas for the conditional means of the non-sampled hypothetical responses. Next, because these conditional means, or equivalently the prediction function, contain the regression and cluster variance/correlation parameters, we estimate these parameters using a conditional likelihood approach based on survey sampling weights, whereas existing studies mostly use independence-assumption-based likelihood or moment approaches, which are invalid or inadequate in a correlation setup. The proposed conditional likelihood estimators are shown to be consistent for their respective parameters, leading to consistent estimation of the prediction function for the multinomial proportions.
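As a rough illustration of the general structure described in the abstract, the sketch below combines observed category indicators for sampled units with conditional means for non-sampled units to form predicted category proportions. It is not the paper's method: the function name `predict_proportions`, the toy data, and the conditional-mean values are all hypothetical, and the conditional means are simply taken as given inputs rather than derived from any cluster correlation model.

```python
import numpy as np

def predict_proportions(sampled_onehot, nonsampled_cond_means):
    """Predict category proportions for one cluster (illustrative only).

    sampled_onehot: (n, J) array of 0/1 category indicators for sampled units.
    nonsampled_cond_means: (N - n, J) array of assumed conditional means
        E[y_nonsampled | sampled responses]; supplied by the caller, since
        this sketch does not model the within-cluster correlation.
    """
    sampled = np.asarray(sampled_onehot, dtype=float)
    cond = np.asarray(nonsampled_cond_means, dtype=float)
    total_units = sampled.shape[0] + cond.shape[0]
    # Observed counts for sampled units plus expected counts for the rest.
    return (sampled.sum(axis=0) + cond.sum(axis=0)) / total_units

# Hypothetical toy cluster: 3 categories, 4 sampled and 2 non-sampled units.
sampled = [[1, 0, 0], [0, 1, 0], [1, 0, 0], [0, 0, 1]]
cond_means = [[0.5, 0.3, 0.2], [0.6, 0.2, 0.2]]  # made-up values
proportions = predict_proportions(sampled, cond_means)
print(proportions)
```

Under a working independence assumption, each row of `cond_means` would reduce to the marginal category probabilities, which is the simplification the abstract argues is inadequate for cluster-correlated data.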

Suggested Citation

  • Brajendra C. Sutradhar, 2023. "Prediction Theory for Multinomial Proportions Using Two-stage Cluster Samples," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(2), pages 1452-1488, August.
  • Handle: RePEc:spr:sankha:v:85:y:2023:i:2:d:10.1007_s13171-022-00297-0
    DOI: 10.1007/s13171-022-00297-0

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13171-022-00297-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Brajendra C. Sutradhar & R. Prabhakar Rao, 2023. "Asymptotic Inferences in a Multinomial Logit Mixed Model for Spatial Categorical Data," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(1), pages 885-930, February.
    2. Brajendra C. Sutradhar, 2022. "Multinomial Logistic Mixed Models for Clustered Categorical Data in a Complex Survey Sampling Setup," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(2), pages 743-789, August.
    3. D. Todem & Y. Zhang & A. Ismail & W. Sohn, 2010. "Random effects regression models for count data with excess zeros in caries research," Journal of Applied Statistics, Taylor & Francis Journals, vol. 37(10), pages 1661-1679.
    4. Brajendra C. Sutradhar, 2023. "Regression analysis for exponential family data in a finite population setup using two-stage cluster sample," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 75(3), pages 425-462, June.
    5. Bartolucci, Francesco & Farcomeni, Alessio, 2009. "A Multivariate Extension of the Dynamic Logit Model for Longitudinal Data Based on a Latent Markov Heterogeneity Structure," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 816-831.
    6. Chaubert, F. & Mortier, F. & Saint André, L., 2008. "Multivariate dynamic model for ordinal outcomes," Journal of Multivariate Analysis, Elsevier, vol. 99(8), pages 1717-1732, September.
    7. Daniel Nevo & Deborah Blacker & Eric B. Larson & Sebastien Haneuse, 2022. "Modeling semi‐competing risks data as a longitudinal bivariate process," Biometrics, The International Biometric Society, vol. 78(3), pages 922-936, September.
    8. Sutradhar, Brajendra C., 2021. "Block-band behavior of spatial correlations: An analytical asymptotic study in a spatial exponential family data setup," Journal of Multivariate Analysis, Elsevier, vol. 186(C).
    9. Brajendra C. Sutradhar, 2023. "Cluster Correlations and Complexity in Binary Regression Analysis Using Two-stage Cluster Samples," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(1), pages 829-884, February.
    10. Brajendra C. Sutradhar, 2024. "Inferences for Fixed Effects Based Regression Parameters in a Finite Population Setup Using Two-stage Cluster Sample," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 86(2), pages 951-991, August.
    11. Celine Marielle Laffont & Marc Vandemeulebroecke & Didier Concordet, 2014. "Multivariate Analysis of Longitudinal Ordinal Data With Mixed Effects Models, With Application to Clinical Outcomes in Osteoarthritis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 955-966, September.
