IDEAS home Printed from https://ideas.repec.org/a/eee/stapro/v81y2011i7p892-901.html
   My bibliography  Save this article

Pseudo-likelihood methodology for partitioned large and complex samples

Author

Listed:
  • Molenberghs, Geert
  • Verbeke, Geert
  • Iddi, Samuel

Abstract

Large data sets, either coming from a large number of independent replications, or because of hierarchies in the data with large numbers of within-unit replication, may pose challenges to the data analyst up to the point of making conventional inferential methods, such as maximum likelihood, prohibitive. Based on general pseudo-likelihood concepts, we propose a method to partition such a set of data, analyze each partition member, and properly combine the inferences into a single one. It is shown that the method is fully efficient for independent partitions, while with dependent sub-samples efficiency is sometimes but not always equal to one. It is argued that, for important realistic settings, efficiency is often very high. Illustrative examples enhance insight in the method's operation, while real-data analysis underscores its power for practice.

Suggested Citation

  • Molenberghs, Geert & Verbeke, Geert & Iddi, Samuel, 2011. "Pseudo-likelihood methodology for partitioned large and complex samples," Statistics & Probability Letters, Elsevier, vol. 81(7), pages 892-901, July.
  • Handle: RePEc:eee:stapro:v:81:y:2011:i:7:p:892-901
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-7152(11)00019-8
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Steffen Fieuws & Geert Verbeke, 2006. "Pairwise Fitting of Mixed Models for the Joint Modeling of Multivariate Longitudinal Profiles," Biometrics, The International Biometric Society, vol. 62(2), pages 424-431, June.
    2. Cristiano Varin, 2008. "On composite marginal likelihoods," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 92(1), pages 1-28, February.
    3. Steffen Fieuws & Geert Verbeke & Filip Boen & Christophe Delecluse, 2006. "High dimensional multivariate mixed models for binary questionnaire data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 55(4), pages 449-460, August.
    4. D. R. Cox, 2004. "A note on pseudolikelihood constructed from marginal densities," Biometrika, Biometrika Trust, vol. 91(3), pages 729-737, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Craiu, Radu V. & Duchesne, Thierry, 2018. "A scalable and efficient covariate selection criterion for mixed effects regression models with unknown random effects structure," Computational Statistics & Data Analysis, Elsevier, vol. 117(C), pages 154-161.
    2. Hillary Koch & Cheryl A. Keller & Guanjue Xiang & Belinda Giardine & Feipeng Zhang & Yicheng Wang & Ross C. Hardison & Qunhua Li, 2022. "CLIMB: High-dimensional association detection in large scale genomic data," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    3. Liu, Wei & Zhang, Bo & Zhang, Zhiwei & Chen, Baojiang & Zhou, Xiao-Hua, 2015. "A pseudo-likelihood approach for estimating diagnostic accuracy of multiple binary medical tests," Computational Statistics & Data Analysis, Elsevier, vol. 84(C), pages 85-98.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Papageorgiou, Ioulia & Moustaki, Irini, 2019. "Sampling of pairs in pairwise likelihood estimation for latent variable models with categorical observed variables," LSE Research Online Documents on Economics 87592, London School of Economics and Political Science, LSE Library.
    2. K. Florios & I. Moustaki & D. Rizopoulos & V. Vasdekis, 2015. "A modified weighted pairwise likelihood estimator for a class of random effects models," METRON, Springer;Sapienza Università di Roma, vol. 73(2), pages 217-228, August.
    3. Cristiano Varin, 2008. "On composite marginal likelihoods," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 92(1), pages 1-28, February.
    4. Lee Fawcett & David Walshaw, 2014. "Estimating the probability of simultaneous rainfall extremes within a region: a spatial approach," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(5), pages 959-976, May.
    5. Margaux Delporte & Steffen Fieuws & Geert Molenberghs & Geert Verbeke & Simeon Situma Wanyama & Elpis Hatziagorou & Christiane De Boeck, 2022. "A joint normal‐binary (probit) model," International Statistical Review, International Statistical Institute, vol. 90(S1), pages 37-51, December.
    6. Ana-Maria Staicu, 2017. "Interview with Nancy Reid," International Statistical Review, International Statistical Institute, vol. 85(3), pages 381-403, December.
    7. Joe, Harry & Lee, Youngjo, 2009. "On weighting of bivariate margins in pairwise likelihood," Journal of Multivariate Analysis, Elsevier, vol. 100(4), pages 670-685, April.
    8. Ioulia Papageorgiou, 2016. "Sampling from Correlated Populations: Optimal Strategies and Comparison Study," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 78(1), pages 119-151, May.
    9. Bhat, Chandra R., 2011. "The maximum approximate composite marginal likelihood (MACML) estimation of multinomial probit-based unordered response choice models," Transportation Research Part B: Methodological, Elsevier, vol. 45(7), pages 923-939, August.
    10. Hao Bai & Yuan Zhong & Xin Gao & Wei Xu, 2020. "Multivariate Mixed Response Model with Pairwise Composite-Likelihood Method," Stats, MDPI, vol. 3(3), pages 1-18, July.
    11. Myrsini Katsikatsou & Irini Moustaki, 2016. "Pairwise Likelihood Ratio Tests and Model Selection Criteria for Structural Equation Models with Ordinal Variables," Psychometrika, Springer;The Psychometric Society, vol. 81(4), pages 1046-1068, December.
    12. Gourieroux, C. & Monfort, A., 2018. "Composite indirect inference with application to corporate risks," Econometrics and Statistics, Elsevier, vol. 7(C), pages 30-45.
    13. Nuo Xi & Michael W. Browne, 2014. "Contributions to the Underlying Bivariate Normal Method for Factor Analyzing Ordinal Data," Journal of Educational and Behavioral Statistics, , vol. 39(6), pages 583-611, December.
    14. Büscher, Sebastian & Bauer, Dietmar, 2024. "Weighting strategies for pairwise composite marginal likelihood estimation in case of unbalanced panels and unaccounted autoregressive structure of the errors," Transportation Research Part B: Methodological, Elsevier, vol. 181(C).
    15. Ipek Sener & Chandra Bhat, 2012. "Flexible spatial dependence structures for unordered multinomial choice models: formulation and application to teenagers’ activity participation," Transportation, Springer, vol. 39(3), pages 657-683, May.
    16. Paleti, Rajesh & Bhat, Chandra R., 2013. "The composite marginal likelihood (CML) estimation of panel ordered-response models," Journal of choice modelling, Elsevier, vol. 7(C), pages 24-43.
    17. Emil Aas Stoltenberg & Nils Lid Hjort, 2021. "Models and inference for on–off data via clipped Ornstein–Uhlenbeck processes," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(3), pages 908-929, September.
    18. Stanislav Anatolyev & Renat Khabibullin & Artem Prokhorov, 2012. "Reconstructing high dimensional dynamic distributions from distributions of lower dimension," Working Papers 12003, Concordia University, Department of Economics.
    19. N. Withanage & A.R. de Leon & C.J. Rudnisky, 2014. "Joint estimation of disease-specific sensitivities and specificities in reader-based multi-disease diagnostic studies of paired organs," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(10), pages 2282-2297, October.
    20. Paik, Jane & Ying, Zhiliang, 2012. "A composite likelihood approach for spatially correlated survival data," Computational Statistics & Data Analysis, Elsevier, vol. 56(1), pages 209-216, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:stapro:v:81:y:2011:i:7:p:892-901. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.