IDEAS home Printed from https://ideas.repec.org/a/eee/stapro/v81y2011i7p892-901.html
   My bibliography  Save this article

Pseudo-likelihood methodology for partitioned large and complex samples

Author

Listed:
  • Molenberghs, Geert
  • Verbeke, Geert
  • Iddi, Samuel

Abstract

Large data sets, either coming from a large number of independent replications, or because of hierarchies in the data with large numbers of within-unit replication, may pose challenges to the data analyst up to the point of making conventional inferential methods, such as maximum likelihood, prohibitive. Based on general pseudo-likelihood concepts, we propose a method to partition such a set of data, analyze each partition member, and properly combine the inferences into a single one. It is shown that the method is fully efficient for independent partitions, while with dependent sub-samples efficiency is sometimes but not always equal to one. It is argued that, for important realistic settings, efficiency is often very high. Illustrative examples enhance insight in the method's operation, while real-data analysis underscores its power for practice.

Suggested Citation

  • Molenberghs, Geert & Verbeke, Geert & Iddi, Samuel, 2011. "Pseudo-likelihood methodology for partitioned large and complex samples," Statistics & Probability Letters, Elsevier, vol. 81(7), pages 892-901, July.
  • Handle: RePEc:eee:stapro:v:81:y:2011:i:7:p:892-901
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-7152(11)00019-8
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Steffen Fieuws & Geert Verbeke, 2006. "Pairwise Fitting of Mixed Models for the Joint Modeling of Multivariate Longitudinal Profiles," Biometrics, The International Biometric Society, vol. 62(2), pages 424-431, June.
    2. Cristiano Varin, 2008. "On composite marginal likelihoods," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 92(1), pages 1-28, February.
    3. Steffen Fieuws & Geert Verbeke & Filip Boen & Christophe Delecluse, 2006. "High dimensional multivariate mixed models for binary questionnaire data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 55(4), pages 449-460, August.
    4. D. R. Cox, 2004. "A note on pseudolikelihood constructed from marginal densities," Biometrika, Biometrika Trust, vol. 91(3), pages 729-737, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Craiu, Radu V. & Duchesne, Thierry, 2018. "A scalable and efficient covariate selection criterion for mixed effects regression models with unknown random effects structure," Computational Statistics & Data Analysis, Elsevier, vol. 117(C), pages 154-161.
    2. Hillary Koch & Cheryl A. Keller & Guanjue Xiang & Belinda Giardine & Feipeng Zhang & Yicheng Wang & Ross C. Hardison & Qunhua Li, 2022. "CLIMB: High-dimensional association detection in large scale genomic data," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    3. Liu, Wei & Zhang, Bo & Zhang, Zhiwei & Chen, Baojiang & Zhou, Xiao-Hua, 2015. "A pseudo-likelihood approach for estimating diagnostic accuracy of multiple binary medical tests," Computational Statistics & Data Analysis, Elsevier, vol. 84(C), pages 85-98.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cristiano Varin, 2008. "On composite marginal likelihoods," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 92(1), pages 1-28, February.
    2. Papageorgiou, Ioulia & Moustaki, Irini, 2019. "Sampling of pairs in pairwise likelihood estimation for latent variable models with categorical observed variables," LSE Research Online Documents on Economics 87592, London School of Economics and Political Science, LSE Library.
    3. K. Florios & I. Moustaki & D. Rizopoulos & V. Vasdekis, 2015. "A modified weighted pairwise likelihood estimator for a class of random effects models," METRON, Springer;Sapienza Università di Roma, vol. 73(2), pages 217-228, August.
    4. Ioulia Papageorgiou, 2016. "Sampling from Correlated Populations: Optimal Strategies and Comparison Study," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 78(1), pages 119-151, May.
    5. Bhat, Chandra R., 2011. "The maximum approximate composite marginal likelihood (MACML) estimation of multinomial probit-based unordered response choice models," Transportation Research Part B: Methodological, Elsevier, vol. 45(7), pages 923-939, August.
    6. Paik, Jane & Ying, Zhiliang, 2012. "A composite likelihood approach for spatially correlated survival data," Computational Statistics & Data Analysis, Elsevier, vol. 56(1), pages 209-216, January.
    7. Jeonghye Choi & David R. Bell & Leonard M. Lodish, 2012. "Traditional and IS-Enabled Customer Acquisition on the Internet," Management Science, INFORMS, vol. 58(4), pages 754-769, April.
    8. Vassilis Vasdekis & Silvia Cagnone & Irini Moustaki, 2012. "A Composite Likelihood Inference in Latent Variable Models for Ordinal Longitudinal Responses," Psychometrika, Springer;The Psychometric Society, vol. 77(3), pages 425-441, July.
    9. Lee Fawcett & David Walshaw, 2014. "Estimating the probability of simultaneous rainfall extremes within a region: a spatial approach," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(5), pages 959-976, May.
    10. Qiurong Cui & Zhengjun Zhang, 2018. "Max-Linear Competing Factor Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(1), pages 62-74, January.
    11. Hao Bai & Yuan Zhong & Xin Gao & Wei Xu, 2020. "Multivariate Mixed Response Model with Pairwise Composite-Likelihood Method," Stats, MDPI, vol. 3(3), pages 1-18, July.
    12. Hillary Koch & Cheryl A. Keller & Guanjue Xiang & Belinda Giardine & Feipeng Zhang & Yicheng Wang & Ross C. Hardison & Qunhua Li, 2022. "CLIMB: High-dimensional association detection in large scale genomic data," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    13. Bhat, Chandra R. & Sener, Ipek N. & Eluru, Naveen, 2010. "A flexible spatially dependent discrete choice model: Formulation and application to teenagers' weekday recreational activity participation," Transportation Research Part B: Methodological, Elsevier, vol. 44(8-9), pages 903-921, September.
    14. Nobel, Anne & Lizin, Sebastien & Malina, Robert, 2023. "What drives the designation of protected areas? Accounting for spatial dependence using a composite marginal likelihood approach," Ecological Economics, Elsevier, vol. 205(C).
    15. Myrsini Katsikatsou & Irini Moustaki, 2016. "Pairwise Likelihood Ratio Tests and Model Selection Criteria for Structural Equation Models with Ordinal Variables," Psychometrika, Springer;The Psychometric Society, vol. 81(4), pages 1046-1068, December.
    16. Alexander Robitzsch, 2024. "A Comparison of Limited Information Estimation Methods for the Two-Parameter Normal-Ogive Model with Locally Dependent Items," Stats, MDPI, vol. 7(3), pages 1-16, June.
    17. Gourieroux, C. & Monfort, A., 2018. "Composite indirect inference with application to corporate risks," Econometrics and Statistics, Elsevier, vol. 7(C), pages 30-45.
    18. Margaux Delporte & Steffen Fieuws & Geert Molenberghs & Geert Verbeke & Simeon Situma Wanyama & Elpis Hatziagorou & Christiane De Boeck, 2022. "A joint normal‐binary (probit) model," International Statistical Review, International Statistical Institute, vol. 90(S1), pages 37-51, December.
    19. Nuo Xi & Michael W. Browne, 2014. "Contributions to the Underlying Bivariate Normal Method for Factor Analyzing Ordinal Data," Journal of Educational and Behavioral Statistics, , vol. 39(6), pages 583-611, December.
    20. Stanislav Anatolyev & Renat Khabibullin & Artem Prokhorov, 2012. "Reconstructing high dimensional dynamic distributions from distributions of lower dimension," Working Papers 12003, Concordia University, Department of Economics.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:stapro:v:81:y:2011:i:7:p:892-901. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.