IDEAS home Printed from https://ideas.repec.org/p/ehl/lserod/89707.html
   My bibliography  Save this paper

Analysis of categorical data for complex surveys

Author

Listed:
  • Skinner, Chris J.

Abstract

This paper reviews methods for handling complex sampling schemes when analysing categorical survey data. It is generally assumed that the complex sampling scheme does not affect the specification of the parameters of interest, only the methodology for making inference about these parameters. The organisation of the paper is loosely chronological. Contingency table data is emphasized first before moving on to the analysis of unit-level data. Weighted least squares methods, introduced in the mid 1970s along with methods for two-way tables, receive early attention. They are followed by more general methods based on maximum likelihood, particularly pseudo maximum likelihood estimation. Point estimation methods typically involve the use of survey weights in some way. Variance estimation methods are described in broad terms. There is a particular emphasis on methods of testing. The main modelling methods considered are log-linear models, logit models, generalized linear models and latent variable models. There is no coverage of multilevel models.

Suggested Citation

  • Skinner, Chris J., 2018. "Analysis of categorical data for complex surveys," LSE Research Online Documents on Economics 89707, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:89707
    as

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/89707/
    File Function: Open access version.
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Patterson B.H. & Dayton C.M. & Graubard B.I., 2002. "Latent Class Analysis of Complex Sample Survey Data: Application to Dietary Data," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 721-741, September.
    2. Skinner, Chris J. & Vallet, L.-A., 2010. "Fitting log-linear models to contingency tables from surveys with complex sampling designs: an investigation of the Clogg-Eliason approach," LSE Research Online Documents on Economics 39118, London School of Economics and Political Science, LSE Library.
    3. Sundar Natarajan & Stuart R. Lipsitz & Garrett M. Fitzmaurice & Debajyoti Sinha & Joseph G. Ibrahim & Jennifer Haas & Walid Gellad, 2012. "An extension of the Wilcoxon rank sum test for complex sample survey data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 61(4), pages 653-664, August.
    4. Clifford C. Clogg & Scott R. Eliason, 1987. "Some Common Problems in Log-Linear Analysis," Sociological Methods & Research, , vol. 16(1), pages 8-44, August.
    5. Chris Skinner & Louis-André Vallet, 2010. "Fitting Log-Linear Models to Contingency Tables From Surveys With Complex Sampling Designs: An Investigation of the Clogg-Eliason Approach," Sociological Methods & Research, , vol. 39(1), pages 83-108, August.
    6. Alastair Scott & Chris Wild, 2002. "On the robustness of weighted methods for fitting models to case–control data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(2), pages 207-219, May.
    7. Oberski, Daniel, 2014. "lavaan.survey: An R Package for Complex Survey Analysis of Structural Equation Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 57(i01).
    8. C., E. A. Molina & Skinner, C. J., 1992. "Pseudo-likelihood and quasi-likelihood estimation for complex sampling schemes," Computational Statistics & Data Analysis, Elsevier, vol. 13(4), pages 395-405, May.
    9. Yan Li & Barry I. Graubard & Ralph DiGaetano, 2011. "Weighting methods for population‐based case–control studies with complex sampling," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 60(2), pages 165-185, March.
    10. Anders Christoffersson, 1975. "Factor analysis of dichotomized variables," Psychometrika, Springer;The Psychometric Society, vol. 40(1), pages 5-32, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Skinner, Chris J. & Vallet, L.-A., 2010. "Fitting log-linear models to contingency tables from surveys with complex sampling designs: an investigation of the Clogg-Eliason approach," LSE Research Online Documents on Economics 39118, London School of Economics and Political Science, LSE Library.
    2. Anthony Evans & Willem Sleegers & Žan Mlakar, 2020. "Individual differences in receptivity to scientific bullshit," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 15(3), pages 401-412, May.
    3. Pezzuti, Lina & Tommasi, Marco & Saggino, Aristide & Dawe, James & Lauriola, Marco, 2020. "Gender differences and measurement bias in the assessment of adult intelligence: Evidence from the Italian WAIS-IV and WAIS-R standardizations," Intelligence, Elsevier, vol. 79(C).
    4. Reynolds, J.P. & Pilling, M. & Marteau, T.M., 2018. "Communicating quantitative evidence of policy effectiveness and support for the policy: Three experimental studies," Social Science & Medicine, Elsevier, vol. 218(C), pages 1-12.
    5. Bas Donkers & Richard Paap & Jedid‐Jah Jonker & Philip Hans Franses, 2006. "Deriving target selection rules from endogenously selected samples," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 21(5), pages 549-562, July.
    6. Yang Yixin & Lü Xin & Ma Jian & Qiao Han, 2014. "A Robust Factor Analysis Model for Dichotomous Data," Journal of Systems Science and Information, De Gruyter, vol. 2(5), pages 437-450, October.
    7. Ana Isabel Maldonado & Carol B. Cunradi & Anna María Nápoles, 2020. "Racial/Ethnic Discrimination and Intimate Partner Violence Perpetration in Latino Men: The Mediating Effects of Mental Health," IJERPH, MDPI, vol. 17(21), pages 1-17, November.
    8. Kunz, Werner H. & Hogreve, Jens, 2011. "Toward a deeper understanding of service marketing: The past, the present, and the future," International Journal of Research in Marketing, Elsevier, vol. 28(3), pages 231-247.
    9. Wu, Jianmin & Bentler, Peter M., 2013. "Limited information estimation in binary factor analysis: A review and extension," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 392-403.
    10. Steffen Grønneberg & Jonas Moss & Njål Foldnes, 2020. "Partial Identification of Latent Correlations with Binary Data," Psychometrika, Springer;The Psychometric Society, vol. 85(4), pages 1028-1051, December.
    11. Sheppard, Leah D. & O'Reilly, Jane & van Dijke, Marius & Restubog, Simon Lloyd D. & Aquino, Karl, 2020. "The stress-relieving benefits of positively experienced social sexual behavior in the workplace," Organizational Behavior and Human Decision Processes, Elsevier, vol. 156(C), pages 38-52.
    12. Sun, Bindong & Liu, Jiahang & Yin, Chun & Cao, Jason, 2022. "Residential and workplace neighborhood environments and life satisfaction: Exploring chain-mediation effects of activity and place satisfaction," Journal of Transport Geography, Elsevier, vol. 104(C).
    13. Dylan Molenaar, 2015. "Heteroscedastic Latent Trait Models for Dichotomous Data," Psychometrika, Springer;The Psychometric Society, vol. 80(3), pages 625-644, September.
    14. Robert Cudeck & Kelli Klebe & Susan Henly, 1993. "A simple Gauss-Newton procedure for covariance structure analysis with high-level computer languages," Psychometrika, Springer;The Psychometric Society, vol. 58(2), pages 211-232, June.
    15. Meyers, Maria Christina & van Woerkom, Marianne & Bauwens, Robin, 2023. "Stronger together: A multilevel study of collective strengths use and team performance," Journal of Business Research, Elsevier, vol. 159(C).
    16. Benjamin Motte-Baumvol & Olivier Bonin, 2018. "The spatial dimensions of immobility in France," Transportation, Springer, vol. 45(5), pages 1231-1247, September.
    17. Lingxiao Wang & Barry I. Graubard & Hormuzd A. Katki & and Yan Li, 2020. "Improving external validity of epidemiologic cohort analyses: a kernel weighting approach," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(3), pages 1293-1311, June.
    18. Pilar Rivera & Albert Satorra, 2000. "Country effects in ISSP-1993 environmental data: Comparison of SEM approaches," Economics Working Papers 458, Department of Economics and Business, Universitat Pompeu Fabra.
    19. Ann Owen & Julio Videras, 2009. "Reconsidering social capital: a latent class approach," Empirical Economics, Springer, vol. 37(3), pages 555-582, December.
    20. Marcello D’Orazio, 2010. "Evaluating reliability of combined responses through latent class models," Rivista di statistica ufficiale, ISTAT - Italian National Institute of Statistics - (Rome, ITALY), vol. 12(1), pages 6-16, April.

    More about this item

    Keywords

    pseudo maximum likelihood; Rao-Scott adjustment; score test; survey weight; weighted least squares;
    All these keywords.

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:89707. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: LSERO Manager (email available below). General contact details of provider: https://edirc.repec.org/data/lsepsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.