IDEAS home Printed from https://ideas.repec.org/a/sae/somere/v24y1996i4p492-516.html
   My bibliography  Save this article

Bootstrapping Goodness-of-Fit Measures in Categorical Data Analysis

Author

Listed:
  • ROLF LANGEHEINE

    (University of Kiel, Germany)

  • JEROEN PANNEKOEK

    (Statistics Netherlands)

  • FRANK VAN DE POL

    (Statistics Netherlands)

Abstract

When sparse data have to be fitted to a log-linear or latent class model, one cannot use the theoretical chi-square distribution to evaluate model fit, because with sparse data the observed cross-table has too many cells in relation to the number of observations to use a distribution that only holds asymptotically. The choice of a theoretical distribution is also difficult when model-expected frequencies are 0 or when model probabilities are estimated 0 or 1. The authors propose to solve these problems by estimating the distribution of a fit measure, using bootstrap methods. An algorithm is presented for estimating this distribution by drawing bootstrap samples from the model-expected proportions, the so-called nonnaive bootstrap method. For the first time the method is applied to empirical data of varying sparseness, from five different data sets. Results show that the asymptotic chi-square distribution is not at all valid for sparse data.

Suggested Citation

  • Rolf Langeheine & Jeroen Pannekoek & Frank Van De Pol, 1996. "Bootstrapping Goodness-of-Fit Measures in Categorical Data Analysis," Sociological Methods & Research, , vol. 24(4), pages 492-516, May.
  • Handle: RePEc:sae:somere:v:24:y:1996:i:4:p:492-516
    DOI: 10.1177/0049124196024004004
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0049124196024004004
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0049124196024004004?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Erling Andersen, 1985. "Estimating latent correlations between repeated testings," Psychometrika, Springer;The Psychometric Society, vol. 50(1), pages 3-16, March.
    2. Vermunt, J.K., 1993. "lEM : Log-linear and event history analysis with missing data using the EM algorithm," WORC Paper 93.09.015/7, Tilburg University, Work and Organization Research Centre.
    3. Ulrich Rendtel & Rolf Langeheine & Roland Berntsen, 1992. "Mobilitätsprozesse und Einkommensarmut: erfragtes und errechnetes Haushaltseinkommen im Vergleich," Discussion Papers of DIW Berlin 56, DIW Berlin, German Institute for Economic Research.
    4. Gert G. Wagner & Jürgen Schupp & Ulrich Rendtel, 1991. "The Socio-Economic Panel (SOEP) for Germany - Methods of Production and Management of Longitudinal Data," Discussion Papers of DIW Berlin 31a, DIW Berlin, German Institute for Economic Research.
    5. Davis, Charles S., 1993. "The computer generation of multinomial random variates," Computational Statistics & Data Analysis, Elsevier, vol. 16(2), pages 205-217, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Pasquale Anselmi & Egidio Robusto & Luca Stefanutti & Debora Chiusole, 2016. "An Upgrading Procedure for Adaptive Assessment of Knowledge," Psychometrika, Springer;The Psychometric Society, vol. 81(2), pages 461-482, June.
    2. Fetene B. Tekle & Dereje W. Gudicha & Jeroen K. Vermunt, 2016. "Power analysis for the bootstrap likelihood ratio test for the number of classes in latent class models," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 10(2), pages 209-224, June.
    3. Paul S. Albert & Lisa M. McShane & Joanna H. Shih, 2001. "Latent Class Modeling Approaches for Assessing Diagnostic Error without a Gold Standard: With Applications to p53 Immunohistochemical Assays in Bladder Tumors," Biometrics, The International Biometric Society, vol. 57(2), pages 610-619, June.
    4. Huiping Xu & Bruce A. Craig, 2009. "A Probit Latent Class Model with General Correlation Structures for Evaluating Accuracy of Diagnostic Tests," Biometrics, The International Biometric Society, vol. 65(4), pages 1145-1155, December.
    5. Li, Xiaoshu & Boyle, Kevin J. & Holmes, Thomas P. & LaRouche, Genevieve Pullis, 2014. "The effect of on-site forest experience on stated preferences for low-impact timber harvesting programs," Journal of Forest Economics, Elsevier, vol. 20(4), pages 348-362.
    6. Matthew J. Madison & Laine P. Bradshaw, 2018. "Assessing Growth in a Diagnostic Classification Model Framework," Psychometrika, Springer;The Psychometric Society, vol. 83(4), pages 963-990, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christoph F. Büchtemann & Jürgen Schupp, 1992. "Repercussions of Reunification: Patterns and Trends in the Socio-Economic Transformation of East Germany," Discussion Papers of DIW Berlin 44, DIW Berlin, German Institute for Economic Research.
    2. Chun Wang & Steven W. Nydick, 2020. "On Longitudinal Item Response Theory Models: A Didactic," Journal of Educational and Behavioral Statistics, , vol. 45(3), pages 339-368, June.
    3. Dirven, H.J. & Fouarge, D.J.A.G., 1996. "Income mobility and deprivation dynamics among the elderly in Belgium and the Netherlands," WORC Paper 96.05.005/2, Tilburg University, Work and Organization Research Centre.
    4. Licht, Georg & Steiner, Viktor, 1993. "Assimilation, labour market experience, and earnings profiles of temporary and permanent immigrant workers in germany," ZEW Discussion Papers 93-06, ZEW - Leibniz Centre for European Economic Research.
    5. Sun-Joo Cho & Allan Cohen & Brian Bottge, 2013. "Detecting Intervention Effects Using a Multilevel Latent Transition Analysis with a Mixture IRT Model," Psychometrika, Springer;The Psychometric Society, vol. 78(3), pages 576-600, July.
    6. Robert Gibbons & R. Bock, 1987. "Trend in correlated proportions," Psychometrika, Springer;The Psychometric Society, vol. 52(1), pages 113-124, March.
    7. Dimitris Pavlopoulos & Katja Chkalova, 2022. "Short-time work: A bridge to employment security or a springboard to unemployment?," Economic and Industrial Democracy, Department of Economic History, Uppsala University, Sweden, vol. 43(1), pages 168-197, February.
    8. Peida Zhan & Hong Jiao & Dandan Liao & Feiming Li, 2019. "A Longitudinal Higher-Order Diagnostic Classification Model," Journal of Educational and Behavioral Statistics, , vol. 44(3), pages 251-281, June.
    9. Albert Yu & Jeffrey A. Douglas, 2023. "IRT Models for Learning With Item-Specific Learning Parameters," Journal of Educational and Behavioral Statistics, , vol. 48(6), pages 866-888, December.
    10. Joachim Merz & Rainer Lang, 1997. "Preferred vs. Actual Working Hours - A Ten Years Paneleconometric Analysis for Professions, Entrepreneurs and Employees in Germany," FFB-Discussionpaper 23, Research Institute on Professions (Forschungsinstitut Freie Berufe (FFB)), LEUPHANA University Lüneburg.
    11. Geoffrey Evans & Colin Mills, 1998. "Assessing the Cross-Sex Validity of the Goldthorpe Class Schema Using Log-linear Models with Latent Variables," Quality & Quantity: International Journal of Methodology, Springer, vol. 32(3), pages 275-296, August.
    12. Shiyu Wang & Yan Yang & Steven Andrew Culpepper & Jeffrey A. Douglas, 2018. "Tracking Skill Acquisition With Cognitive Diagnosis Models: A Higher-Order, Hidden Markov Model With Covariates," Journal of Educational and Behavioral Statistics, , vol. 43(1), pages 57-87, February.
    13. Vermunt, J.K., 1994. "Causal log-linear modeling with missing data," WORC Paper 94.05.021/7, Tilburg University, Work and Organization Research Centre.
    14. Ulf Böckenholt & William Dillon, 1997. "Modeling within-subject dependencies in ordinal paired comparison data," Psychometrika, Springer;The Psychometric Society, vol. 62(3), pages 411-434, September.
    15. Kraus, Florian & Steiner, Viktor, 1995. "Modelling heaping effects in unemployment duration models - With an application to retrospective event data in the German socio-economic panel," ZEW Discussion Papers 95-09, ZEW - Leibniz Centre for European Economic Research.
    16. Ulrich Rendtel ; Felix Büchel, 1994. "Tests for Non-ignorable Panel Attrition and Their Application on Wage Estimates from the German Socio-Economic Panel (GSOEP)," Discussion Papers of DIW Berlin 89, DIW Berlin, German Institute for Economic Research.
    17. Matthew J. Madison & Laine P. Bradshaw, 2018. "Assessing Growth in a Diagnostic Classification Model Framework," Psychometrika, Springer;The Psychometric Society, vol. 83(4), pages 963-990, December.
    18. Susan Embretson, 1991. "A multidimensional latent trait model for measuring learning and change," Psychometrika, Springer;The Psychometric Society, vol. 56(3), pages 495-515, September.
    19. Matthias Davier & Xueli Xu & Claus Carstensen, 2011. "Measuring Growth in a Longitudinal Large-Scale Assessment with a General Latent Variable Model," Psychometrika, Springer;The Psychometric Society, vol. 76(2), pages 318-336, April.
    20. Park, Taesung & Davis, Charles S. & Li, Ning, 1998. "Alternative Gee estimation procedures for discrete longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 28(3), pages 243-256, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:somere:v:24:y:1996:i:4:p:492-516. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.