IDEAS home Printed from https://ideas.repec.org/a/spr/stmapp/v32y2023i4d10.1007_s10260-023-00686-1.html
   My bibliography  Save this article

Generalized residuals and outlier detection for ordinal data with challenging data structures

Author

Listed:
  • Maria Iannario

    (University of Naples Federico II)

  • Anna Clara Monti

    (University of Sannio)

Abstract

Motivated by the analysis of rating data concerning perceived health status, a crucial variable in biomedical, economic and life insurance models, the paper deals with diagnostic procedures for identifying anomalous and/or influential observations in ordinal response models with challenging data structures. Deviations due to some respondents’ atypical behavior, outlying covariates and gross errors may affect the reliability of likelihood based inference, especially when non robust link functions are adopted. The present paper investigates and exploits the properties of the generalized residuals. They appear in the estimating equations of the regression coefficients and hold the remarkable characteristic of interacting with the covariates in the same fashion as the linear regression residuals. Identification of statistical units incoherent with the model can be achieved by the analysis of the residuals produced by maximum likelihood or robust M-estimation, while the inspection of the weights generated by M-estimation allows to identify influential data. Simple guidelines are proposed to this end, which disclose information on the data structure. The purpose is twofold: recognizing statistical units that deserve specific attention for their peculiar features, and being aware of the sensitivity of the fitted model to small changes in the sample. In the analysis of the self-perceived health status, extreme design points associated with incoherent responses produce highly influential observations. The diagnostic procedures identify the outliers and assess their influence.

Suggested Citation

  • Maria Iannario & Anna Clara Monti, 2023. "Generalized residuals and outlier detection for ordinal data with challenging data structures," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 32(4), pages 1197-1216, October.
  • Handle: RePEc:spr:stmapp:v:32:y:2023:i:4:d:10.1007_s10260-023-00686-1
    DOI: 10.1007/s10260-023-00686-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10260-023-00686-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10260-023-00686-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Dungang Liu & Heping Zhang, 2018. "Residuals and Diagnostics for Ordinal Regression Models: A Surrogate Approach," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(522), pages 845-854, April.
    2. Maria Iannario & Anna Clara Monti & Domenico Piccolo, 2016. "Robustness issues for cub models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(4), pages 731-750, December.
    3. Parminder Raina & Vicki Torrance-Rynard & Micheline Wong & Christel Woodward, 2002. "Agreement between Self-Reported and Routinely Collected Health Care Utilisation Data among Seniors," Social and Economic Dimensions of an Aging Population Research Papers 81, McMaster University.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Domenico Piccolo & Rosaria Simone, 2019. "The class of cub models: statistical foundations, inferential issues and empirical evidence," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(3), pages 389-435, September.
    2. Buckley, Neil J. & Denton, Frank T. & Leslie Robb, A. & Spencer, Byron G., 2004. "The transition from good to poor health: an econometric study of the older population," Journal of Health Economics, Elsevier, vol. 23(5), pages 1013-1034, September.
    3. Lemieux, Ashley J. & Roy, Laurence & Martin, Michael S. & Latimer, Eric A. & Crocker, Anne G., 2017. "Justice involvement among homeless individuals with mental illnesses: Are self-report and administrative measures comparable?," Evaluation and Program Planning, Elsevier, vol. 61(C), pages 86-95.
    4. Jiannan Lu & Yunshu Zhang & Peng Ding, 2020. "Sharp bounds on the relative treatment effect for ordinal outcomes," Biometrics, The International Biometric Society, vol. 76(2), pages 664-669, June.
    5. Hendrik Jürges & Vincent Pohl, 2012. "Medical guidelines, physician density, and quality of care: evidence from German SHARE data," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 13(5), pages 635-649, October.
    6. Weirong Li & Wensheng Zhu, 2024. "Subgroup analysis with concave pairwise fusion penalty for ordinal response," Statistical Papers, Springer, vol. 65(6), pages 3327-3355, August.
    7. Lebrun, Lydie A., 2012. "Effects of length of stay and language proficiency on health care experiences among Immigrants in Canada and the United States," Social Science & Medicine, Elsevier, vol. 74(7), pages 1062-1072.
    8. Neil J. Buckley & Frank T. Denton & A. Leslie Robb & Byron G. Spencer, 2006. "Socio-economic Influences on the Health of Older Canadians: Estimates Based on Two Longitudinal Surveys," Canadian Public Policy, University of Toronto Press, vol. 32(1), pages 59-84, March.
    9. Jean‐Sauveur Ay, 2021. "The Informational Content of Geographical Indications," American Journal of Agricultural Economics, John Wiley & Sons, vol. 103(2), pages 523-542, March.
    10. Marlies T van Dalen & Jacqueline J Suijker & Janet MacNeil-Vroomen & Marjon van Rijn & Eric P Moll van Charante & Sophia E de Rooij & Bianca M Buurman, 2014. "Self-Report of Healthcare Utilization among Community-Dwelling Older Persons: A Prospective Cohort Study," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-11, April.
    11. Maria Iannario & Anna Clara Monti, 2022. "Modelling consumer perceptions of service quality for urban public transport systems using statistical models for ordinal data," METRON, Springer;Sapienza Università di Roma, vol. 80(1), pages 61-76, April.
    12. Rosaria Simone, 2023. "Uncertainty Diagnostics of Binomial Regression Trees for Ordered Rating Data," Journal of Classification, Springer;The Classification Society, vol. 40(1), pages 79-105, April.
    13. Zewei Lin & Dungang Liu, 2022. "Model diagnostics of discrete data regression: a unifying framework using functional residuals," Papers 2207.04299, arXiv.org.
    14. Nicola Pontarollo & Mercy Orellana & Joselin Segovia, 2020. "The Determinants of Subjective Well-Being in a Developing Country: The Ecuadorian Case," Journal of Happiness Studies, Springer, vol. 21(8), pages 3007-3035, December.
    15. Roberto Colombi & Sabrina Giordano, 2019. "Likelihood-based tests for a class of misspecified finite mixture models for ordinal categorical data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(4), pages 1175-1202, December.
    16. Jochen Ranger & Kay Brauer, 2022. "On the Generalized S − X 2 –Test of Item Fit: Some Variants, Residuals, and a Graphical Visualization," Journal of Educational and Behavioral Statistics, , vol. 47(2), pages 202-230, April.
    17. Chowdhury, K.P., 2023. "Nonparametric functional analysis under joint estimation with applications to identifying highly cited papers," Journal of Informetrics, Elsevier, vol. 17(4).
    18. Emma Wallace & Frank Moriarty & Christine McGarrigle & Susan M Smith & Rose-Anne Kenny & Tom Fahey, 2018. "Self-report versus electronic medical record recorded healthcare utilisation in older community-dwelling adults: Comparison of two prospective cohort studies," PLOS ONE, Public Library of Science, vol. 13(10), pages 1-13, October.
    19. Yang Zhao & Lin Zhang & Yu Fu & Minyu Wang & Luwen Zhang, 2020. "Socioeconomic Disparities in Cancer Treatment, Service Utilization and Catastrophic Health Expenditure in China: A Cross-Sectional Analysis," IJERPH, MDPI, vol. 17(4), pages 1-13, February.
    20. Ejike R. Ugba & Daniel Mörlein & Jan Gertheiss, 2021. "Smoothing in Ordinal Regression: An Application to Sensory Data," Stats, MDPI, vol. 4(3), pages 1-18, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stmapp:v:32:y:2023:i:4:d:10.1007_s10260-023-00686-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.