IDEAS home Printed from https://ideas.repec.org/a/vrs/offsta/v37y2021i1p97-119n2.html
   My bibliography  Save this article

Identifying Outliers in Response Quality Assessment by Using Multivariate Control Charts Based on Kernel Density Estimation

Author

Listed:
  • Jin Jiayun

    (Catholic University of Leuven, Centre for Sociological Research, Parkstraat 45, bus 3601, 3000 Leuven, Belgium.)

  • Loosveldt Geert

    (Catholic University of Leuven, Centre for Sociological Research, Parkstraat 45, bus 3601, 3000 Leuven, Belgium.)

Abstract

When monitoring industrial processes, a Statistical Process Control tool, such as a multivariate Hotelling T2 chart is frequently used to evaluate multiple quality characteristics. However, research into the use of T2 charts for survey fieldwork–essentially a production process in which data sets collected by means of interviews are produced–has been scant to date. In this study, using data from the eighth round of the European Social Survey in Belgium, we present a procedure for simultaneously monitoring six response quality indicators and identifying outliers: interviews with anomalous results. The procedure integrates Kernel Density Estimation (KDE) with a T2 chart, so that historical “in-control” data or reference to the assumption of a parametric distribution of the indicators is not required. In total, 75 outliers (4.25%) are iteratively removed, resulting in an in-control data set containing 1,691 interviews. The outliers are mainly characterized by having longer sequences of identical answers, a greater number of extreme answers, and against expectation, a lower item nonresponse rate. The procedure is validated by means of ten-fold cross-validation and comparison with the minimum covariance determinant algorithm as the criterion. By providing a method of obtaining in-control data, the present findings go some way toward a way to monitor response quality, identify problems, and provide rapid feedbacks during survey fieldwork.

Suggested Citation

  • Jin Jiayun & Loosveldt Geert, 2021. "Identifying Outliers in Response Quality Assessment by Using Multivariate Control Charts Based on Kernel Density Estimation," Journal of Official Statistics, Sciendo, vol. 37(1), pages 97-119, March.
  • Handle: RePEc:vrs:offsta:v:37:y:2021:i:1:p:97-119:n:2
    DOI: 10.2478/jos-2021-0005
    as

    Download full text from publisher

    File URL: https://doi.org/10.2478/jos-2021-0005
    Download Restriction: no

    File URL: https://libkey.io/10.2478/jos-2021-0005?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:vrs:offsta:v:37:y:2021:i:1:p:97-119:n:2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.sciendo.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.