IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v103y2016icp170-192.html
   My bibliography  Save this article

Confidence intervals for an ordinal effect size measure based on partially validated series

Author

Listed:
  • Qiu, Shi-Fang
  • Poon, Wai-Yin
  • Tang, Man-Lai

Abstract

An ordinal effect size measure is used to assess whether one variable is stochastically larger than the other; therefore, this measure is a useful means by which to describe the difference between two ordinal categorical distributions. In practical analysis, it is desirable to obtain data only by a gold standard test, but such tests are often limited due to high costs or ethical considerations, especially in medical research applications. However, misclassification can arise when a cheaper, faster, and/or non-invasive, but fallible test is used to collect data. The use of partially validated data obtained by double sampling has become a popular compromise between these two approaches. In this study, we develop twelve estimators of the confidence interval (CI) for an ordinal effect size measure based on partially validated data. The performance of the proposed CIs are evaluated by simulation studies in terms of the empirical coverage probability, the empirical coverage width, and the ratio of the mesial non-coverage probability and non-coverage probability. Simulation results show that the Wald CI on logit scale, the Bootstrap percentile CI and the Bias-corrected Bootstrap normal CI have outstanding performance even in small sample designs. When sample sizes are moderate, all CIs except the Wald, Bias-corrected Bootstrap percentile and logit-transformation-based Bootstrap percentile-t CIs demonstrate good coverage properties. Moreover, all CIs perform well when sample sizes are large. All methods are illustrated by analyzing a real data set from a research study of highway safety on automobile accidents.

Suggested Citation

  • Qiu, Shi-Fang & Poon, Wai-Yin & Tang, Man-Lai, 2016. "Confidence intervals for an ordinal effect size measure based on partially validated series," Computational Statistics & Data Analysis, Elsevier, vol. 103(C), pages 170-192.
  • Handle: RePEc:eee:csdana:v:103:y:2016:i:c:p:170-192
    DOI: 10.1016/j.csda.2016.05.006
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947316301086
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2016.05.006?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lee, Seung-Chun, 2006. "Interval estimation of binomial proportions based on weighted Polya posterior," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 1012-1021, November.
    2. Ng, Kai Wang & Tang, Man-Lai & Tan, Ming & Tian, Guo-Liang, 2008. "Grouped Dirichlet distribution: A new tool for incomplete categorical data analysis," Journal of Multivariate Analysis, Elsevier, vol. 99(3), pages 490-509, March.
    3. Wai-Yin Poon & Hai-Bin Wang, 2010. "Bayesian Analysis of Multivariate Probit Models with Surrogate Outcome Data," Psychometrika, Springer;The Psychometric Society, vol. 75(3), pages 498-520, September.
    4. Glen Meeden, 1999. "Interval estimators for the population mean for skewed distributions with a small sample size," Journal of Applied Statistics, Taylor & Francis Journals, vol. 26(1), pages 81-96.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tang, Man-Lai & Qiu, Shi-Fang & Poon, Wai-Yin, 2012. "Confidence interval construction for disease prevalence based on partial validation series," Computational Statistics & Data Analysis, Elsevier, vol. 56(5), pages 1200-1220.
    2. Ongaro, A. & Migliorati, S., 2013. "A generalization of the Dirichlet distribution," Journal of Multivariate Analysis, Elsevier, vol. 114(C), pages 412-426.
    3. Liu, Feng & Zhao, Shaoqiong & Li, Yang, 2017. "How many, how often, and how new? A multivariate profiling of mobile app users," Journal of Retailing and Consumer Services, Elsevier, vol. 38(C), pages 71-80.
    4. Jamotton, Charlotte & Hainaut, Donatien, 2024. "Latent Dirichlet Allocation for structured insurance data," LIDAM Discussion Papers ISBA 2024008, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    5. Li, Huiqiong & Tian, Guoliang & Tang, Niansheng & Cao, Hongyuan, 2018. "Assessing non-inferiority for incomplete paired-data under non-ignorable missing mechanism," Computational Statistics & Data Analysis, Elsevier, vol. 127(C), pages 69-81.
    6. Tian, Guo-Liang & Tang, Man-Lai & Yuen, Kam Chuen & Ng, Kai Wang, 2010. "Further properties and new applications of the nested Dirichlet distribution," Computational Statistics & Data Analysis, Elsevier, vol. 54(2), pages 394-405, February.
    7. Surupa Roy & Kalyan Das & Angshuman Sarkar, 2013. "Analysis of binary data with the possibility of wrong ascertainment," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 67(3), pages 293-310, August.
    8. Rana, Subrata & Roy, Surupa & Das, Kalyan, 2018. "Analysis of ordinal longitudinal data under nonignorable missingness and misreporting: An application to Alzheimer’s disease study," Journal of Multivariate Analysis, Elsevier, vol. 166(C), pages 62-77.
    9. Lee, Seung-Chun, 2006. "Interval estimation of binomial proportions based on weighted Polya posterior," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 1012-1021, November.
    10. Nguyen, H.D. & Gouno, E., 2020. "Bayesian inference for Common cause failure rate based on causal inference with missing data," Reliability Engineering and System Safety, Elsevier, vol. 197(C).
    11. Lizbeth Naranjo & Luz Judith R. Esparza & Carlos J. Pérez, 2020. "A Hidden Markov Model to Address Measurement Errors in Ordinal Response Scale and Non-Decreasing Process," Mathematics, MDPI, vol. 8(4), pages 1-12, April.
    12. Tong-Yu Lu & Wai-Yin Poon & Siu Cheung, 2014. "A Unified Framework for the Comparison of Treatments with Ordinal Responses," Psychometrika, Springer;The Psychometric Society, vol. 79(4), pages 605-620, October.
    13. Debjit Sengupta & Tathagata Banerjee & Surupa Roy, 2020. "Estimation of Poisson mean with under‐reported counts: a double sampling approach," Australian & New Zealand Journal of Statistics, Australian Statistical Publishing Association Inc., vol. 62(4), pages 508-535, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:103:y:2016:i:c:p:170-192. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.