IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v106y2017icp12-26.html
   My bibliography  Save this article

Parametric methods for confidence interval estimation of overlap coefficients

Author

Listed:
  • Wang, Dan
  • Tian, Lili

Abstract

Overlap coefficient (OVL), the proportion of overlap area between two probability distributions, is a direct measure of similarity between two distributions. It is useful in microarray analysis for the purpose of identifying differentially expressed biomarkers, especially when data follow multimodal distribution which cannot be transformed to normal. However, the inference methods about OVL are quite sparse. This article proposes two methods, a generalized inference (GI) approach and a parametric bootstrapping (PB) method, to construct confidence intervals of OVL under the assumption of normality. In conjunction with the EM algorithms, these methods are extended to mixture Gaussian (MG) distributions. The performances of these methods are evaluated empirically under a variety of distributions including normal, gamma and mixture Gaussian. At last, the proposed approaches are applied to a published microarray dataset from a gene expression study of three most prevalent adult lymphoid malignancies.

Suggested Citation

  • Wang, Dan & Tian, Lili, 2017. "Parametric methods for confidence interval estimation of overlap coefficients," Computational Statistics & Data Analysis, Elsevier, vol. 106(C), pages 12-26.
  • Handle: RePEc:eee:csdana:v:106:y:2017:i:c:p:12-26
    DOI: 10.1016/j.csda.2016.08.013
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S016794731630202X
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2016.08.013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Anderson, Gordon & Linton, Oliver & Whang, Yoon-Jae, 2012. "Nonparametric estimation and inference about the overlap of two distributions," Journal of Econometrics, Elsevier, vol. 171(1), pages 1-23.
    2. K. Krishnamoorthy & Yong Lu, 2003. "Inferences on the Common Mean of Several Normal Populations Based on the Generalized Variable Method," Biometrics, The International Biometric Society, vol. 59(2), pages 237-247, June.
    3. Kelly Zou & W. J. Hall, 2000. "Two transformation models for estimating an ROC curve derived from continuous data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 27(5), pages 621-631.
    4. Clemons, Traci E. & Jr., Edwin L. Bradley, 2000. "A nonparametric measure of the overlapping coefficient," Computational Statistics & Data Analysis, Elsevier, vol. 34(1), pages 51-61, July.
    5. Schmid, Friedrich & Schmidt, Axel, 2006. "Nonparametric estimation of the coefficient of overlapping--theory and empirical application," Computational Statistics & Data Analysis, Elsevier, vol. 50(6), pages 1583-1596, March.
    6. Mulekar, Madhuri S. & Mishra, Satya N., 2000. "Confidence interval estimation of overlap: equal means case," Computational Statistics & Data Analysis, Elsevier, vol. 34(2), pages 121-137, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Eidous, Omar M. & Ananbeh, Enas A., 2024. "Kernel method for estimating overlapping coefficient using numerical integration methods," Applied Mathematics and Computation, Elsevier, vol. 462(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gordon Anderson & Oliver Linton & Yoon-Jae Wang, 2009. "Non Parametric Estimation of a Polarization Measure," Working Papers tecipa-363, University of Toronto, Department of Economics.
    2. Eidous, Omar M. & Ananbeh, Enas A., 2024. "Kernel method for estimating overlapping coefficient using numerical integration methods," Applied Mathematics and Computation, Elsevier, vol. 462(C).
    3. Schmid, Friedrich & Schmidt, Axel, 2006. "Nonparametric estimation of the coefficient of overlapping--theory and empirical application," Computational Statistics & Data Analysis, Elsevier, vol. 50(6), pages 1583-1596, March.
    4. Anderson, Gordon & Linton, Oliver & Whang, Yoon-Jae, 2012. "Nonparametric estimation and inference about the overlap of two distributions," Journal of Econometrics, Elsevier, vol. 171(1), pages 1-23.
    5. Yin, Jingjing & Tian, Lili, 2014. "Joint inference about sensitivity and specificity at the optimal cut-off point associated with Youden index," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 1-13.
    6. Y. Huang & M. S. Pepe, 2009. "A Parametric ROC Model-Based Approach for Evaluating the Predictiveness of Continuous Markers in Case–Control Studies," Biometrics, The International Biometric Society, vol. 65(4), pages 1133-1144, December.
    7. Liu, Shen & Maharaj, Elizabeth Ann & Inder, Brett, 2014. "Polarization of forecast densities: A new approach to time series classification," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 345-361.
    8. Gordon Anderson & Oliver Linton & Jasmin Thomas, 2017. "Similarity, dissimilarity and exceptionality: generalizing Gini’s transvariation to measure “differentness” in many distributions," METRON, Springer;Sapienza Università di Roma, vol. 75(2), pages 161-180, August.
    9. Li, Xinmin & Wang, Juan & Liang, Hua, 2011. "Comparison of several means: A fiducial based approach," Computational Statistics & Data Analysis, Elsevier, vol. 55(5), pages 1993-2002, May.
    10. Chang, Ching-Hui & Pal, Nabendu, 2008. "Testing on the common mean of several normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 321-333, December.
    11. Lee, Sokbae & Song, Kyungchul & Whang, Yoon-Jae, 2018. "Testing For A General Class Of Functional Inequalities," Econometric Theory, Cambridge University Press, vol. 34(5), pages 1018-1064, October.
    12. Anderson, Gordon & Leo, Teng Wah, 2013. "An empirical examination of matching theories: The one child policy, partner choice and matching intensity in urban China," Journal of Comparative Economics, Elsevier, vol. 41(2), pages 468-489.
    13. H. Zakerzadeh & A. Jafari, 2015. "Inference on the parameters of two Weibull distributions based on record values," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(1), pages 25-40, March.
    14. Toru Kitagawa, 2013. "A bootstrap test for instrument validity in heterogeneous treatment effect models," CeMMAP working papers CWP53/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    15. Zhongkai Liu & Howard D. Bondell, 2019. "Binormal Precision–Recall Curves for Optimal Classification of Imbalanced Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 11(1), pages 141-161, April.
    16. François Gerard & Miikka Rokkanen & Christoph Rothe, 2020. "Bounds on treatment effects in regression discontinuity designs with a manipulated running variable," Quantitative Economics, Econometric Society, vol. 11(3), pages 839-870, July.
    17. Gordon Anderson & Oliver Linton & Maria Grazia Pittau & Yoon-Jae Whang & Roberto Zelli, 2021. "On unit free assessment of the extent of multilateral distributional variation," The Econometrics Journal, Royal Economic Society, vol. 24(3), pages 502-518.
    18. Kelly Zou & W. J. Hall, 2002. "Semiparametric and parametric transformation models for comparing diagnostic markers with paired design," Journal of Applied Statistics, Taylor & Francis Journals, vol. 29(6), pages 803-816.
    19. Cheam, Amay S.M. & McNicholas, Paul D., 2016. "Modelling receiver operating characteristic curves using Gaussian mixtures," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 192-208.
    20. Maribel Jiménez & Mónica Jiménez, 2019. "Intergenerational educational mobility in Latin America. An analysis from the equal opportunity approach," Revista Cuadernos de Economia, Universidad Nacional de Colombia, FCE, CID, vol. 38(76), pages 289-330, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:106:y:2017:i:c:p:12-26. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.