IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v9y2021i21p2826-d673830.html
   My bibliography  Save this article

Evaluating the Performances of Biomarkers over a Restricted Domain of High Sensitivity

Author

Listed:
  • Manuel Franco

    (Department of Statistics and Operations Research, University of Murcia, CEIR Campus Mare Nostrum, 30100 Murcia, Spain
    Bio-Health Research Institute of Murcia (IMIB-Arrixaca), 30120 Murcia, Spain)

  • Juana-María Vivo

    (Department of Statistics and Operations Research, University of Murcia, CEIR Campus Mare Nostrum, 30100 Murcia, Spain
    Bio-Health Research Institute of Murcia (IMIB-Arrixaca), 30120 Murcia, Spain)

Abstract

The burgeoning advances in high-throughput technologies have posed a great challenge to the identification of novel biomarkers for diagnosing, by contemporary models and methods, through bioinformatics-driven analysis. Diagnostic performance metrics such as the partial area under the R O C ( p A U C ) indexes exhibit limitations to analysing genomic data. Among other issues, the inability to differentiate between biomarkers whose R O C curves cross each other with the same p A U C value, the inappropriate expression of non-concave R O C curves, and the lack of a convenient interpretation, restrict their use in practice. Here, we have proposed the fitted partial area index ( F p A U C ), which is computable through an algorithm valid for any R O C curve shape, as an alternative performance summary for the evaluation of highly sensitive biomarkers. The proposed approach is based on fitter upper and lower bounds of the p A U C in a high-sensitivity region. Through variance estimates, simulations, and case studies for diagnosing leukaemia, and ovarian and colon cancers, we have proven the usefulness of the proposed metric in terms of restoring the interpretation and improving diagnostic accuracy. It is robust and feasible even when the R O C curve shows hooks, and solves performance ties between competitive biomarkers.

Suggested Citation

  • Manuel Franco & Juana-María Vivo, 2021. "Evaluating the Performances of Biomarkers over a Restricted Domain of High Sensitivity," Mathematics, MDPI, vol. 9(21), pages 1-20, November.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:21:p:2826-:d:673830
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/21/2826/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/21/2826/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. James A. Hanley, 1988. "The Robustness of the "Binormal" Assumptions Used in Fitting ROC Curves," Medical Decision Making, , vol. 8(3), pages 197-203, August.
    2. Barbara J. McNeil & James A. Hanley, 1984. "Statistical Approaches to the Analysis of Receiver Operating Characteristic (ROC) Curves," Medical Decision Making, , vol. 4(2), pages 137-150, June.
    3. Juana-María Vivo & Manuel Franco & Donatella Vicari, 2018. "Rethinking an ROC partial area index for evaluating the classification performance at a high specificity range," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(3), pages 683-704, September.
    4. Peterson, A. Townsend & Papeş, Monica & Soberón, Jorge, 2008. "Rethinking receiver operating characteristic analysis applications in ecological niche modeling," Ecological Modelling, Elsevier, vol. 213(1), pages 63-72.
    5. Dudoit S. & Fridlyand J. & Speed T. P, 2002. "Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 77-87, March.
    6. Margaret Sullivan Pepe & Gary Longton & Garnet L. Anderson & Michel Schummer, 2003. "Selecting Differentially Expressed Genes from Microarray Experiments," Biometrics, The International Biometric Society, vol. 59(1), pages 133-142, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bickel David R., 2008. "Correcting the Estimated Level of Differential Expression for Gene Selection Bias: Application to a Microarray Study," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 7(1), pages 1-27, March.
    2. Juana-María Vivo & Manuel Franco & Donatella Vicari, 2018. "Rethinking an ROC partial area index for evaluating the classification performance at a high specificity range," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(3), pages 683-704, September.
    3. Kubokawa, Tatsuya & Srivastava, Muni S., 2008. "Estimation of the precision matrix of a singular Wishart distribution and its application in high-dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 99(9), pages 1906-1928, October.
    4. Václavík, Tomáš & Meentemeyer, Ross K., 2009. "Invasive species distribution modeling (iSDM): Are absence data and dispersal constraints needed to predict actual distributions?," Ecological Modelling, Elsevier, vol. 220(23), pages 3248-3258.
    5. Hossain, Ahmed & Beyene, Joseph & Willan, Andrew R. & Hu, Pingzhao, 2009. "A flexible approximate likelihood ratio test for detecting differential expression in microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 53(10), pages 3685-3695, August.
    6. Bilin Zeng & Xuerong Meggie Wen & Lixing Zhu, 2017. "A link-free sparse group variable selection method for single-index model," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(13), pages 2388-2400, October.
    7. Haoxiang Zhao & Shanqing Yi & Yu Zhang & Nianwan Yang & Jianyang Guo & Hongmei Li & Xiaoqing Xian & Wanxue Liu, 2024. "Estimating the Optimal Control Areas of Two Classical Biocontrol Agents Against the Fall Armyworm Based on Hotspot Matching Analysis," Agriculture, MDPI, vol. 14(12), pages 1-14, December.
    8. Ramos, Rodrigo Soares & Kumar, Lalit & Shabani, Farzin & Picanço, Marcelo Coutinho, 2019. "Risk of spread of tomato yellow leaf curl virus (TYLCV) in tomato crops under various climate change scenarios," Agricultural Systems, Elsevier, vol. 173(C), pages 524-535.
    9. Márton Gosztonyi & Csákné Filep Judit, 2022. "Profiling (Non-)Nascent Entrepreneurs in Hungary Based on Machine Learning Approaches," Sustainability, MDPI, vol. 14(6), pages 1-20, March.
    10. Wang, Tao & Xu, Pei-Rong & Zhu, Li-Xing, 2012. "Non-convex penalized estimation in high-dimensional models with single-index structure," Journal of Multivariate Analysis, Elsevier, vol. 109(C), pages 221-235.
    11. Lin, Yu-Pin & Wang, Cheng-Long & Yu, Hsiao-Hsuan & Huang, Chung-Wei & Wang, Yung-Chieh & Chen, Yu-Wen & Wu, Wei-Yao, 2011. "Monitoring and estimating the flow conditions and fish presence probability under various flow conditions at reach scale using genetic algorithms and kriging methods," Ecological Modelling, Elsevier, vol. 222(3), pages 762-775.
    12. Zhao, Jianhua & Yu, Philip L.H. & Shi, Lei & Li, Shulan, 2012. "Separable linear discriminant analysis," Computational Statistics & Data Analysis, Elsevier, vol. 56(12), pages 4290-4300.
    13. Shaheena Bashir & Edward Carter, 2010. "Penalized multinomial mixture logit model," Computational Statistics, Springer, vol. 25(1), pages 121-141, March.
    14. Park, Junyong & Park, DoHwan, 2015. "Stein’s method in high dimensional classification and applications," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 110-125.
    15. Osamu Komori, 2011. "A boosting method for maximization of the area under the ROC curve," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 63(5), pages 961-979, October.
    16. Chakraborty, Sounak, 2009. "Bayesian binary kernel probit model for microarray based cancer classification and gene selection," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4198-4209, October.
    17. Ata Kabán & Efstratios Palias, 2024. "A Bhattacharyya-type Conditional Error Bound for Quadratic Discriminant Analysis," Methodology and Computing in Applied Probability, Springer, vol. 26(4), pages 1-17, December.
    18. Kasim Adetayo & Lin Dan & Van Sanden Suzy & Clevert Djork-Arné & Bijnens Luc & Göhlmann Hinrich & Amaratunga Dhammika & Hochreiter Sepp & Shkedy Ziv & Talloen Willem, 2010. "Informative or Noninformative Calls for Gene Expression: A Latent Variable Approach," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-31, January.
    19. Xiaolu Wei & Yubo Tian & Na Li & Huanxin Peng, 2024. "Evaluating ensemble learning techniques for stock index trend prediction: a case of China," Portuguese Economic Journal, Springer;Instituto Superior de Economia e Gestao, vol. 23(3), pages 505-530, September.
    20. Martín, Gerardo & Yáñez-Arenas, Carlos & Chiappa-Carrara, Xavier, 2022. "Discrepancies between point process models and environmental envelopes identify the niche centroid – geography configuration," Ecological Modelling, Elsevier, vol. 469(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:9:y:2021:i:21:p:2826-:d:673830. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.