Printed from https://ideas.repec.org/a/eee/csdana/v88y2015icp15-27.html

Two simple algorithms on linear combination of multiple biomarkers to maximize partial area under the ROC curve

Author

Listed:
  • Yu, Wenbao
  • Park, Taesung

Abstract

In clinical practice, it is common that several biomarkers are related to a specific disease while no single marker has enough diagnostic power on its own. An effective way to improve diagnostic accuracy is to combine multiple markers. The area under the receiver operating characteristic curve (AUC) is widely used to evaluate a diagnostic tool. Su and Liu (1993) derived the best linear combination that maximizes the AUC when the markers are multivariate normally distributed. However, many applications do not operate over the entire range of the curve, but only in particular regions of it, for example, high-specificity regions. In these cases, it is more practical to analyze the partial area under the curve (pAUC). In this paper, we propose two easily implemented algorithms to find the best linear combination of multiple biomarkers that optimizes the pAUC for a given range of specificity. Analysis of synthesized and real datasets shows that the proposed algorithms achieve larger predictive pAUC values on future observations than existing methods, such as Su and Liu’s method, logistic regression and others.
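
The two quantities named in the abstract can be illustrated with a minimal sketch. It does not reproduce the two algorithms proposed in the paper, which this record does not describe. It shows only the Su and Liu (1993) baseline, whose coefficient vector under multivariate normality is proportional to (Σ_D + Σ_H)^{-1}(μ_D − μ_H), and an empirical pAUC over false positive rates in [0, max_fpr], that is, specificity of at least 1 − max_fpr. The function names, the `max_fpr` parameter, the simulated data, and the convention that larger scores indicate disease are all assumptions made for illustration.

```python
import numpy as np


def su_liu_coefficients(x_diseased, x_healthy):
    """Su-Liu direction (Sigma_D + Sigma_H)^{-1} (mu_D - mu_H).

    Rows are subjects, columns are markers. Hypothetical helper name.
    """
    mu_diff = x_diseased.mean(axis=0) - x_healthy.mean(axis=0)
    summed_cov = np.cov(x_diseased, rowvar=False) + np.cov(x_healthy, rowvar=False)
    return np.linalg.solve(summed_cov, mu_diff)


def empirical_pauc(score_diseased, score_healthy, max_fpr):
    """Area under the empirical ROC step curve for FPR in [0, max_fpr].

    Assumes larger scores indicate disease. Not normalized, so the
    largest attainable value is max_fpr.
    """
    score_diseased = np.asarray(score_diseased, dtype=float)
    healthy_desc = np.sort(np.asarray(score_healthy, dtype=float))[::-1]
    n_healthy = healthy_desc.size
    # Height of the ROC curve on the k-th horizontal step: the fraction of
    # diseased scores strictly above the k-th largest healthy score.
    tpr_steps = (score_diseased[None, :] > healthy_desc[:, None]).mean(axis=1)
    # Each step spans FPR from k/n to (k+1)/n; clip the span at max_fpr.
    left = np.arange(n_healthy) / n_healthy
    right = np.arange(1, n_healthy + 1) / n_healthy
    widths = np.clip(np.minimum(right, max_fpr) - left, 0.0, None)
    return float(np.sum(widths * tpr_steps))


# Illustrative data only: two simulated markers, not data from the paper.
rng = np.random.default_rng(0)
healthy = rng.normal(loc=[0.0, 0.0], scale=1.0, size=(200, 2))
diseased = rng.normal(loc=[1.0, 0.5], scale=1.0, size=(200, 2))

beta = su_liu_coefficients(diseased, healthy)
pauc = empirical_pauc(diseased @ beta, healthy @ beta, max_fpr=0.1)
print("coefficients:", beta)
print("pAUC for specificity >= 0.9:", pauc)
```

Because the Su and Liu direction targets the full AUC, it need not be the best choice when only the high-specificity region matters, which is the gap the abstract says the proposed algorithms address. Evaluating the pAUC on held-out observations rather than on the fitting sample would match the abstract's emphasis on predictive pAUC.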

Suggested Citation

  • Yu, Wenbao & Park, Taesung, 2015. "Two simple algorithms on linear combination of multiple biomarkers to maximize partial area under the ROC curve," Computational Statistics & Data Analysis, Elsevier, vol. 88(C), pages 15-27.
  • Handle: RePEc:eee:csdana:v:88:y:2015:i:c:p:15-27
    DOI: 10.1016/j.csda.2014.12.002

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947314003405
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2014.12.002?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    References listed on IDEAS

    1. Yan, Jun, 2007. "Enjoy the Joy of Copulas: With a Package copula," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 21(i04).
    2. Lori E. Dodd & Margaret S. Pepe, 2003. "Partial AUC Estimation and Regression," Biometrics, The International Biometric Society, vol. 59(3), pages 614-623, September.
    3. Kelly Zou & W. J. Hall, 2002. "Semiparametric and parametric transformation models for comparing diagnostic markers with paired design," Journal of Applied Statistics, Taylor & Francis Journals, vol. 29(6), pages 803-816.
    4. Jin, Hua & Lu, Ying, 2009. "The optimal linear combination of multiple predictors under the generalized linear models," Statistics & Probability Letters, Elsevier, vol. 79(22), pages 2321-2327, November.
    5. Margaret Sullivan Pepe & Gary Longton & Garnet L. Anderson & Michel Schummer, 2003. "Selecting Differentially Expressed Genes from Microarray Experiments," Biometrics, The International Biometric Society, vol. 59(1), pages 133-142, March.
    6. Donna Katzman McClish, 1989. "Analyzing a Portion of the ROC Curve," Medical Decision Making, vol. 9(3), pages 190-195, August.
    7. Man-Jen Hsu & Huey-Miin Hsueh, 2013. "The linear combinations of biomarkers which maximize the partial area under the ROC curves," Computational Statistics, Springer, vol. 28(2), pages 647-666, April.

    Citations



    Cited by:

    1. Yusuf Yıldırım & Anirban Sanyal, 2022. "Evaluating the Effectiveness of Early Warning Indicators: An Application of Receiver Operating Characteristic Curve Approach to Panel Data," Scientific Annals of Economics and Business (continues Analele Stiintifice), Alexandru Ioan Cuza University, Faculty of Economics and Business Administration, vol. 69(4), pages 557-597, December.
    2. Rocío Aznar-Gimeno & Luis M. Esteban & Gerardo Sanz & Rafael del-Hoyo-Alonso & Ricardo Savirón-Cornudella, 2021. "Incorporating a New Summary Statistic into the Min–Max Approach: A Min–Max–Median, Min–Max–IQR Combination of Biomarkers for Maximising the Youden Index," Mathematics, MDPI, vol. 9(19), pages 1-17, October.
    3. Rocío Aznar-Gimeno & Luis M. Esteban & Rafael del-Hoyo-Alonso & Ángel Borque-Fernando & Gerardo Sanz, 2022. "A Stepwise Algorithm for Linearly Combining Biomarkers under Youden Index Maximization," Mathematics, MDPI, vol. 10(8), pages 1-26, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jialiang Li & Jason P. Fine, 2010. "Weighted area under the receiver operating characteristic curve and its application to gene selection," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 59(4), pages 673-692, August.
    2. Man-Jen Hsu & Huey-Miin Hsueh, 2013. "The linear combinations of biomarkers which maximize the partial area under the ROC curves," Computational Statistics, Springer, vol. 28(2), pages 647-666, April.
    3. Gigliarano, Chiara & Figini, Silvia & Muliere, Pietro, 2014. "Making classifier performance comparisons when ROC curves intersect," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 300-312.
    4. Yousef, Waleed A., 2013. "Assessing classifiers in terms of the partial area under the ROC curve," Computational Statistics & Data Analysis, Elsevier, vol. 64(C), pages 51-70.
    5. Schmid Matthias & Hothorn Torsten & Krause Friedemann & Rabe Christina, 2012. "A PAUC-based Estimation Technique for Disease Classification and Biomarker Selection," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(5), pages 1-26, October.
    6. M.L. Nores & M.P. Díaz, 2016. "Bootstrap hypothesis testing in generalized additive models for comparing curves of treatments in longitudinal studies," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(5), pages 810-826, April.
    7. Chen, Zhelun & O’Neill, Zheng & Wen, Jin & Pradhan, Ojas & Yang, Tao & Lu, Xing & Lin, Guanjing & Miyata, Shohei & Lee, Seungjae & Shen, Chou & Chiosa, Roberto & Piscitelli, Marco Savino & Capozzoli, , 2023. "A review of data-driven fault detection and diagnostics for building HVAC systems," Applied Energy, Elsevier, vol. 339(C).
    8. Pisit Leeahtam & Chukiat Chaiboonsri & Kanchana Chokethaworn & Prasert Chaitip & Songsak Sriboonchitta, 2011. "The Appropriate Model and Dependence Measures of Thailand’s Exchange Rate and Malaysia’s Exchange Rate: Linear, Nonlinear and Copulas Approach," Journal of Knowledge Management, Economics and Information Technology, ScientificPapers.org, vol. 1(6), pages 1-14, October.
    9. Margaret Sullivan Pepe & Tianxi Cai, 2004. "The Analysis of Placement Values for Evaluating Discriminatory Measures," Biometrics, The International Biometric Society, vol. 60(2), pages 528-535, June.
    10. Ozonder, Gozde & Miller, Eric J., 2021. "Longitudinal investigation of skeletal activity episode timing decisions – A copula approach," Journal of choice modelling, Elsevier, vol. 40(C).
    11. Junker, Robert R. & Griessenberger, Florian & Trutschnig, Wolfgang, 2021. "Estimating scale-invariant directed dependence of bivariate distributions," Computational Statistics & Data Analysis, Elsevier, vol. 153(C).
    12. Miao, Ruiqing & Hennessy, David A. & Feng, Hongli, 2016. "The Effects of Crop Insurance Subsidies and Sodsaver on Land-Use Change," Journal of Agricultural and Resource Economics, Western Agricultural Economics Association, vol. 41(2), May.
    13. Jiří Dvořák & Tomáš Mrkvička, 2022. "Graphical tests of independence for general distributions," Computational Statistics, Springer, vol. 37(2), pages 671-699, April.
    14. Fantazzini, Dean, 2011. "Analysis of multidimensional probability distributions with copula functions," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 22(2), pages 98-134.
    15. Göran Kauermann & Christian Schellhase & David Ruppert, 2013. "Flexible Copula Density Estimation with Penalized Hierarchical B-splines," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 40(4), pages 685-705, December.
    16. Osamu Komori, 2011. "A boosting method for maximization of the area under the ROC curve," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 63(5), pages 961-979, October.
    17. Michael Schomaker & Christian Heumann, 2020. "When and when not to use optimal model averaging," Statistical Papers, Springer, vol. 61(5), pages 2221-2240, October.
    18. Panagiota Galiatsatou & Christos Makris & Panayotis Prinos & Dimitrios Kokkinos, 2019. "Nonstationary joint probability analysis of extreme marine variables to assess design water levels at the shoreline in a changing climate," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 98(3), pages 1051-1089, September.
    19. Shofiqul Islam & Sonia Anand & Jemila Hamid & Lehana Thabane & Joseph Beyene, 2020. "A copula-based method of classifying individuals into binary disease categories using dependent biomarkers," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 29(4), pages 871-897, December.
    20. Wyłupek, Grzegorz, 2023. "A nonparametric test for paired data," Journal of Multivariate Analysis, Elsevier, vol. 198(C).
