Printed from https://ideas.repec.org/a/bpj/sagmbi/v11y2012i6p1-22n3.html

Time Dependent ROC Curves for the Estimation of True Prognostic Capacity of Microarray Data

Authors

  • Foucher Yohann (University of Nantes - EA 4275)
  • Danger Richard (INSERM U1064 - ITUN)

Abstract

Microarray data can be used to identify prognostic signatures from time-to-event data. Microarray analyses are prone to overfitting, and many papers have addressed this issue. However, little attention has been paid to incomplete time-to-event data (truncated and censored follow-up). We have adapted the 0.632+ bootstrap estimator to the evaluation of time-dependent ROC curves. The interpretation of ROC-based results is well established in the scientific and medical communities. Moreover, unlike many other prognostic statistics, the results do not depend on the incidence of the event. We tested this methodology by simulation and illustrated its utility by analyzing a data set of patients with diffuse large-B-cell lymphoma. Our results show that the 0.632+ ROC-based approach is well suited to evaluating the true prognostic capacity of a microarray-based signature. The method is implemented in the R package ROCt632.
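To make the 0.632+ idea concrete, the sketch below applies the general 0.632+ weighting rule to a single AUC-like value. It is an illustration under stated assumptions, not the estimator from the paper and not the ROCt632 API. The function name `auc_632_plus` and its parameters are hypothetical. The sketch assumes that a higher AUC is better, that the no-information AUC is 0.5, and that the apparent AUC (same data used to fit and to evaluate) and the out-of-bag bootstrap AUC have already been computed elsewhere.

```python
def auc_632_plus(auc_apparent: float, auc_oob: float,
                 auc_no_info: float = 0.5) -> float:
    """Combine an apparent and an out-of-bag AUC with 0.632+ style weights.

    Hypothetical helper for illustration only. The inputs are assumed to be
    precomputed: auc_apparent on the training data (optimistic), auc_oob on
    bootstrap out-of-bag samples (pessimistic).
    """
    # Do not let the out-of-bag value fall below the no-information level.
    auc_oob = max(auc_oob, auc_no_info)

    # Relative overfitting rate R in [0, 1]: 0 means no overfitting,
    # 1 means the out-of-bag AUC is no better than chance.
    if auc_apparent > auc_no_info and auc_apparent > auc_oob:
        r = (auc_apparent - auc_oob) / (auc_apparent - auc_no_info)
    else:
        r = 0.0

    # The weight on the out-of-bag estimate grows from 0.632 (R = 0) to 1 (R = 1).
    w = 0.632 / (1.0 - 0.368 * r)
    return (1.0 - w) * auc_apparent + w * auc_oob


if __name__ == "__main__":
    # Moderate overfitting: the result lies between the two inputs.
    print(round(auc_632_plus(0.9, 0.7), 4))
```

With no overfitting (R = 0) the rule reduces to the plain 0.632 bootstrap weighting. As overfitting grows, the weight shifts toward the out-of-bag estimate, so a signature that only fits noise is pulled back to the no-information level.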

Suggested Citation

  • Foucher Yohann & Danger Richard, 2012. "Time Dependent ROC Curves for the Estimation of True Prognostic Capacity of Microarray Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(6), pages 1-22, November.
  • Handle: RePEc:bpj:sagmbi:v:11:y:2012:i:6:p:1-22:n:3
    DOI: 10.1515/1544-6115.1815


