IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v9y2021i24p3262-d703501.html
   My bibliography  Save this article

COSMONET: An R Package for Survival Analysis Using Screening-Network Methods

Author

Listed:
  • Antonella Iuliano

    (Dipartimento di Matematica, Informatica ed Economia (DIMIE), Università degli Studi della Basilicata, 85100 Potenza, Italy
    These first authors contributed equally to this work.)

  • Annalisa Occhipinti

    (School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough TS1 3BX, UK
    These first authors contributed equally to this work.)

  • Claudia Angelini

    (Istituto per le Applicazioni del Calcolo “Mauro Picone” (IAC), Consiglio Nazionale delle Ricerche, 80131 Naples, Italy
    These authors contributed equally to this work.)

  • Italia De Feis

    (Istituto per le Applicazioni del Calcolo “Mauro Picone” (IAC), Consiglio Nazionale delle Ricerche, 80131 Naples, Italy
    These authors contributed equally to this work.)

  • Pietro Liò

    (Computer Laboratory, University of Cambridge, Cambridge CB2 1TN, UK
    These authors contributed equally to this work.)

Abstract

Identifying relevant genomic features that can act as prognostic markers for building predictive survival models is one of the central themes in medical research, affecting the future of personalized medicine and omics technologies. However, the high dimension of genome-wide omic data, the strong correlation among the features, and the low sample size significantly increase the complexity of cancer survival analysis, demanding the development of specific statistical methods and software. Here, we present a novel R package, COSMONET (COx Survival Methods based On NETworks), that provides a complete workflow from the pre-processing of omics data to the selection of gene signatures and prediction of survival outcomes. In particular, COSMONET implements (i) three different screening approaches to reduce the initial dimension of the data from a high-dimensional space p to a moderate scale d , (ii) a network-penalized Cox regression algorithm to identify the gene signature, (iii) several approaches to determine an optimal cut-off on the prognostic index ( P I ) to separate high- and low-risk patients, and (iv) a prediction step for patients’ risk class based on the evaluation of P I s . Moreover, COSMONET provides functions for data pre-processing, visualization, survival prediction, and gene enrichment analysis. We illustrate COSMONET through a step-by-step R vignette using two cancer datasets.

Suggested Citation

  • Antonella Iuliano & Annalisa Occhipinti & Claudia Angelini & Italia De Feis & Pietro Liò, 2021. "COSMONET: An R Package for Survival Analysis Using Screening-Network Methods," Mathematics, MDPI, vol. 9(24), pages 1-25, December.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:24:p:3262-:d:703501
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/24/3262/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/24/3262/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Silvia Pineda & Francisco X Real & Manolis Kogevinas & Alfredo Carrato & Stephen J Chanock & Núria Malats & Kristel Van Steen, 2015. "Integration Analysis of Three Omics Data Using Penalized Regression Methods: An Application to Bladder Cancer," PLOS Genetics, Public Library of Science, vol. 11(12), pages 1-22, December.
    2. Wei Zhang & Takayo Ota & Viji Shridhar & Jeremy Chien & Baolin Wu & Rui Kuang, 2013. "Network-based Survival Analysis Reveals Subnetwork Signatures for Predicting Outcomes of Ovarian Cancer Treatment," PLOS Computational Biology, Public Library of Science, vol. 9(3), pages 1-16, March.
    3. Engler David & Li Yi, 2009. "Survival Analysis with High-Dimensional Covariates: An Application in Microarray Studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-24, February.
    4. Jianqing Fan & Jinchi Lv, 2008. "Sure independence screening for ultrahigh dimensional feature space," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 849-911, November.
    5. Renaud Tissier & Jeanine Houwing-Duistermaat & Mar Rodríguez-Girondo, 2018. "Improving stability of prediction models based on correlated omics data by using network approaches," PLOS ONE, Public Library of Science, vol. 13(2), pages 1-23, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Claudia Angelini & Daniela De Canditiis & Anna Plaksienko, 2022. "Jewel 2.0 : An Improved Joint Estimation Method for Multiple Gaussian Graphical Models," Mathematics, MDPI, vol. 10(21), pages 1-20, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhao, Xiaobing & Zhou, Xian, 2014. "Sufficient dimension reduction on marginal regression for gaps of recurrent events," Journal of Multivariate Analysis, Elsevier, vol. 127(C), pages 56-71.
    2. Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
    3. Sauvenier, Mathieu & Van Bellegem, Sébastien, 2023. "Direction Identification and Minimax Estimation by Generalized Eigenvalue Problem in High Dimensional Sparse Regression," LIDAM Discussion Papers CORE 2023005, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    4. Ahmed Ismaïl & Hartikainen Anna-Liisa & Järvelin Marjo-Riitta & Richardson Sylvia, 2011. "False Discovery Rate Estimation for Stability Selection: Application to Genome-Wide Association Studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-20, November.
    5. Shan Luo & Zehua Chen, 2014. "Sequential Lasso Cum EBIC for Feature Selection With Ultra-High Dimensional Feature Space," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1229-1240, September.
    6. Shi Chen & Wolfgang Karl Hardle & Brenda L'opez Cabrera, 2020. "Regularization Approach for Network Modeling of German Power Derivative Market," Papers 2009.09739, arXiv.org.
    7. Wang, Christina Dan & Chen, Zhao & Lian, Yimin & Chen, Min, 2022. "Asset selection based on high frequency Sharpe ratio," Journal of Econometrics, Elsevier, vol. 227(1), pages 168-188.
    8. Laurent Ferrara & Anna Simoni, 2023. "When are Google Data Useful to Nowcast GDP? An Approach via Preselection and Shrinkage," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(4), pages 1188-1202, October.
    9. Caroline Jardet & Baptiste Meunier, 2022. "Nowcasting world GDP growth with high‐frequency data," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(6), pages 1181-1200, September.
    10. Peter Bühlmann & Jacopo Mandozzi, 2014. "High-dimensional variable screening and bias in subsequent inference, with an empirical comparison," Computational Statistics, Springer, vol. 29(3), pages 407-430, June.
    11. Sangjin Kim & Jong-Min Kim, 2019. "Two-Stage Classification with SIS Using a New Filter Ranking Method in High Throughput Data," Mathematics, MDPI, vol. 7(6), pages 1-16, May.
    12. Anders Bredahl Kock, 2012. "On the Oracle Property of the Adaptive Lasso in Stationary and Nonstationary Autoregressions," CREATES Research Papers 2012-05, Department of Economics and Business Economics, Aarhus University.
    13. Hung Hung & Su‐Yun Huang, 2019. "Sufficient dimension reduction via random‐partitions for the large‐p‐small‐n problem," Biometrics, The International Biometric Society, vol. 75(1), pages 245-255, March.
    14. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    15. Li, Xinyi & Wang, Li & Nettleton, Dan, 2019. "Sparse model identification and learning for ultra-high-dimensional additive partially linear models," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 204-228.
    16. Li, Peili & Jiao, Yuling & Lu, Xiliang & Kang, Lican, 2022. "A data-driven line search rule for support recovery in high-dimensional data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 174(C).
    17. Zhang, Jing & Wang, Qihua & Kang, Jian, 2020. "Feature screening under missing indicator imputation with non-ignorable missing response," Computational Statistics & Data Analysis, Elsevier, vol. 149(C).
    18. Lee, Ji Hyung & Shi, Zhentao & Gao, Zhan, 2022. "On LASSO for predictive regression," Journal of Econometrics, Elsevier, vol. 229(2), pages 322-349.
    19. Ian W. McKeague & Min Qian, 2015. "An Adaptive Resampling Test for Detecting the Presence of Significant Predictors," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1422-1433, December.
    20. Victor Chernozhukov & Christian Hansen & Yuan Liao, 2015. "A lava attack on the recovery of sums of dense and sparse signals," CeMMAP working papers CWP56/15, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:9:y:2021:i:24:p:3262-:d:703501. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.