IDEAS home Printed from https://ideas.repec.org/p/pra/mprapa/58043.html
   My bibliography  Save this paper

Gene selection for survival data under dependent censoring: a copula-based approach

Author

Listed:
  • Emura, Takeshi
  • Chen, Yi-Hau

Abstract

Dependent censoring arises in biomedical studies when the survival outcome of interest is censored by competing risks. In survival data with microarray gene expressions, gene selection based on the univariate Cox regression analyses has been used extensively in medical research, which however, is only valid under the independent censoring assumption. In this paper, we first consider a copula-based framework to investigate the bias caused by dependent censoring on gene selection. Then, we utilize the copula-based dependence model to develop an alternative gene selection procedure. Simulations show that the proposed procedure adjusts for the effect of dependent censoring and thus outperforms the existing method when dependent censoring is indeed present. The non-small-cell lung cancer data is analyzed to demonstrate the usefulness of our proposal. We implemented the proposed method in an R “compound.Cox” package.

Suggested Citation

  • Emura, Takeshi & Chen, Yi-Hau, 2014. "Gene selection for survival data under dependent censoring: a copula-based approach," MPRA Paper 58043, University Library of Munich, Germany.
  • Handle: RePEc:pra:mprapa:58043
    as

    Download full text from publisher

    File URL: https://mpra.ub.uni-muenchen.de/58043/1/MPRA_paper_58043.pdf
    File Function: original version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Yi‐Hau Chen, 2010. "Semiparametric marginal regression analysis for dependent competing risks under an assumed copula," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(2), pages 235-251, March.
    2. Rivest, Louis-Paul & Wells, Martin T., 2001. "A Martingale Approach to the Copula-Graphic Estimator for the Survival Function under Dependent Censoring," Journal of Multivariate Analysis, Elsevier, vol. 79(1), pages 138-155, October.
    3. Takeshi Emura & Yi-Hau Chen & Hsuan-Yu Chen, 2012. "Survival Prediction Based on Compound Covariate under Cox Proportional Hazard Models," PLOS ONE, Public Library of Science, vol. 7(10), pages 1-12, October.
    4. Emura, Takeshi & Chen, Yi-Hau & Chen, Hsuan-Yu, 2012. "Survival prediction based on compound covariate under cox proportional hazard models," MPRA Paper 41149, University Library of Munich, Germany.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Takeshi Emura & Chi-Hung Pan, 2020. "Parametric likelihood inference and goodness-of-fit for dependently left-truncated data, a copula-based approach," Statistical Papers, Springer, vol. 61(1), pages 479-501, February.
    2. Emura, Takeshi & Shiu, Shau-Kai, 2014. "Estimation and model selection for left-truncated and right-censored lifetime data with application to electric power transformers analysis," MPRA Paper 57528, University Library of Munich, Germany.
    3. Jia-Han Shih & Takeshi Emura, 2018. "Likelihood-based inference for bivariate latent failure time models with competing risks under the generalized FGM copula," Computational Statistics, Springer, vol. 33(3), pages 1293-1323, September.
    4. Emura, Takeshi & Hsu, Jiun-Huang, 2020. "Estimation of the Mann–Whitney effect in the two-sample problem under dependent censoring," Computational Statistics & Data Analysis, Elsevier, vol. 150(C).
    5. T. Emura & K. Murotani, 2015. "An algorithm for estimating survival under a copula-based dependent truncation model," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(4), pages 734-751, December.
    6. Yicheng Zhou & Zhenzhou Lu & Yan Shi & Kai Cheng, 2019. "The copula-based method for statistical analysis of step-stress accelerated life test with dependent competing failure modes," Journal of Risk and Reliability, , vol. 233(3), pages 401-418, June.
    7. Cécile Chauvel & John O'Quigley, 2017. "Survival model construction guided by fit and predictive strength," Biometrics, The International Biometric Society, vol. 73(2), pages 483-494, June.
    8. Long, Ting-Hsuan & Emura, Takeshi, 2014. "A control chart using copula-based Markov chain models," MPRA Paper 57419, University Library of Munich, Germany.
    9. Jia-Han Shih & Takeshi Emura, 2019. "Bivariate dependence measures and bivariate competing risks models under the generalized FGM copula," Statistical Papers, Springer, vol. 60(4), pages 1101-1118, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Herbert Hove & Frank Beichelt & Parmod K. Kapur, 2017. "Estimation of the Frank copula model for dependent competing risks in accelerated life testing," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 8(4), pages 673-682, December.
    2. Deresa, Negera Wakgari & Van Keilegom, Ingrid, 2020. "A multivariate normal regression model for survival data subject to different types of dependent censoring," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    3. Jia-Han Shih & Takeshi Emura, 2018. "Likelihood-based inference for bivariate latent failure time models with competing risks under the generalized FGM copula," Computational Statistics, Springer, vol. 33(3), pages 1293-1323, September.
    4. Lo Simon M.S. & Wilke Ralf A., 2014. "A Regression Model for the Copula-Graphic Estimator," Journal of Econometric Methods, De Gruyter, vol. 3(1), pages 21-46, January.
    5. Lo Simon M.S. & Wilke Ralf A., 2014. "A Regression Model for the Copula-Graphic Estimator," Journal of Econometric Methods, De Gruyter, vol. 3(1), pages 21-46, January.
    6. Lo, Simon M.S. & Wilke, Ralf A. & Emura, Takeshi, 2024. "A semiparametric model for the cause-specific hazard under risk proportionality," Computational Statistics & Data Analysis, Elsevier, vol. 195(C).
    7. Ahmed A. Ewees & Mohammed A. A. Al-qaness & Laith Abualigah & Diego Oliva & Zakariya Yahya Algamal & Ahmed M. Anter & Rehab Ali Ibrahim & Rania M. Ghoniem & Mohamed Abd Elaziz, 2021. "Boosting Arithmetic Optimization Algorithm with Genetic Algorithm Operators for Feature Selection: Case Study on Cox Proportional Hazards Model," Mathematics, MDPI, vol. 9(18), pages 1-22, September.
    8. Deresa, N.W. & Van Keilegom, I. & Antonio, K., 2022. "Copula-based inference for bivariate survival data with left truncation and dependent censoring," Insurance: Mathematics and Economics, Elsevier, vol. 107(C), pages 1-21.
    9. Kim, Dongwoo, 2023. "Partially identifying competing risks models: An application to the war on cancer," Journal of Econometrics, Elsevier, vol. 234(2), pages 536-564.
    10. Jialiang Li & Tonghui Yu & Jing Lv & Mei‐Ling Ting Lee, 2021. "Semiparametric model averaging prediction for lifetime data via hazards regression," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(5), pages 1187-1209, November.
    11. Emura, Takeshi & Kao, Fan-Hsuan & Michimae, Hirofumi, 2014. "An improved nonparametric estimator of sub-distribution function for bivariate competing risk models," Journal of Multivariate Analysis, Elsevier, vol. 132(C), pages 229-241.
    12. T. Emura & K. Murotani, 2015. "An algorithm for estimating survival under a copula-based dependent truncation model," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(4), pages 734-751, December.
    13. Ebrahimi, Nader & Molefe, Daniel, 2003. "Survival function estimation when lifetime and censoring time are dependent," Journal of Multivariate Analysis, Elsevier, vol. 87(1), pages 101-132, October.
    14. Schwarz, Maik & Jongbloed, Geurt & Van Keilegom, Ingrid, 2012. "On the identifiability of copulas in bivariate competing risks models," LIDAM Discussion Papers ISBA 2012032, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    15. Jacobo Uña-Álvarez & Noël Veraverbeke, 2013. "Generalized copula-graphic estimator," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(2), pages 343-360, June.
    16. Lajmi Lakhal & Louis-Paul Rivest & Belkacem Abdous, 2008. "Estimating Survival and Association in a Semicompeting Risks Model," Biometrics, The International Biometric Society, vol. 64(1), pages 180-188, March.
    17. Simon M. S. Lo & Ralf A. Wilke, 2010. "A copula model for dependent competing risks," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 59(2), pages 359-376, March.
    18. Liu, Yi & Wang, Qihua, 2015. "Copula-graphic estimators for the marginal survival function with censoring indicators missing at random," Statistics & Probability Letters, Elsevier, vol. 107(C), pages 101-110.
    19. Zhao, XiaoBing & Zhou, Xian, 2010. "Applying copula models to individual claim loss reserving methods," Insurance: Mathematics and Economics, Elsevier, vol. 46(2), pages 290-299, April.
    20. Emura, Takeshi & Hsu, Jiun-Huang, 2020. "Estimation of the Mann–Whitney effect in the two-sample problem under dependent censoring," Computational Statistics & Data Analysis, Elsevier, vol. 150(C).

    More about this item

    Keywords

    Bivariate survival distribution; Competing risk; Compound covariate prediction; Cox regression; Cross validation; Frailty; Kendall’s tau;
    All these keywords.

    JEL classification:

    • C12 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Hypothesis Testing: General
    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C34 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Truncated and Censored Models; Switching Regression Models

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pra:mprapa:58043. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Joachim Winter (email available below). General contact details of provider: https://edirc.repec.org/data/vfmunde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.