IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v172y2022ics0167947322000640.html
   My bibliography  Save this article

Variable selection for case-cohort studies with informatively interval-censored outcomes

Author

Listed:
  • Du, Mingyue
  • Zhao, Xingqiu
  • Sun, Jianguo

Abstract

Variable selection has recently attracted a great deal of attention and in particular, a couple of methods have been proposed for general interval-censored failure time data or the interval-censored data arising from case-cohort studies. However, all of them have some limitations or apply only to limited situations. Corresponding to these, a new, more general variable selection approach is proposed under a class of flexible semiparametric transformation models that allows for dependent interval censoring. In particular, the oracle property of the method under the broken adaptive ridge penalty function is established and for its implementation, a novel EM algorithm is developed. Also a simulation study is performed and suggests that the proposed approach works well in practical situations. Finally the method is applied to a HIV trial that motivated this study.

Suggested Citation

  • Du, Mingyue & Zhao, Xingqiu & Sun, Jianguo, 2022. "Variable selection for case-cohort studies with informatively interval-censored outcomes," Computational Statistics & Data Analysis, Elsevier, vol. 172(C).
  • Handle: RePEc:eee:csdana:v:172:y:2022:i:c:s0167947322000640
    DOI: 10.1016/j.csda.2022.107484
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947322000640
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2022.107484?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    2. Hui Zhao & Qiwei Wu & Gang Li & Jianguo Sun, 2020. "Simultaneous Estimation and Variable Selection for Interval-Censored Data With Broken Adaptive Ridge Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(529), pages 204-216, January.
    3. Hongyuan Cao & Mathew M. Churpek & Donglin Zeng & Jason P. Fine, 2015. "Analysis of the Proportional Hazards Model With Sparse Longitudinal Covariates," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1187-1196, September.
    4. Chiung-Yu Huang & Mei-Cheng Wang, 2004. "Joint Modeling and Estimation for Recurrent Event Processes and Failure Time Data," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 1153-1165, December.
    5. Scolas, Sylvie & El Ghouch, Anouar & Legrand, Catherine & Oulhaj, Abderrahim, 2016. "Variable selection in a flexible parametric mixture cure model with interval-censored data," LIDAM Reprints ISBA 2016016, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    6. Ai Ni & Jianwen Cai & Donglin Zeng, 2016. "Variable selection for case-cohort studies with failure time outcome," Biometrika, Biometrika Trust, vol. 103(3), pages 547-562.
    7. Peijie Wang & Hui Zhao & Jianguo Sun, 2016. "Regression analysis of case K interval‐censored failure time data in the presence of informative censoring," Biometrics, The International Biometric Society, vol. 72(4), pages 1103-1112, December.
    8. Jing Qin & Yu Shen, 2010. "Statistical Methods for Analyzing Right-Censored Length-Biased Data under Cox Model," Biometrics, The International Biometric Society, vol. 66(2), pages 382-392, June.
    9. Q. Zhou & H. Zhou & J. Cai, 2017. "Case-cohort studies with interval-censored failure time data," Biometrika, Biometrika Trust, vol. 104(1), pages 17-29.
    10. Xiaoxi Liu & Donglin Zeng, 2013. "Variable selection in semiparametric transformation models for right-censored data," Biometrika, Biometrika Trust, vol. 100(4), pages 859-876.
    11. Ying Wu & Richard J. Cook, 2015. "Penalized regression for interval‐censored times of disease progression: Selection of HLA markers in psoriatic arthritis," Biometrics, The International Biometric Society, vol. 71(3), pages 782-791, September.
    12. Peijie Wang & Hui Zhao & Mingyue Du & Jianguo Sun, 2018. "Inference on semiparametric transformation model with general interval-censored failure time data," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 30(3), pages 758-773, July.
    13. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    14. Mingzhe Wu & Ming Zheng & Wen Yu & Ruofan Wu, 2018. "Estimation and variable selection for semiparametric transformation models under a more efficient cohort sampling design," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(3), pages 570-596, September.
    15. Hao Helen Zhang & Wenbin Lu, 2007. "Adaptive Lasso for Cox's proportional hazards model," Biometrika, Biometrika Trust, vol. 94(3), pages 691-703.
    16. Mingyue Du & Qingning Zhou & Shishun Zhao & Jianguo Sun, 2021. "Regression analysis of case-cohort studies in the presence of dependent interval censoring," Journal of Applied Statistics, Taylor & Francis Journals, vol. 48(5), pages 846-865, April.
    17. Lianming Wang & Christopher S. McMahan & Michael G. Hudgens & Zaina P. Qureshi, 2016. "A flexible, computationally efficient method for fitting the proportional hazards model to interval-censored data," Biometrics, The International Biometric Society, vol. 72(1), pages 222-231, March.
    18. Wang, Lu & Shen, Jincheng & Thall, Peter F., 2014. "A modified adaptive Lasso for identifying interactions in the Cox model with the heredity constraint," Statistics & Probability Letters, Elsevier, vol. 93(C), pages 126-133.
    19. Dai, Linlin & Chen, Kani & Sun, Zhihua & Liu, Zhenqiu & Li, Gang, 2018. "Broken adaptive ridge regression and its asymptotic properties," Journal of Multivariate Analysis, Elsevier, vol. 168(C), pages 334-351.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xu, Yang & Zhao, Shishun & Hu, Tao & Sun, Jianguo, 2021. "Variable selection for generalized odds rate mixture cure models with interval-censored failure time data," Computational Statistics & Data Analysis, Elsevier, vol. 156(C).
    2. Fengting Yi & Niansheng Tang & Jianguo Sun, 2022. "Simultaneous variable selection and estimation for joint models of longitudinal and failure time data with interval censoring," Biometrics, The International Biometric Society, vol. 78(1), pages 151-164, March.
    3. Liuquan Sun & Shuwei Li & Lianming Wang & Xinyuan Song & Xuemei Sui, 2022. "Simultaneous variable selection in regression analysis of multivariate interval‐censored data," Biometrics, The International Biometric Society, vol. 78(4), pages 1402-1413, December.
    4. Fan Feng & Guanghui Cheng & Jianguo Sun, 2023. "Variable Selection for Length-Biased and Interval-Censored Failure Time Data," Mathematics, MDPI, vol. 11(22), pages 1-20, November.
    5. Rong Liu & Shishun Zhao & Tao Hu & Jianguo Sun, 2022. "Variable Selection for Generalized Linear Models with Interval-Censored Failure Time Data," Mathematics, MDPI, vol. 10(5), pages 1-18, February.
    6. Qu, Lianqiang & Song, Xinyuan & Sun, Liuquan, 2018. "Identification of local sparsity and variable selection for varying coefficient additive hazards models," Computational Statistics & Data Analysis, Elsevier, vol. 125(C), pages 119-135.
    7. Jeongjin Lee & Taehwa Choi & Sangbum Choi, 2024. "Censored broken adaptive ridge regression in high-dimension," Computational Statistics, Springer, vol. 39(6), pages 3457-3482, September.
    8. Ying Wu & Richard J. Cook, 2018. "Variable selection and prediction in biased samples with censored outcomes," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 24(1), pages 72-93, January.
    9. Yuxiang Wu & Hui Zhao & Jianguo Sun, 2023. "Group variable selection for the Cox model with interval‐censored failure time data," Biometrics, The International Biometric Society, vol. 79(4), pages 3082-3095, December.
    10. Takumi Saegusa & Tianzhou Ma & Gang Li & Ying Qing Chen & Mei-Ling Ting Lee, 2020. "Variable Selection in Threshold Regression Model with Applications to HIV Drug Adherence Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 12(3), pages 376-398, December.
    11. Guang Cheng & Hao Zhang & Zuofeng Shang, 2015. "Sparse and efficient estimation for partial spline models with increasing dimension," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(1), pages 93-127, February.
    12. Lian, Heng & Li, Jianbo & Hu, Yuao, 2013. "Shrinkage variable selection and estimation in proportional hazards models with additive structure and high dimensionality," Computational Statistics & Data Analysis, Elsevier, vol. 63(C), pages 99-112.
    13. Zhao, Sihai Dave & Li, Yi, 2012. "Principled sure independence screening for Cox models with ultra-high-dimensional covariates," Journal of Multivariate Analysis, Elsevier, vol. 105(1), pages 397-411.
    14. Na You & Shun He & Xueqin Wang & Junxian Zhu & Heping Zhang, 2018. "Subtype classification and heterogeneous prognosis model construction in precision medicine," Biometrics, The International Biometric Society, vol. 74(3), pages 814-822, September.
    15. T. Cai & J. Huang & L. Tian, 2009. "Regularized Estimation for the Accelerated Failure Time Model," Biometrics, The International Biometric Society, vol. 65(2), pages 394-404, June.
    16. Heng Lian & Xin Chen & Jian-Yi Yang, 2012. "Identification of Partially Linear Structure in Additive Models with an Application to Gene Expression Prediction from Sequences," Biometrics, The International Biometric Society, vol. 68(2), pages 437-445, June.
    17. Engler David & Li Yi, 2009. "Survival Analysis with High-Dimensional Covariates: An Application in Microarray Studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-24, February.
    18. Belli, Edoardo, 2022. "Smoothly adaptively centered ridge estimator," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    19. Zhihua Sun & Yi Liu & Kani Chen & Gang Li, 2022. "Broken adaptive ridge regression for right-censored survival data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(1), pages 69-91, February.
    20. Elena Ivona DUMITRESCU & Sullivan HUE & Christophe HURLIN & Sessi TOKPAVI, 2020. "Machine Learning or Econometrics for Credit Scoring: Let’s Get the Best of Both Worlds," LEO Working Papers / DR LEO 2839, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:172:y:2022:i:c:s0167947322000640. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.