IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v39y2024i6d10.1007_s00180-023-01446-1.html
   My bibliography  Save this article

Censored broken adaptive ridge regression in high-dimension

Author

Listed:
  • Jeongjin Lee

    (Ohio State University)

  • Taehwa Choi

    (Duke University)

  • Sangbum Choi

    (Korea University)

Abstract

Broken adaptive ridge (BAR) is a penalized regression method that performs variable selection via a computationally scalable surrogate to $$L_0$$ L 0 regularization. The BAR regression has many appealing features; it converges to selection with $$L_0$$ L 0 penalties as a result of reweighting $$L_2$$ L 2 penalties, and satisfies the oracle property with grouping effect for highly correlated covariates. In this paper, we investigate the BAR procedure for variable selection in a semiparametric accelerated failure time model with complex high-dimensional censored data. Coupled with Buckley-James-type responses, BAR-based variable selection procedures can be performed when event times are censored in complex ways, such as right-censored, left-censored, or double-censored. Our approach utilizes a two-stage cyclic coordinate descent algorithm to minimize the objective function by iteratively estimating the pseudo survival response and regression coefficients along the direction of coordinates. Under some weak regularity conditions, we establish both the oracle property and the grouping effect of the proposed BAR estimator. Numerical studies are conducted to investigate the finite-sample performance of the proposed algorithm and an application to real data is provided as a data example.

Suggested Citation

  • Jeongjin Lee & Taehwa Choi & Sangbum Choi, 2024. "Censored broken adaptive ridge regression in high-dimension," Computational Statistics, Springer, vol. 39(6), pages 3457-3482, September.
  • Handle: RePEc:spr:compst:v:39:y:2024:i:6:d:10.1007_s00180-023-01446-1
    DOI: 10.1007/s00180-023-01446-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-023-01446-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-023-01446-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    2. Ralph C A Rippe & Jacqueline J Meulman & Paul H C Eilers, 2012. "Visualization of Genomic Changes by Segmented Smoothing Using an L0 Penalty," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-14, June.
    3. Zeng, Donglin & Lin, D.Y., 2007. "Efficient Estimation for the Accelerated Failure Time Model," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1387-1396, December.
    4. Hui Zhao & Qiwei Wu & Gang Li & Jianguo Sun, 2020. "Simultaneous Estimation and Variable Selection for Interval-Censored Data With Broken Adaptive Ridge Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(529), pages 204-216, January.
    5. Zhezhen Jin, 2003. "Rank-based inference for the accelerated failure time model," Biometrika, Biometrika Trust, vol. 90(2), pages 341-353, June.
    6. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    7. Sijian Wang & Bin Nan & Ji Zhu & David G. Beer, 2008. "Doubly Penalized Buckley–James Method for Survival Data with High-Dimensional Covariates," Biometrics, The International Biometric Society, vol. 64(1), pages 132-140, March.
    8. Fei Gao & Donglin Zeng & Dan‐Yu Lin, 2017. "Semiparametric estimation of the accelerated failure time model with partly interval‐censored data," Biometrics, The International Biometric Society, vol. 73(4), pages 1161-1168, December.
    9. Johnson, Brent A. & Lin, D.Y. & Zeng, Donglin, 2008. "Penalized Estimating Functions and Variable Selection in Semiparametric Regression Models," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 672-680, June.
    10. Florian Frommlet & Grégory Nuel, 2016. "An Adaptive Ridge Procedure for L0 Regularization," PLOS ONE, Public Library of Science, vol. 11(2), pages 1-23, February.
    11. Dai, Linlin & Chen, Kani & Sun, Zhihua & Liu, Zhenqiu & Li, Gang, 2018. "Broken adaptive ridge regression and its asymptotic properties," Journal of Multivariate Analysis, Elsevier, vol. 168(C), pages 334-351.
    12. Zhihua Sun & Yi Liu & Kani Chen & Gang Li, 2022. "Broken adaptive ridge regression for right-censored survival data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(1), pages 69-91, February.
    13. Choi, Taehwa & Kim, Arlene K.H. & Choi, Sangbum, 2021. "Semiparametric least-squares regression with doubly-censored data," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).
    14. Zhezhen Jin & D. Y. Lin & Zhiliang Ying, 2006. "On least-squares regression with censored data," Biometrika, Biometrika Trust, vol. 93(1), pages 147-161, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhihua Sun & Yi Liu & Kani Chen & Gang Li, 2022. "Broken adaptive ridge regression for right-censored survival data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(1), pages 69-91, February.
    2. Dong, Qingkai & Liu, Binxia & Zhao, Hui, 2023. "Weighted least squares model averaging for accelerated failure time models," Computational Statistics & Data Analysis, Elsevier, vol. 184(C).
    3. Takumi Saegusa & Tianzhou Ma & Gang Li & Ying Qing Chen & Mei-Ling Ting Lee, 2020. "Variable Selection in Threshold Regression Model with Applications to HIV Drug Adherence Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 12(3), pages 376-398, December.
    4. Xingwei Tong & Xin He & Liuquan Sun & Jianguo Sun, 2009. "Variable Selection for Panel Count Data via Non‐Concave Penalized Estimating Function," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(4), pages 620-635, December.
    5. T. Cai & J. Huang & L. Tian, 2009. "Regularized Estimation for the Accelerated Failure Time Model," Biometrics, The International Biometric Society, vol. 65(2), pages 394-404, June.
    6. Ying Ding & Bin Nan, 2015. "Estimating Mean Survival Time: When is it Possible?," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(2), pages 397-413, June.
    7. Du, Mingyue & Zhao, Xingqiu, 2024. "A conditional approach for regression analysis of case K interval-censored failure time data with informative censoring," Computational Statistics & Data Analysis, Elsevier, vol. 198(C).
    8. Fengting Yi & Niansheng Tang & Jianguo Sun, 2022. "Simultaneous variable selection and estimation for joint models of longitudinal and failure time data with interval censoring," Biometrics, The International Biometric Society, vol. 78(1), pages 151-164, March.
    9. Du, Mingyue & Zhao, Xingqiu & Sun, Jianguo, 2022. "Variable selection for case-cohort studies with informatively interval-censored outcomes," Computational Statistics & Data Analysis, Elsevier, vol. 172(C).
    10. Li Liu & Hao Wang & Yanyan Liu & Jian Huang, 2021. "Model pursuit and variable selection in the additive accelerated failure time model," Statistical Papers, Springer, vol. 62(6), pages 2627-2659, December.
    11. Rong Liu & Shishun Zhao & Tao Hu & Jianguo Sun, 2022. "Variable Selection for Generalized Linear Models with Interval-Censored Failure Time Data," Mathematics, MDPI, vol. 10(5), pages 1-18, February.
    12. Yuxiang Wu & Hui Zhao & Jianguo Sun, 2023. "Group variable selection for the Cox model with interval‐censored failure time data," Biometrics, The International Biometric Society, vol. 79(4), pages 3082-3095, December.
    13. Xu, Yang & Zhao, Shishun & Hu, Tao & Sun, Jianguo, 2021. "Variable selection for generalized odds rate mixture cure models with interval-censored failure time data," Computational Statistics & Data Analysis, Elsevier, vol. 156(C).
    14. Yize Zhao & Matthias Chung & Brent A. Johnson & Carlos S. Moreno & Qi Long, 2016. "Hierarchical Feature Selection Incorporating Known and Novel Biological Information: Identifying Genomic Features Related to Prostate Cancer Recurrence," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1427-1439, October.
    15. Xu, Linzhi & Zhang, Jiajia, 2010. "An EM-like algorithm for the semiparametric accelerated failure time gamma frailty model," Computational Statistics & Data Analysis, Elsevier, vol. 54(6), pages 1467-1474, June.
    16. Joseph G. Ibrahim & Hongtu Zhu & Ramon I. Garcia & Ruixin Guo, 2011. "Fixed and Random Effects Selection in Mixed Effects Models," Biometrics, The International Biometric Society, vol. 67(2), pages 495-503, June.
    17. Yongjin Li & Qingzhao Zhang & Qihua Wang, 2017. "Penalized estimation equation for an extended single-index model," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(1), pages 169-187, February.
    18. Zhangong Zhou & Rong Jiang & Weimin Qian, 2013. "LAD variable selection for linear models with randomly censored data," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(2), pages 287-300, February.
    19. Zhang, Jiajia & Peng, Yingwei, 2009. "Crossing hazard functions in common survival models," Statistics & Probability Letters, Elsevier, vol. 79(20), pages 2124-2130, October.
    20. Choi, Taehwa & Kim, Arlene K.H. & Choi, Sangbum, 2021. "Semiparametric least-squares regression with doubly-censored data," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:39:y:2024:i:6:d:10.1007_s00180-023-01446-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.