IDEAS home Printed from https://ideas.repec.org/a/spr/aistmt/v74y2022i3d10.1007_s10463-021-00809-z.html
   My bibliography  Save this article

A high-dimensional M-estimator framework for bi-level variable selection

Author

Listed:
  • Bin Luo

    (Duke University)

  • Xiaoli Gao

    (The University of North Carolina at Greensboro)

Abstract

In high-dimensional data analysis, bi-level sparsity is often assumed when covariates function group-wisely and sparsity can appear either at the group level or within certain groups. In such cases, an ideal model should be able to encourage the bi-level variable selection consistently. Bi-level variable selection has become even more challenging when data have heavy-tailed distribution or outliers exist in random errors and covariates. In this paper, we study a framework of high-dimensional M-estimation for bi-level variable selection. This framework encourages bi-level sparsity through a computationally efficient two-stage procedure. In theory, we provide sufficient conditions under which our two-stage penalized M-estimator possesses simultaneous local estimation consistency and the bi-level variable selection consistency if certain non-convex penalty functions are used at the group level. Both our simulation studies and real data analysis demonstrate satisfactory finite sample performance of the proposed estimators under different irregular settings.

Suggested Citation

  • Bin Luo & Xiaoli Gao, 2022. "A high-dimensional M-estimator framework for bi-level variable selection," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(3), pages 559-579, June.
  • Handle: RePEc:spr:aistmt:v:74:y:2022:i:3:d:10.1007_s10463-021-00809-z
    DOI: 10.1007/s10463-021-00809-z
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10463-021-00809-z
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10463-021-00809-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Patrick Breheny, 2015. "The group exponential lasso for bi‐level variable selection," Biometrics, The International Biometric Society, vol. 71(3), pages 731-740, September.
    2. NESTEROV, Yurii, 2013. "Gradient methods for minimizing composite functions," LIDAM Reprints CORE 2510, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    3. Mingqiu Wang & Guo-Liang Tian, 2016. "Robust group non-convex estimations for high-dimensional partially linear models," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 28(1), pages 49-67, March.
    4. Jian Huang & Shuange Ma & Huiliang Xie & Cun-Hui Zhang, 2009. "A group bridge approach for variable selection," Biometrika, Biometrika Trust, vol. 96(2), pages 339-355.
    5. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    6. Guo, Xiao & Zhang, Hai & Wang, Yao & Wu, Jiang-Lun, 2015. "Model selection and estimation in high dimensional regression models with group SCAD," Statistics & Probability Letters, Elsevier, vol. 103(C), pages 86-92.
    7. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Reetika Sarkar & Sithija Manage & Xiaoli Gao, 2024. "Stable Variable Selection for High-Dimensional Genomic Data with Strong Correlations," Annals of Data Science, Springer, vol. 11(4), pages 1139-1164, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Huang Hailin & Shangguan Jizi & Ruan Peifeng & Liang Hua, 2019. "Bi-level feature selection in high dimensional AFT models with applications to a genomic study," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 18(5), pages 1-11, October.
    2. Zhang, Xin & Zhao, Junlong, 2024. "Group variable selection via group sparse neural network," Computational Statistics & Data Analysis, Elsevier, vol. 192(C).
    3. Wenyan Zhong & Xuewen Lu & Jingjing Wu, 2021. "Bi-level variable selection in semiparametric transformation models with right-censored data," Computational Statistics, Springer, vol. 36(3), pages 1661-1692, September.
    4. Shen, Lijuan & Tang, Yanlin & Tang, Loon Ching, 2021. "Understanding key factors affecting power systems resilience," Reliability Engineering and System Safety, Elsevier, vol. 212(C).
    5. Sunghoon Kwon & Jeongyoun Ahn & Woncheol Jang & Sangin Lee & Yongdai Kim, 2017. "A doubly sparse approach for group variable selection," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(5), pages 997-1025, October.
    6. Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
    7. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    8. Takumi Saegusa & Tianzhou Ma & Gang Li & Ying Qing Chen & Mei-Ling Ting Lee, 2020. "Variable Selection in Threshold Regression Model with Applications to HIV Drug Adherence Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 12(3), pages 376-398, December.
    9. Pei Wang & Shunjie Chen & Sijia Yang, 2022. "Recent Advances on Penalized Regression Models for Biological Data," Mathematics, MDPI, vol. 10(19), pages 1-24, October.
    10. Xianwen Ding & Zhihuang Yang, 2024. "Adaptive Bi-Level Variable Selection for Quantile Regression Models with a Diverging Number of Covariates," Mathematics, MDPI, vol. 12(20), pages 1-23, October.
    11. Yanfang Zhang & Chuanhua Wei & Xiaolin Liu, 2022. "Group Logistic Regression Models with l p,q Regularization," Mathematics, MDPI, vol. 10(13), pages 1-15, June.
    12. Hu, Jianhua & Liu, Xiaoqian & Liu, Xu & Xia, Ningning, 2022. "Some aspects of response variable selection and estimation in multivariate linear regression," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    13. Yanming Li & Bin Nan & Ji Zhu, 2015. "Multivariate sparse group lasso for the multivariate multiple linear regression with an arbitrary group structure," Biometrics, The International Biometric Society, vol. 71(2), pages 354-363, June.
    14. Arfan Raheen Afzal & Jing Yang & Xuewen Lu, 2021. "Variable selection in partially linear additive hazards model with grouped covariates and a diverging number of parameters," Computational Statistics, Springer, vol. 36(2), pages 829-855, June.
    15. Mehmet Caner & Xu Han, 2014. "Selecting the Correct Number of Factors in Approximate Factor Models: The Large Panel Case With Group Bridge Estimators," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 32(3), pages 359-374, July.
    16. Mike K. P. So & Wing Ki Liu & Amanda M. Y. Chu, 2018. "Bayesian Shrinkage Estimation Of Time-Varying Covariance Matrices In Financial Time Series," Advances in Decision Sciences, Asia University, Taiwan, vol. 22(1), pages 369-404, December.
    17. Wang, Ning & Guo, Ziyu & Shang, Dawei & Li, Keyuyang, 2024. "Carbon trading price forecasting in digitalization social change era using an explainable machine learning approach: The case of China as emerging country evidence," Technological Forecasting and Social Change, Elsevier, vol. 200(C).
    18. Zhigeng Geng & Sijian Wang & Menggang Yu & Patrick O. Monahan & Victoria Champion & Grace Wahba, 2015. "Group variable selection via convex log-exp-sum penalty with application to a breast cancer survivor study," Biometrics, The International Biometric Society, vol. 71(1), pages 53-62, March.
    19. Behrendt, Simon & Schweikert, Karsten, 2021. "A Note on Adaptive Group Lasso for Structural Break Time Series," Econometrics and Statistics, Elsevier, vol. 17(C), pages 156-172.
    20. Shanshan Qin & Hao Ding & Yuehua Wu & Feng Liu, 2021. "High-dimensional sign-constrained feature selection and grouping," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(4), pages 787-819, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aistmt:v:74:y:2022:i:3:d:10.1007_s10463-021-00809-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.