IDEAS home Printed from https://ideas.repec.org/a/spr/metron/v82y2024i2d10.1007_s40300-023-00253-4.html
   My bibliography  Save this article

Generalized regression estimators with concave penalties and a comparison to lasso type estimators

Author

Listed:
  • Elena McDonald

    (Archer Daniels Midland)

  • Xin Wang

    (San Diego State University)

Abstract

The generalized regression (GREG) estimator is usually used in survey sampling when incorporating auxiliary information. Generally, not all available covariates significantly contribute to the estimation process when there are multiple covariates. We propose two new GREG estimators based on concave penalties: one built from the smoothly clipped absolute deviation (SCAD) penalty and the other built from the minimax concave penalty (MCP). The performances of these estimators are compared to lasso-type estimators through a simulation study in a simple random sample (SRS) setting and a probability proportional to size (PPS) sample setting. It is shown that the proposed estimators produce improved estimates of the population total compared to that of the traditional GREG estimator and the estimators built from LASSO. Asymptotic properties are also derived for the proposed estimators. Bootstrap methods are also explored to improve coverage probability when the sample size is small.

Suggested Citation

  • Elena McDonald & Xin Wang, 2024. "Generalized regression estimators with concave penalties and a comparison to lasso type estimators," METRON, Springer;Sapienza Università di Roma, vol. 82(2), pages 213-239, August.
  • Handle: RePEc:spr:metron:v:82:y:2024:i:2:d:10.1007_s40300-023-00253-4
    DOI: 10.1007/s40300-023-00253-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40300-023-00253-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s40300-023-00253-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wang, Lifeng & Li, Hongzhe & Huang, Jianhua Z., 2008. "Variable Selection in Nonparametric Varying-Coefficient Models for Analysis of Repeated Measurements," Journal of the American Statistical Association, American Statistical Association, vol. 103(484), pages 1556-1569.
    2. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    3. Wang, Hansheng & Li, Guodong & Jiang, Guohua, 2007. "Robust Regression Shrinkage and Consistent Variable Selection Through the LAD-Lasso," Journal of Business & Economic Statistics, American Statistical Association, vol. 25, pages 347-355, July.
    4. Marco Avella-Medina & Elvezio Ronchetti, 2018. "Robust and consistent variable selection in high-dimensional generalized linear models," Biometrika, Biometrika Trust, vol. 105(1), pages 31-44.
    5. Shujie Ma & Jian Huang, 2017. "A Concave Pairwise Fusion Approach to Subgroup Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(517), pages 410-423, January.
    6. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    7. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    8. Kim, Yongdai & Choi, Hosik & Oh, Hee-Seok, 2008. "Smoothly Clipped Absolute Deviation on High Dimensions," Journal of the American Statistical Association, American Statistical Association, vol. 103(484), pages 1665-1673.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gaorong Li & Liugen Xue & Heng Lian, 2012. "SCAD-penalised generalised additive models with non-polynomial dimensionality," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(3), pages 681-697.
    2. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    3. Li, Xinjue & Zboňáková, Lenka & Wang, Weining & Härdle, Wolfgang Karl, 2019. "Combining Penalization and Adaption in High Dimension with Application in Bond Risk Premia Forecasting," IRTG 1792 Discussion Papers 2019-030, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    4. N. Neykov & P. Filzmoser & P. Neytchev, 2014. "Ultrahigh dimensional variable selection through the penalized maximum trimmed likelihood estimator," Statistical Papers, Springer, vol. 55(1), pages 187-207, February.
    5. Long Feng & Changliang Zou & Zhaojun Wang & Xianwu Wei & Bin Chen, 2015. "Robust spline-based variable selection in varying coefficient model," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 78(1), pages 85-118, January.
    6. Zhaoping Hong & Yuao Hu & Heng Lian, 2013. "Variable selection for high-dimensional varying coefficient partially linear models via nonconcave penalty," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(7), pages 887-908, October.
    7. Ayanendranath Basu & Abhik Ghosh & Maria Jaenada & Leandro Pardo, 2024. "Robust adaptive LASSO in high-dimensional logistic regression," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 33(5), pages 1217-1249, November.
    8. Kean Ming Tan & Lan Wang & Wen‐Xin Zhou, 2022. "High‐dimensional quantile regression: Convolution smoothing and concave regularization," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(1), pages 205-233, February.
    9. Benjamin G. Stokell & Rajen D. Shah & Ryan J. Tibshirani, 2021. "Modelling high‐dimensional categorical data using nonconvex fusion penalties," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(3), pages 579-611, July.
    10. Lian, Heng & Li, Jianbo & Tang, Xingyu, 2014. "SCAD-penalized regression in additive partially linear proportional hazards models with an ultra-high-dimensional linear part," Journal of Multivariate Analysis, Elsevier, vol. 125(C), pages 50-64.
    11. Mingqiu Wang & Guo-Liang Tian, 2016. "Robust group non-convex estimations for high-dimensional partially linear models," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 28(1), pages 49-67, March.
    12. Yanxin Wang & Qibin Fan & Li Zhu, 2018. "Variable selection and estimation using a continuous approximation to the $$L_0$$ L 0 penalty," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(1), pages 191-214, February.
    13. Muhammad Amin & Lixin Song & Milton Abdul Thorlie & Xiaoguang Wang, 2015. "SCAD-penalized quantile regression for high-dimensional data analysis and variable selection," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 69(3), pages 212-235, August.
    14. Zhao, Weihua & Lian, Heng, 2017. "Quantile index coefficient model with variable selection," Journal of Multivariate Analysis, Elsevier, vol. 154(C), pages 40-58.
    15. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    16. Hui Xiao & Yiguo Sun, 2020. "Forecasting the Returns of Cryptocurrency: A Model Averaging Approach," JRFM, MDPI, vol. 13(11), pages 1-15, November.
    17. Zhu Wang, 2022. "MM for penalized estimation," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(1), pages 54-75, March.
    18. Naimoli, Antonio, 2022. "Modelling the persistence of Covid-19 positivity rate in Italy," Socio-Economic Planning Sciences, Elsevier, vol. 82(PA).
    19. Xiaotong Shen & Wei Pan & Yunzhang Zhu & Hui Zhou, 2013. "On constrained and regularized high-dimensional regression," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(5), pages 807-832, October.
    20. Umberto Amato & Anestis Antoniadis & Italia De Feis & Irene Gijbels, 2021. "Penalised robust estimators for sparse and high-dimensional linear models," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 1-48, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:metron:v:82:y:2024:i:2:d:10.1007_s40300-023-00253-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.