
Spline estimator for simultaneous variable selection and constant coefficient identification in high-dimensional generalized varying-coefficient models

Author

Listed:
  • Lian, Heng
  • Meng, Jie
  • Zhao, Kaifeng

Abstract

In this paper, we are concerned with two common and related problems for generalized varying-coefficient models: variable selection and constant coefficient identification. Starting with a specification of generalized varying-coefficient models that allows possible nonlinear interactions between the index variable and all other predictors, we propose a polynomial-spline based procedure that simultaneously eliminates irrelevant predictors and identifies predictors that do not interact with the index variable. Our approach is based on a double-penalization strategy in which two penalty functions, one for each of these two related purposes, are combined in a single functional. In a “large p, small n” setting, we establish the convergence rates of the estimator under suitable regularity assumptions. Motivated by its previous success in parametric models, we use the extended Bayesian information criterion (eBIC) to automatically choose the regularization parameters. Finally, a post-penalization estimator is proposed to further reduce the bias of the resulting estimator. Monte Carlo simulations are conducted to examine the finite sample performance of the proposed procedures, and an application to a leukemia dataset is presented.
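The double-penalization idea and the eBIC criterion mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the exact penalty functions or how degrees of freedom are counted, so the group-lasso-type penalty form, the function names (`group_norm`, `nonconstant_norm`, `double_penalty`, `ebic`), and the arguments `lam1`, `lam2`, `df`, and `n_selected` are all assumptions made for illustration. The sketch relies on one standard fact: a B-spline basis sums to one, so a coefficient function whose spline coefficients are all equal is constant. The eBIC formula follows the general form in Chen and Chen (2008), listed in the references.

```python
import math


def group_norm(coefs):
    """Euclidean norm of one predictor's spline coefficient vector.

    It is zero only when the whole coefficient function is zero,
    i.e. the predictor is irrelevant.
    """
    return math.sqrt(sum(c * c for c in coefs))


def nonconstant_norm(coefs):
    """Norm of the deviation of the spline coefficients from their mean.

    Assuming a B-spline basis (which sums to one), this is zero exactly
    when all coefficients are equal, i.e. the coefficient function is
    constant and the predictor does not interact with the index variable.
    """
    mean = sum(coefs) / len(coefs)
    return math.sqrt(sum((c - mean) ** 2 for c in coefs))


def double_penalty(coef_groups, lam1, lam2):
    """Illustrative double penalty (assumed group-lasso-type form).

    lam1 penalizes each whole coefficient function (variable selection);
    lam2 penalizes its non-constant part (constant identification).
    """
    return sum(
        lam1 * group_norm(b) + lam2 * nonconstant_norm(b)
        for b in coef_groups
    )


def ebic(log_lik, df, n_selected, n, p, gamma=1.0):
    """Extended BIC: -2 logL + df * log(n) + 2 * gamma * log C(p, n_selected).

    gamma = 0 recovers the ordinary BIC. Treating df and n_selected as
    separate inputs is an assumption of this sketch.
    """
    log_binom = (
        math.lgamma(p + 1)
        - math.lgamma(n_selected + 1)
        - math.lgamma(p - n_selected + 1)
    )
    return -2.0 * log_lik + df * math.log(n) + 2.0 * gamma * log_binom
```

In practice one would fit the penalized spline model over a grid of `(lam1, lam2)` values and keep the pair with the smallest `ebic` value; the fitting step itself is outside the scope of this sketch.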

Suggested Citation

  • Lian, Heng & Meng, Jie & Zhao, Kaifeng, 2015. "Spline estimator for simultaneous variable selection and constant coefficient identification in high-dimensional generalized varying-coefficient models," Journal of Multivariate Analysis, Elsevier, vol. 141(C), pages 81-103.
  • Handle: RePEc:eee:jmvana:v:141:y:2015:i:c:p:81-103
    DOI: 10.1016/j.jmva.2015.06.011

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X15001566
    Download Restriction: Full text for ScienceDirect subscribers only


    References listed on IDEAS

    1. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    2. Heng Lian & Xin Chen & Jian-Yi Yang, 2012. "Identification of Partially Linear Structure in Additive Models with an Application to Gene Expression Prediction from Sequences," Biometrics, The International Biometric Society, vol. 68(2), pages 437-445, June.
    3. Jiahua Chen & Zehua Chen, 2008. "Extended Bayesian information criteria for model selection with large model spaces," Biometrika, Biometrika Trust, vol. 95(3), pages 759-771.
    4. Lam, Clifford & Fan, Jianqing, 2008. "Profile-kernel likelihood inference with diverging number of parameters," LSE Research Online Documents on Economics 31548, London School of Economics and Political Science, LSE Library.
    5. Yingcun Xia, 2004. "Efficient estimation for semivarying-coefficient models," Biometrika, Biometrika Trust, vol. 91(3), pages 661-681, September.
    6. Jianhua Z. Huang, 2002. "Varying-coefficient models and basis function approximations for the analysis of repeated measurements," Biometrika, Biometrika Trust, vol. 89(1), pages 111-128, March.
    7. Fan, Jianqing & Feng, Yang & Song, Rui, 2011. "Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Additive Models," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 544-557.
    8. Noh, Hohsuk & Van Keilegom, Ingrid, 2012. "Efficient model selection in semivarying coefficient models," LIDAM Reprints ISBA 2012029, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    9. Zhang, Hao Helen & Cheng, Guang & Liu, Yufeng, 2011. "Linear or Nonlinear? Automatic Structure Discovery for Partially Linear Models," Journal of the American Statistical Association, American Statistical Association, vol. 106(495), pages 1099-1112.
    10. Noh, Hohsuk & Van Keilegom, Ingrid, 2012. "Efficient Model Selection in Semivarying Coefficient Models," LIDAM Discussion Papers ISBA 2012025, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    11. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    12. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Li, Xinyi & Wang, Li & Nettleton, Dan, 2019. "Sparse model identification and learning for ultra-high-dimensional additive partially linear models," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 204-228.
    2. Zhang, Shucong & Zhou, Yong, 2018. "Variable screening for ultrahigh dimensional heterogeneous data via conditional quantile correlations," Journal of Multivariate Analysis, Elsevier, vol. 165(C), pages 1-13.
    3. Zhaoping Hong & Yuao Hu & Heng Lian, 2013. "Variable selection for high-dimensional varying coefficient partially linear models via nonconcave penalty," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(7), pages 887-908, October.
    4. Lian, Heng, 2014. "Semiparametric Bayesian information criterion for model selection in ultra-high dimensional additive models," Journal of Multivariate Analysis, Elsevier, vol. 123(C), pages 304-310.
    5. Yanhang Zhang & Junxian Zhu & Jin Zhu & Xueqin Wang, 2023. "A Splicing Approach to Best Subset of Groups Selection," INFORMS Journal on Computing, INFORMS, vol. 35(1), pages 104-119, January.
    6. Zhang, Ting, 2015. "Semiparametric model building for regression models with time-varying parameters," Journal of Econometrics, Elsevier, vol. 187(1), pages 189-200.
    7. Xiangyu Wang & Chenlei Leng, 2016. "High dimensional ordinary least squares projection for screening variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(3), pages 589-611, June.
    8. Yoshida, Takuma, 2018. "Semiparametric method for model structure discovery in additive regression models," Econometrics and Statistics, Elsevier, vol. 5(C), pages 124-136.
    9. Lichun Wang & Peng Lai & Heng Lian, 2013. "Polynomial spline estimation for generalized varying coefficient partially linear models with a diverging number of components," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(8), pages 1083-1103, November.
    10. Jiawei Hou & Yunquan Song, 2022. "Interquantile shrinkage in spatial additive autoregressive models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(4), pages 1030-1057, December.
    11. Weihua Zhao & Riquan Zhang & Jicai Liu & Yazhao Lv, 2014. "Robust and efficient variable selection for semiparametric partially linear varying coefficient model based on modal regression," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 66(1), pages 165-191, February.
    12. Byeong U. Park & Enno Mammen & Young K. Lee & Eun Ryung Lee, 2015. "Varying Coefficient Regression Models: A Review and New Developments," International Statistical Review, International Statistical Institute, vol. 83(1), pages 36-64, April.
    13. Du, Pang & Cheng, Guang & Liang, Hua, 2012. "Semiparametric regression models with additive nonparametric components and high dimensional parametric components," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 2006-2017.
    14. Lian, Heng & Feng, Sanying & Zhao, Kaifeng, 2015. "Parametric and semiparametric reduced-rank regression with flexible sparsity," Journal of Multivariate Analysis, Elsevier, vol. 136(C), pages 163-174.
    15. HONDA, Toshio & 本田, 敏雄 & ING, Ching-Kang & WU, Wei-Ying, 2017. "Adaptively weighted group Lasso for semiparametric quantile regression models," Discussion Papers 2017-04, Graduate School of Economics, Hitotsubashi University.
    16. Shan Luo & Zehua Chen, 2014. "Sequential Lasso Cum EBIC for Feature Selection With Ultra-High Dimensional Feature Space," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1229-1240, September.
    17. Loann David Denis Desboulets, 2018. "A Review on Variable Selection in Regression Analysis," Econometrics, MDPI, vol. 6(4), pages 1-27, November.
    18. Fang Lu & Jing Yang & Xuewen Lu, 2022. "One-step oracle procedure for semi-parametric spatial autoregressive model and its empirical application to Boston housing price data," Empirical Economics, Springer, vol. 62(6), pages 2645-2671, June.
    19. Qingliang Fan & Yaqian Wu, 2020. "Endogenous Treatment Effect Estimation with some Invalid and Irrelevant Instruments," Papers 2006.14998, arXiv.org.
    20. Luke Mosley & Idris A. Eckley & Alex Gibberd, 2022. "Sparse temporal disaggregation," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 2203-2233, October.
