IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v191y2024ics0167947323001688.html
   My bibliography  Save this article

Distributed debiased estimation of high-dimensional partially linear models with jumps

Author

Listed:
  • Zhao, Yan-Yong
  • Zhang, Yuchun
  • Liu, Yuan
  • Ismail, Noriszura

Abstract

In this paper, we focus on the estimations of both parameter vector and nonparametric component in a high-dimensional partially linear model with jumps within the framework of divide and conquer strategy. We find that a three-stage estimation procedure works well in this setting. Applying the lasso penalty and projected spline approximation, first a profiled estimator for the linear part and a projected spline estimator for the nonparametric part are obtained on each local machine. In the second stage, an efficient jump detection algorithm is developed to obtain the new knot sequence, and then based on this, the estimate of the nonparametric function is obtained and averaged after plugging in the linear part estimate on each local machine. The aggregated estimate of the nonparametric function is then computed by pooling these local estimates. In the third stage, a debiased lasso estimator is averaged to obtain a distributed debiased estimator of the linear part after plugging in the aggregated estimate of nonparametric function. Asymptotic properties of resultant estimators are established under some mild assumptions. Some simulations are conducted to illustrate the empirical performances of our proposed method.

Suggested Citation

  • Zhao, Yan-Yong & Zhang, Yuchun & Liu, Yuan & Ismail, Noriszura, 2024. "Distributed debiased estimation of high-dimensional partially linear models with jumps," Computational Statistics & Data Analysis, Elsevier, vol. 191(C).
  • Handle: RePEc:eee:csdana:v:191:y:2024:i:c:s0167947323001688
    DOI: 10.1016/j.csda.2023.107857
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947323001688
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2023.107857?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Heng Lian & Xin Chen & Jian-Yi Yang, 2012. "Identification of Partially Linear Structure in Additive Models with an Application to Gene Expression Prediction from Sequences," Biometrics, The International Biometric Society, vol. 68(2), pages 437-445, June.
    2. Robinson, Peter M, 1988. "Root- N-Consistent Semiparametric Regression," Econometrica, Econometric Society, vol. 56(4), pages 931-954, July.
    3. Holland, Ashley D., 2017. "Penalized spline estimation in the partially linear model," Journal of Multivariate Analysis, Elsevier, vol. 153(C), pages 211-235.
    4. Shi, Chengchun & Lu, Wenbin & Song, Rui, 2018. "A massive data framework for M-estimators with cubic-rate," LSE Research Online Documents on Economics 102111, London School of Economics and Political Science, LSE Library.
    5. Liang, Hua & Li, Runze, 2009. "Variable Selection for Partially Linear Models With Measurement Errors," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 234-248.
    6. Zhang, Hao Helen & Cheng, Guang & Liu, Yufeng, 2011. "Linear or Nonlinear? Automatic Structure Discovery for Partially Linear Models," Journal of the American Statistical Association, American Statistical Association, vol. 106(495), pages 1099-1112.
    7. Chengchun Shi & Wenbin Lu & Rui Song, 2018. "A Massive Data Framework for M-Estimators with Cubic-Rate," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(524), pages 1698-1709, October.
    8. Yong He & Xinbing Kong & Long Yu & Xinsheng Zhang, 2022. "Large-Dimensional Factor Analysis Without Moment Constraints," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(1), pages 302-312, January.
    9. Roozbeh, M. & Arashi, M., 2013. "Feasible ridge estimator in partially linear models," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 35-44.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chen, Canyi & Xu, Wangli & Zhu, Liping, 2022. "Distributed estimation in heterogeneous reduced rank regression: With application to order determination in sufficient dimension reduction," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
    2. Naoya Sueishi & Arihiro Yoshimura, 2017. "Focused Information Criterion for Series Estimation in Partially Linear Models," The Japanese Economic Review, Japanese Economic Association, vol. 68(3), pages 352-363, September.
    3. Wang, Xiuli & Zhao, Shengli & Wang, Mingqiu, 2017. "Restricted profile estimation for partially linear models with large-dimensional covariates," Statistics & Probability Letters, Elsevier, vol. 128(C), pages 71-76.
    4. Ruiqi Liu & Ben Boukai & Zuofeng Shang, 2019. "Statistical Inference on Partially Linear Panel Model under Unobserved Linearity," Papers 1911.08830, arXiv.org.
    5. Fengrui Di & Lei Wang, 2022. "Multi-round smoothed composite quantile regression for distributed data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(5), pages 869-893, October.
    6. Marcelo M. Taddeo & Pedro A. Morettin, 2023. "Bayesian P-Splines Applied to Semiparametric Models with Errors Following a Scale Mixture of Normals," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 85(2), pages 1331-1355, August.
    7. Jun Zhang & Zhenghui Feng & Peirong Xu & Hua Liang, 2017. "Generalized varying coefficient partially linear measurement errors models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(1), pages 97-120, February.
    8. Jiawei Hou & Yunquan Song, 2022. "Interquantile shrinkage in spatial additive autoregressive models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(4), pages 1030-1057, December.
    9. Yang, Shuquan & Ling, Nengxiang, 2023. "Robust projected principal component analysis for large-dimensional semiparametric factor modeling," Journal of Multivariate Analysis, Elsevier, vol. 195(C).
    10. Zhang, Haixiang & Wang, HaiYing, 2021. "Distributed subdata selection for big data via sampling-based approach," Computational Statistics & Data Analysis, Elsevier, vol. 153(C).
    11. Le-Yu Chen & Sokbae Lee, 2018. "High Dimensional Classification through $\ell_0$-Penalized Empirical Risk Minimization," Papers 1811.09540, arXiv.org.
    12. Xuewen Lu & Peter X.-K. Song, 2015. "Efficient Estimation of the Partly Linear Additive Hazards Model with Current Status Data," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(1), pages 306-328, March.
    13. Wang, Jia & Cai, Xizhen & Li, Runze, 2021. "Variable selection for partially linear models via Bayesian subset modeling with diffusing prior," Journal of Multivariate Analysis, Elsevier, vol. 183(C).
    14. Xuejun Ma & Shaochen Wang & Wang Zhou, 2022. "Statistical inference in massive datasets by empirical likelihood," Computational Statistics, Springer, vol. 37(3), pages 1143-1164, July.
    15. Bao, Yajie & Ren, Haojie, 2023. "Semi-profiled distributed estimation for high-dimensional partially linear model," Computational Statistics & Data Analysis, Elsevier, vol. 188(C).
    16. Tom Boot & Art=uras Juodis, 2023. "Uniform Inference in Linear Error-in-Variables Models: Divide-and-Conquer," Papers 2301.04439, arXiv.org.
    17. Feng Li & Lu Lin & Yuxia Su, 2013. "Variable selection and parameter estimation for partially linear models via Dantzig selector," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(2), pages 225-238, February.
    18. Lian, Heng & Meng, Jie & Zhao, Kaifeng, 2015. "Spline estimator for simultaneous variable selection and constant coefficient identification in high-dimensional generalized varying-coefficient models," Journal of Multivariate Analysis, Elsevier, vol. 141(C), pages 81-103.
    19. Shi, Jianwei & Qin, Guoyou & Zhu, Huichen & Zhu, Zhongyi, 2021. "Communication-efficient distributed M-estimation with missing data," Computational Statistics & Data Analysis, Elsevier, vol. 161(C).
    20. Lulu Zuo & Haixiang Zhang & HaiYing Wang & Liuquan Sun, 2021. "Optimal subsample selection for massive logistic regression with distributed data," Computational Statistics, Springer, vol. 36(4), pages 2535-2562, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:191:y:2024:i:c:s0167947323001688. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.