IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v188y2023ics0167947323001354.html
   My bibliography  Save this article

Semi-profiled distributed estimation for high-dimensional partially linear model

Author

Listed:
  • Bao, Yajie
  • Ren, Haojie

Abstract

As a leading example in the semiparametric model, the partially linear model is prominent in modeling complex data. The challenges associated with designing an efficient estimation algorithm in the distributed environment are not yet well addressed. The existing works require a constraint on the number of local machines to guarantee the optimality of the global estimator. In addition, a multi-round profiled estimator will lead to huge communication complexity. To further reduce communication costs, a novel semi-profiled estimation procedure is proposed, which provides an iterative skeleton to reduce estimation error. A new multi-round distributed algorithm based on the centralized semi-profiled estimator is developed, which can estimate the parametric and nonparametric simultaneously. The theoretical results indicate that the corresponding convergence rates can achieve the optimal orders within a constant number of communication rounds. The advantages of the proposed estimation method are demonstrated via numerical experiments on synthetic and real data.

Suggested Citation

  • Bao, Yajie & Ren, Haojie, 2023. "Semi-profiled distributed estimation for high-dimensional partially linear model," Computational Statistics & Data Analysis, Elsevier, vol. 188(C).
  • Handle: RePEc:eee:csdana:v:188:y:2023:i:c:s0167947323001354
    DOI: 10.1016/j.csda.2023.107824
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947323001354
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2023.107824?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Xi Chen & Jason D. Lee & He Li & Yun Yang, 2022. "Distributed Estimation for Principal Component Analysis: An Enlarged Eigenspace Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 117(540), pages 1775-1786, October.
    2. Robinson, Peter M, 1988. "Root- N-Consistent Semiparametric Regression," Econometrica, Econometric Society, vol. 56(4), pages 931-954, July.
    3. Jianqing Fan & Yunbei Ma & Wei Dai, 2014. "Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Varying Coefficient Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(507), pages 1270-1284, September.
    4. Wang, Yue & Zhou, Yan & Li, Rui & Lian, Heng, 2022. "Sparse high-dimensional semi-nonparametric quantile regression in a reproducing kernel Hilbert space," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    5. Liang, Hua & Li, Runze, 2009. "Variable Selection for Partially Linear Models With Measurement Errors," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 234-248.
    6. Michael I. Jordan & Jason D. Lee & Yun Yang, 2019. "Communication-Efficient Distributed Statistical Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 668-681, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Jia & Cai, Xizhen & Li, Runze, 2021. "Variable selection for partially linear models via Bayesian subset modeling with diffusing prior," Journal of Multivariate Analysis, Elsevier, vol. 183(C).
    2. Wang, Xiuli & Zhao, Shengli & Wang, Mingqiu, 2017. "Restricted profile estimation for partially linear models with large-dimensional covariates," Statistics & Probability Letters, Elsevier, vol. 128(C), pages 71-76.
    3. Feng Li & Lu Lin & Yuxia Su, 2013. "Variable selection and parameter estimation for partially linear models via Dantzig selector," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 76(2), pages 225-238, February.
    4. Jun Zhang & Zhenghui Feng & Peirong Xu & Hua Liang, 2017. "Generalized varying coefficient partially linear measurement errors models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 69(1), pages 97-120, February.
    5. Zhao, Yan-Yong & Zhang, Yuchun & Liu, Yuan & Ismail, Noriszura, 2024. "Distributed debiased estimation of high-dimensional partially linear models with jumps," Computational Statistics & Data Analysis, Elsevier, vol. 191(C).
    6. Gao, Jiti & Tong, Howell & Wolff, Rodney, 2002. "Model Specification Tests in Nonparametric Stochastic Regression Models," Journal of Multivariate Analysis, Elsevier, vol. 83(2), pages 324-359, November.
    7. Bingyao Huang & Yanyan Liu & Liuhua Peng, 2023. "Distributed inference for two‐sample U‐statistics in massive data analysis," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 50(3), pages 1090-1115, September.
    8. Daniel L. Millimet & John A. List & Thanasis Stengos, 2003. "The Environmental Kuznets Curve: Real Progress or Misspecified Models?," The Review of Economics and Statistics, MIT Press, vol. 85(4), pages 1038-1047, November.
    9. Florin Maican & Matilda Orth, 2017. "Productivity Dynamics and the Role of ‘Big-Box’ Entrants in Retailing," Journal of Industrial Economics, Wiley Blackwell, vol. 65(2), pages 397-438, June.
    10. repec:spo:wpmain:info:hdl:2441/7182 is not listed on IDEAS
    11. Liang, Hua & Härdle, Wolfgang, 1997. "Large sample theory of the estimation of the error distribution for a semiparametric model," SFB 373 Discussion Papers 1997,101, Humboldt University of Berlin, Interdisciplinary Research Project 373: Quantification and Simulation of Economic Processes.
    12. Xiaohong Chen & Oliver Linton & Ingrid Van Keilegom, 2003. "Estimation of Semiparametric Models when the Criterion Function Is Not Smooth," Econometrica, Econometric Society, vol. 71(5), pages 1591-1608, September.
    13. Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    14. Molestina Vivar, Luis & Wedow, Michael & Weistroffer, Christian, 2023. "Burned by leverage? Flows and fragility in bond mutual funds," Journal of Empirical Finance, Elsevier, vol. 72(C), pages 354-380.
    15. Ai, Chunrong & Chen, Xiaohong, 2007. "Estimation of possibly misspecified semiparametric conditional moment restriction models with different conditioning variables," Journal of Econometrics, Elsevier, vol. 141(1), pages 5-43, November.
    16. Xiaohong Chen & Andres Santos, 2018. "Overidentification in Regular Models," Econometrica, Econometric Society, vol. 86(5), pages 1771-1817, September.
    17. Giesecke, Matthias & Schuß, Eric, 2019. "Heterogeneity in marginal returns to language training of immigrants," IAB-Discussion Paper 201919, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    18. Jinhyun Lee, 2013. "A Consistent Nonparametric Bootstrap Test of Exogeneity," Discussion Paper Series, School of Economics and Finance 201316, School of Economics and Finance, University of St Andrews.
    19. Kaspar Wuthrich & Ying Zhu, 2019. "Omitted variable bias of Lasso-based inference methods: A finite sample analysis," Papers 1903.08704, arXiv.org, revised Sep 2021.
    20. Roberto Martino & Phu Nguyen-Van, 2014. "Labour market regulation and fiscal parameters: A structural model for European regions," Working Papers of BETA 2014-19, Bureau d'Economie Théorique et Appliquée, UDS, Strasbourg.
    21. Yi Zhang & Kosuke Imai, 2023. "Individualized Policy Evaluation and Learning under Clustered Network Interference," Papers 2311.02467, arXiv.org, revised Feb 2024.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:188:y:2023:i:c:s0167947323001354. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.