High-Dimensional $L_2$Boosting: Rate of Convergence

My bibliography Save this paper

High-Dimensional $L_2$Boosting: Rate of Convergence

Author

Listed:

Ye Luo
Martin Spindler
Jannis Kuck

Registered:

Abstract

Boosting is one of the most significant developments in machine learning. This paper studies the rate of convergence of $L_2$Boosting, which is tailored for regression, in a high-dimensional setting. Moreover, we introduce so-called \textquotedblleft post-Boosting\textquotedblright. This is a post-selection estimator which applies ordinary least squares to the variables selected in the first stage by $L_2$Boosting. Another variant is \textquotedblleft Orthogonal Boosting\textquotedblright\ where after each step an orthogonal projection is conducted. We show that both post-$L_2$Boosting and the orthogonal boosting achieve the same rate of convergence as LASSO in a sparse, high-dimensional setting. We show that the rate of convergence of the classical $L_2$Boosting depends on the design matrix described by a sparse eigenvalue constant. To show the latter results, we derive new approximation results for the pure greedy algorithm, based on analyzing the revisiting behavior of $L_2$Boosting. We also introduce feasible rules for early stopping, which can be easily implemented and used in applied work. Our results also allow a direct comparison between LASSO and boosting which has been missing from the literature. Finally, we present simulation studies and applications to illustrate the relevance of our theoretical results and to provide insights into the practical aspects of boosting. In these simulation studies, post-$L_2$Boosting clearly outperforms LASSO.

Suggested Citation

Ye Luo & Martin Spindler & Jannis Kuck, 2016. "High-Dimensional $L_2$Boosting: Rate of Convergence," Papers 1602.08927, arXiv.org, revised Jul 2022.

Handle: RePEc:arx:papers:1602.08927

Download full text from publisher

References listed on IDEAS

A. Belloni & D. Chen & V. Chernozhukov & C. Hansen, 2012. "Sparse Models and Methods for Optimal Instruments With an Application to Eminent Domain," Econometrica, Econometric Society, vol. 80(6), pages 2369-2429, November.
- Alexandre Belloni & D. Chen & Victor Chernozhukov & Christian Hansen, 2010. "Sparse models and methods for optimal instruments with an application to eminent domain," CeMMAP working papers CWP31/10, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Daniel Chen & Victor Chernozhukov & Christian Hansen, 2010. "Sparse Models and Methods for Optimal Instruments with an Application to Eminent Domain," Papers 1010.4345, arXiv.org, revised Apr 2015.
Buhlmann P. & Yu B., 2003. "Boosting With the L2 Loss: Regression and Classification," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 324-339, January.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2017. "Double/Debiased Machine Learning for Treatment and Structural Parameters," NBER Working Papers 23564, National Bureau of Economic Research, Inc.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers CWP28/17, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney K. Newey & James Robins, 2017. "Double/debiased machine learning for treatment and structural parameters," CeMMAP working papers 28/17, Institute for Fiscal Studies.
Michela Bia & Martin Huber & Lukáš Lafférs, 2024. "Double Machine Learning for Sample Selection Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(3), pages 958-969, July.
- Michela Bia & Martin Huber & Luk'av{s} Laff'ers, 2020. "Double machine learning for sample selection models," Papers 2012.00745, arXiv.org, revised Jul 2021.
Jannis Kueck & Ye Luo & Martin Spindler & Zigan Wang, 2017. "Estimation and Inference of Treatment Effects with $L_2$-Boosting in High-Dimensional Settings," Papers 1801.00364, arXiv.org, revised Jul 2021.
Victor Chernozhukov & Vira Semenova, 2018. "Simultaneous inference for Best Linear Predictor of the Conditional Average Treatment Effect and other structural functions," CeMMAP working papers CWP40/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Yue, Mu & Li, Jialiang & Cheng, Ming-Yen, 2019. "Two-step sparse boosting for high-dimensional longitudinal data with varying coefficients," Computational Statistics & Data Analysis, Elsevier, vol. 131(C), pages 222-234.
Sven Klaassen & Jan Teichert-Kluge & Philipp Bach & Victor Chernozhukov & Martin Spindler & Suhas Vijaykumar, 2024. "DoubleMLDeep: Estimation of Causal Effects with Multimodal Data," Papers 2402.01785, arXiv.org.
Yang, Jui-Chung & Chuang, Hui-Ching & Kuan, Chung-Ming, 2020. "Double machine learning with gradient boosting and its application to the Big N audit quality effect," Journal of Econometrics, Elsevier, vol. 216(1), pages 268-283.
Phillip Heiler & Michael C. Knaus, 2021. "Effect or Treatment Heterogeneity? Policy Evaluation with Aggregated and Disaggregated Treatments," Papers 2110.01427, arXiv.org, revised Aug 2023.
- Heiler, Phillip & Knaus, Michael C., 2022. "Effect or Treatment Heterogeneity? Policy Evaluation with Aggregated and Disaggregated Treatments," IZA Discussion Papers 15580, Institute of Labor Economics (IZA).
Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2016. "Double/Debiased Machine Learning for Treatment and Causal Parameters," Papers 1608.00060, arXiv.org, revised Nov 2024.
Helmut Farbmacher & Martin Huber & LukÃ¡Å¡ LaffÃ©rs & Henrika Langen & Martin Spindler, 2022. "Causal mediation analysis with double machine learning [Mediation analysis via potential outcomes models]," The Econometrics Journal, Royal Economic Society, vol. 25(2), pages 277-300.
- Helmut Farbmacher & Martin Huber & Luk'av{s} Laff'ers & Henrika Langen & Martin Spindler, 2020. "Causal mediation analysis with double machine learning," Papers 2002.12710, arXiv.org, revised Feb 2021.
- Farbmacher, Helmut & Huber, Martin & Langen, Henrika & Spindler, Martin, 2020. "Causal mediation analysis with double machine learning," FSES Working Papers 515, Faculty of Economics and Social Sciences, University of Freiburg/Fribourg Switzerland.
Hugo Bodory & Martin Huber & LukÃ¡Å¡ LaffÃ©rs, 2022. "Evaluating (weighted) dynamic treatment effects by double machine learning [Identification of causal effects using instrumental variables]," The Econometrics Journal, Royal Economic Society, vol. 25(3), pages 628-648.
- Hugo Bodory & Martin Huber & Luk'av{s} Laff'ers, 2020. "Evaluating (weighted) dynamic treatment effects by double machine learning," Papers 2012.00370, arXiv.org, revised Jun 2021.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Peter C. B. Phillips & Zhentao Shi, 2021. "Boosting: Why You Can Use The Hp Filter," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 62(2), pages 521-570, May.
- Peter C.B. Phillips & Zhentao Shi, 2019. "Boosting: Why you Can Use the HP Filter," Cowles Foundation Discussion Papers 2212, Cowles Foundation for Research in Economics, Yale University.
- Peter C. B. Phillips & Zhentao Shi, 2019. "Boosting: Why You Can Use the HP Filter," Papers 1905.00175, arXiv.org, revised Nov 2020.
Jianqing Fan & Kunpeng Li & Yuan Liao, 2020. "Recent Developments on Factor Models and its Applications in Econometric Learning," Papers 2009.10103, arXiv.org.
Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-Dimensional Econometrics and Regularized GMM," Papers 1806.01888, arXiv.org, revised Jun 2018.
Alexandre Belloni & Victor Chernozhukov & Kengo Kato, 2019. "Valid Post-Selection Inference in High-Dimensional Approximately Sparse Quantile Regression Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 749-758, April.
- Alexandre Belloni & Victor Chernozhukov & Kengo Kato, 2013. "Valid Post-Selection Inference in High-Dimensional Approximately Sparse Quantile Regression Models," Papers 1312.7186, arXiv.org, revised Jun 2016.
- Alexandre Belloni & Victor Chernozhukov & Kengo Kato, 2014. "Valid post-selection inference in high-dimensional approximately sparse quantile regression models," CeMMAP working papers CWP53/14, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Kengo Kato, 2014. "Valid post-selection inference in high-dimensional approximately sparse quantile regression models," CeMMAP working papers 53/14, Institute for Fiscal Studies.
Arne Henningsen & Guy Low & David Wuepper & Tobias Dalhaus & Hugo Storm & Dagim Belay & Stefan Hirsch, 2024. "Estimating Causal Effects with Observational Data: Guidelines for Agricultural and Applied Economists," IFRO Working Paper 2024/03, University of Copenhagen, Department of Food and Resource Economics.
Anil Kumar, 2018. "Do Restrictions on Home Equity Extraction Contribute to Lower Mortgage Defaults? Evidence from a Policy Discontinuity at the Texas Border," American Economic Journal: Economic Policy, American Economic Association, vol. 10(1), pages 268-297, February.
- Anil Kumar, 2014. "Do restrictions on home equity extraction contribute to lower mortgage defaults? evidence from a policy discontinuity at the Texas border," Working Papers 1410, Federal Reserve Bank of Dallas.
Gerhard Tutz & Moritz Berger, 2018. "Tree-structured modelling of categorical predictors in generalized additive regression," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(3), pages 737-758, September.
Fan, Jianqing & Jiang, Bai & Sun, Qiang, 2022. "Bayesian factor-adjusted sparse regression," Journal of Econometrics, Elsevier, vol. 230(1), pages 3-19.
Jun Li & Serguei Netessine & Sergei Koulayev, 2018. "Price to Compete … with Many: How to Identify Price Competition in High-Dimensional Space," Management Science, INFORMS, vol. 64(9), pages 4118-4136, September.
Daniel Paravisini & Veronica Rappoport & Philipp Schnabl & Daniel Wolfenzon, 2015. "Dissecting the Effect of Credit Supply on Trade: Evidence from Matched Credit-Export Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 82(1), pages 333-359.
- Paravisini, Daniel & Rappoport, Veronica & Schnabl, Philipp & Wolfenzon, Daniel, 2010. "Dissecting the Effect of Credit Supply on Trade: Evidence from Matched Credit-Export Data," Working Papers 2010-022, Banco Central de Reserva del Perú.
- Veronica Rappoport & Philipp Schnabl & Daniel Wolfenzon & Daniel Paravisini, 2011. "Dissecting the Effect of Credit Supply on Trade: Evidence from Matched Credit-Export Data," 2011 Meeting Papers 180, Society for Economic Dynamics.
- Paravisini, Daniel & Rappoport, Veronica & Schnabl, Philipp & Wolfenzon, Daniel, 2015. "Dissecting the effect of credit supply on trade: evidence from matched credit-export data," LSE Research Online Documents on Economics 59575, London School of Economics and Political Science, LSE Library.
- Daniel Paravisini & Veronica Rappoport & Philipp Schnabl & Daniel Wolfenzon, 2011. "Dissecting the Effect of Credit Supply on Trade: Evidence from Matched Credit-Export Data," NBER Working Papers 16975, National Bureau of Economic Research, Inc.
Alexandre Belloni & Victor Chernozhukov & Christian Hansen & Damian Kozbur, 2016. "Inference in High-Dimensional Panel Models With an Application to Gun Control," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 590-605, October.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen & Damian Kozbur, 2014. "Inference in high dimensional panel models with an application to gun control," CeMMAP working papers 50/14, Institute for Fiscal Studies.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen & Damian Kozbur, 2014. "Inference in High Dimensional Panel Models with an Application to Gun Control," Papers 1411.6507, arXiv.org.
- Alexandre Belloni & Victor Chernozhukov & Christian Hansen & Damian Kozbur, 2014. "Inference in high dimensional panel models with an application to gun control," CeMMAP working papers CWP50/14, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Alexandre Belloni & Victor Chernozhukov, 2011. "High Dimensional Sparse Econometric Models: An Introduction," Papers 1106.5242, arXiv.org, revised Sep 2011.
Brian Quistorff & Gentry Johnson, 2020. "Machine Learning for Experimental Design: Methods for Improved Blocking," Papers 2010.15966, arXiv.org.
A. Belloni & D. Chen & V. Chernozhukov & C. Hansen, 2012. "Sparse Models and Methods for Optimal Instruments With an Application to Eminent Domain," Econometrica, Econometric Society, vol. 80(6), pages 2369-2429, November.
- Alexandre Belloni & D. Chen & Victor Chernozhukov & Christian Hansen, 2010. "Sparse models and methods for optimal instruments with an application to eminent domain," CeMMAP working papers CWP31/10, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Alexandre Belloni & Daniel Chen & Victor Chernozhukov & Christian Hansen, 2010. "Sparse Models and Methods for Optimal Instruments with an Application to Eminent Domain," Papers 1010.4345, arXiv.org, revised Apr 2015.
Taisuke Otsu & Myung Hwan Seo, 2014. "Asymptotics for maximum score method under general conditions," STICERD - Econometrics Paper Series 571, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
Nelson, Kelly P. & Parton, Lee C. & Brown, Zachary S., 2022. "Biofuels policy and innovation impacts: Evidence from biofuels and agricultural patent indicators," Energy Policy, Elsevier, vol. 162(C).
- Nelson, Kelly & Brown, Zachary S. & Parton, Lee, 2019. "Biofuels Policy and Innovation Impacts: Evidence from Biofuels and Agricultural Patent Indicators," 2019 Annual Meeting, July 21-23, Atlanta, Georgia 291243, Agricultural and Applied Economics Association.
Mittnik, Stefan & Robinzonov, Nikolay & Spindler, Martin, 2015. "Stock market volatility: Identifying major drivers and the nature of their impact," Journal of Banking & Finance, Elsevier, vol. 58(C), pages 1-14.
Sauvenier, Mathieu & Van Bellegem, Sébastien, 2023. "Direction Identification and Minimax Estimation by Generalized Eigenvalue Problem in High Dimensional Sparse Regression," LIDAM Discussion Papers CORE 2023005, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
Arthur Charpentier & Emmanuel Flachaire & Antoine Ly, 2017. "Econom\'etrie et Machine Learning," Papers 1708.06992, arXiv.org, revised Mar 2018.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2017-10-01 (Big Data)
NEP-CMP-2017-10-01 (Computational Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1602.08927. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

High-Dimensional $L_2$Boosting: Rate of Convergence

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data