Early stopping in L2Boosting

My bibliography Save this article

Early stopping in L2Boosting

Author

Listed:

Ivan Chang, Yuan-Chin
Huang, Yufen
Huang, Yu-Pai

Registered:

Abstract

It is well known that the boosting-like algorithms, such as AdaBoost and many of its modifications, may over-fit the training data when the number of boosting iterations becomes large. Therefore, how to stop a boosting algorithm at an appropriate iteration time is a longstanding problem for the past decade (see Meir and Rätsch, 2003). Bühlmann and Yu (2005) applied model selection criteria to estimate the stopping iteration for L2Boosting, but it is still necessary to compute all boosting iterations under consideration for the training data. Thus, the main purpose of this paper is focused on studying the early stopping rule for L2Boosting during the training stage to seek a very substantial computational saving. The proposed method is based on a change point detection method on the values of model selection criteria during the training stage. This method is also extended to two-class classification problems which are very common in medical and bioinformatics applications. A simulation study and a real data example to these approaches are provided for illustrations, and comparisons are made with LogitBoost.

Suggested Citation

Ivan Chang, Yuan-Chin & Huang, Yufen & Huang, Yu-Pai, 2010. "Early stopping in L2Boosting," Computational Statistics & Data Analysis, Elsevier, vol. 54(10), pages 2203-2213, October.

Handle: RePEc:eee:csdana:v:54:y:2010:i:10:p:2203-2213

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Hansen M. H & Yu B., 2001. "Model Selection and the Principle of Minimum Description Length," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 746-774, June.
T. Speed & Bin Yu, 1993. "Model selection and prediction: Normal regression," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 45(1), pages 35-54, March.
Buhlmann P. & Yu B., 2003. "Boosting With the L2 Loss: Regression and Classification," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 324-339, January.
Tsao, C. Andy & Chang, Yuan-chin Ivan, 2007. "A stochastic approximation view of boosting," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 325-334, September.
Clifford M. Hurvich & Jeffrey S. Simonoff & Chih‐Ling Tsai, 1998. "Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 271-293.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Jing Zeng, 2014. "Forecasting Aggregates with Disaggregate Variables: Does Boosting Help to Select the Most Relevant Predictors?," Working Paper Series of the Department of Economics, University of Konstanz 2014-20, Department of Economics, University of Konstanz.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Christian Pierdzioch & Rangan Gupta & Hossein Hassani & Emmanuel Silva, 2018. "Forecasting Changes of Economic Inequality: A Boosting Approach," Working Papers 201868, University of Pretoria, Department of Economics.
Leitenstorfer, Florian & Tutz, Gerhard, 2007. "Knot selection by boosting techniques," Computational Statistics & Data Analysis, Elsevier, vol. 51(9), pages 4605-4621, May.
Klaus Wohlrabe & Teresa Buchen, 2014. "Assessing the Macroeconomic Forecasting Performance of Boosting: Evidence for the United States, the Euro Area and Germany," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 33(4), pages 231-242, July.
- Teresa Buchen & Klaus Wohlrabe, 2013. "Assessing the Macroeconomic Forecasting Performance of Boosting - Evidence for the United States, the Euro Area, and Germany," CESifo Working Paper Series 4148, CESifo.
- Teresa, Buchen & Wohlrabe, Klaus, 2014. "Assessing the Macroeconomic Forecasting Performance of Boosting: Evidence for the United States, the Euro Area, and Germany," VfS Annual Conference 2014 (Hamburg): Evidence-based Economic Policy 100626, Verein für Socialpolitik / German Economic Association.
Tutz, Gerhard & Leitenstorfer, Florian, 2006. "Response shrinkage estimators in binary regression," Computational Statistics & Data Analysis, Elsevier, vol. 50(10), pages 2878-2901, June.
Ng, Serena, 2013. "Variable Selection in Predictive Regressions," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 752-789, Elsevier.
Schmid, Matthias & Hothorn, Torsten, 2008. "Boosting additive models using component-wise P-Splines," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 298-311, December.
Ching-Kang Ing, 2005. "Accumulated Prediction Errors, Information Criteria And Optimal Forecasting For Autoregressive Time Series," Econometrics 0503020, University Library of Munich, Germany.
Jing Zeng, 2014. "Forecasting Aggregates with Disaggregate Variables: Does Boosting Help to Select the Most Relevant Predictors?," Working Paper Series of the Department of Economics, University of Konstanz 2014-20, Department of Economics, University of Konstanz.
Daye, Z. John & Jeng, X. Jessie, 2009. "Shrinkage and model selection with correlated variables via weighted fusion," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1284-1298, February.
Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
Gerhard Tutz & Moritz Berger, 2018. "Tree-structured modelling of categorical predictors in generalized additive regression," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(3), pages 737-758, September.
Hans R. A. Koster & Jos N. van Ommeren & Piet Rietveld, 2016. "Historic amenities, income and sorting of households," Journal of Economic Geography, Oxford University Press, vol. 16(1), pages 203-236.
- Koster, Hans R. A. & Rietveld, Piet & Van Ommeren, Jos, 2013. "Historic amenities, income and sorting of households," LSE Research Online Documents on Economics 58433, London School of Economics and Political Science, LSE Library.
- Hans R. A. Koster & Piet Rietveld & Jos Van Ommeren, 2013. "Historic Amenities, Income and Sorting of Households," SERC Discussion Papers 0124, Centre for Economic Performance, LSE.
Bethany Everett & David Rehkopf & Richard Rogers, 2013. "The Nonlinear Relationship Between Education and Mortality: An Examination of Cohort, Race/Ethnic, and Gender Differences," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 32(6), pages 893-917, December.
Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
Tsimpanos, Apostolos & Tsimbos, Cleon & Kalogirou, Stamatis, 2018. "Assessing spatial variation and heterogeneity of fertility in Greece at local authority level," MPRA Paper 100406, University Library of Munich, Germany.
Mittnik, Stefan & Robinzonov, Nikolay & Spindler, Martin, 2015. "Stock market volatility: Identifying major drivers and the nature of their impact," Journal of Banking & Finance, Elsevier, vol. 58(C), pages 1-14.
Don Harding, 2010. "Applying shape and phase restrictions in generalized dynamic categorical models of the business cycle," NCER Working Paper Series 58, National Centre for Econometric Research.
- Don Harding, 2010. "Applying shape and phase restrictions in generalized dynamic categorical models of the business cycle," Working Papers 2010.05, School of Economics, La Trobe University.
- Don Harding, 2010. "Applying Shape and Phase Restrictions in Generalized Dynamic Categorical Models of the Business Cycle," CAMA Working Papers 2010-25, Centre for Applied Macroeconomic Analysis, Crawford School of Public Policy, The Australian National University.
- Don Harding, 2010. "Applying shape and phase restrictions in generalized dynamic categorical models of the business cycle," Working Papers 2010.05, School of Economics, La Trobe University.
Michael S. Delgado & Daniel J. Henderson & Christopher F. Parmeter, 2014. "Does Education Matter for Economic Growth?," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 76(3), pages 334-359, June.
- Michael S. Delgado & Daniel J. Henderson & Christopher F. Parmeter, 2011. "Does Education Matter for Economic Growth?," Working Papers 2011-13, University of Miami, Department of Economics.
- Delgado, Michael S. & Henderson, Daniel J. & Parmeter, Christopher F., 2012. "Does Education Matter for Economic Growth?," IZA Discussion Papers 7089, Institute of Labor Economics (IZA).
Yana Melnykov & Marcus Perry, 2024. "On Robust Change Point Detection and Estimation in Multisubject Studies," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 86(2), pages 827-879, August.
Seongkyoon Jeong & Jae Young Choi, 2012. "The taxonomy of research collaboration in science and technology: evidence from mechanical research through probabilistic clustering analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 91(3), pages 719-735, June.

More about this item

Keywords

AICc BIC gMDL Change point detection method L2Boosting LogitBoost Stopping rule;

JEL classification:

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:54:y:2010:i:10:p:2203-2213. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Early stopping in L2Boosting

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

JEL classification:

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data