IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v33y2018i2d10.1007_s00180-018-0797-8.html
   My bibliography  Save this article

Extending AIC to best subset regression

Author

Listed:
  • J. G. Liao

    (Penn State University)

  • Joseph E. Cavanaugh

    (University of Iowa)

  • Timothy L. McMurry

    (University of Virginia)

Abstract

The Akaike information criterion (AIC) is routinely used for model selection in best subset regression. The standard AIC, however, generally under-penalizes model complexity in the best subset regression setting, potentially leading to grossly overfit models. Recently, Zhang and Cavanaugh (Comput Stat 31(2):643–669, 2015) made significant progress towards addressing this problem by introducing an effective multistage model selection procedure. In this paper, we present a rigorous and coherent conceptual framework for extending AIC to best subset regression. A new model selection algorithm derived from our framework possesses well understood and desirable asymptotic properties and consistently outperforms the procedure of Zhang and Cavanaugh in simulation studies. It provides an effective tool for combating the pervasive overfitting that detrimentally impacts best subset regression analysis so that the selected models contain fewer irrelevant predictors and predict future observations more accurately.

Suggested Citation

  • J. G. Liao & Joseph E. Cavanaugh & Timothy L. McMurry, 2018. "Extending AIC to best subset regression," Computational Statistics, Springer, vol. 33(2), pages 787-806, June.
  • Handle: RePEc:spr:compst:v:33:y:2018:i:2:d:10.1007_s00180-018-0797-8
    DOI: 10.1007/s00180-018-0797-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-018-0797-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-018-0797-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Liao J.G. & McGee D., 2003. "Adjusted Coefficients of Determination for Logistic Regression," The American Statistician, American Statistical Association, vol. 57, pages 161-165, August.
    2. White, Halbert, 1982. "Maximum Likelihood Estimation of Misspecified Models," Econometrica, Econometric Society, vol. 50(1), pages 1-25, January.
    3. Bengtsson, Thomas & Cavanaugh, Joseph E., 2006. "An improved Akaike information criterion for state-space model selection," Computational Statistics & Data Analysis, Elsevier, vol. 50(10), pages 2635-2654, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Pereira Teodoro da Silva, Kairo & Kalbusch, Andreza & Henning, Elisa, 2023. "Detection of unauthorized consumption in water supply systems: A case study using logistic regression," Utilities Policy, Elsevier, vol. 84(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. P.A.V.B. Swamy & I-Lok Chang & Jatinder S. Mehta & William H. Greene & Stephen G. Hall & George S. Tavlas, 2016. "Removing Specification Errors from the Usual Formulation of Binary Choice Models," Econometrics, MDPI, vol. 4(2), pages 1-21, June.
    2. Carlo Altavilla & Raffaella Giacomini & Giuseppe Ragusa, 2017. "Anchoring the yield curve using survey expectations," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(6), pages 1055-1068, September.
    3. Fernando Rios-Avila & Gustavo Canavire-Bacarreza, 2018. "Standard-error correction in two-stage optimization models: A quasi–maximum likelihood estimation approach," Stata Journal, StataCorp LP, vol. 18(1), pages 206-222, March.
    4. Sandy Fréret & Denis Maguain, 2017. "The effects of agglomeration on tax competition: evidence from a two-regime spatial panel model on French data," International Tax and Public Finance, Springer;International Institute of Public Finance, vol. 24(6), pages 1100-1140, December.
    5. Ai, Chunrong & Chen, Xiaohong, 2007. "Estimation of possibly misspecified semiparametric conditional moment restriction models with different conditioning variables," Journal of Econometrics, Elsevier, vol. 141(1), pages 5-43, November.
    6. Gregory, Allan W. & McCurdy, Thomas H., 1986. "The unbiasedness hypothesis in the forward foreign exchange market: A specification analysis with application to France, Italy, Japan, the United Kingdom and West Germany," European Economic Review, Elsevier, vol. 30(2), pages 365-381, April.
    7. B. Praag & T. Dijkstra & J. Velzen, 1985. "Least-squares theory based on general distributional assumptions with an application to the incomplete observations problem," Psychometrika, Springer;The Psychometric Society, vol. 50(1), pages 25-36, March.
    8. Reischmann, Markus, 2016. "Creative accounting and electoral motives: Evidence from OECD countries," Journal of Comparative Economics, Elsevier, vol. 44(2), pages 243-257.
    9. Czudaj Robert L., 2020. "The role of uncertainty on agricultural futures markets momentum trading and volatility," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 24(3), pages 1-39, June.
    10. Vassilios Babalos & Mehmet Balcilar & Rangan Gupta, 2014. "Revisiting Herding Behavior in REITs: A Regime-Switching Approach," Working Papers 201448, University of Pretoria, Department of Economics.
    11. Topi Miettinen & Sigrid Suetens, 2008. "Communication and Guilt in a Prisoner's Dilemma," Journal of Conflict Resolution, Peace Science Society (International), vol. 52(6), pages 945-960, December.
    12. Towfiqul Islam Khan & Mashfique Ibne Akbar, 2015. "Illicit Financial Flow in view of Financing the Post-2015 Development Agenda," Southern Voice Occasional Paper 25, Southern Voice.
    13. Catherine Doz & Domenico Giannone & Lucrezia Reichlin, 2012. "A Quasi–Maximum Likelihood Approach for Large, Approximate Dynamic Factor Models," The Review of Economics and Statistics, MIT Press, vol. 94(4), pages 1014-1024, November.
    14. Potrafke, Niklas, 2019. "Electoral cycles in perceived corruption: International empirical evidence," Journal of Comparative Economics, Elsevier, vol. 47(1), pages 215-224.
    15. Corradi, Valentina & Swanson, Norman R., 2004. "A test for the distributional comparison of simulated and historical data," Economics Letters, Elsevier, vol. 85(2), pages 185-193, November.
    16. Hendrik Thiel & Stephan L. Thomsen, 2015. "Individual Poverty Paths and the Stability of Control-Perception," SOEPpapers on Multidisciplinary Panel Data Research 794, DIW Berlin, The German Socio-Economic Panel (SOEP).
    17. Arzheimer, Kai & Evans, Jocelyn, 2010. "Bread and butter à la française: Multiparty forecasts of the French legislative vote (1981-2007)," International Journal of Forecasting, Elsevier, vol. 26(1), pages 19-31, January.
    18. repec:ebl:ecbull:v:3:y:2008:i:5:p:1-7 is not listed on IDEAS
    19. Tobias Hartl & Roland Jucknewitz, 2022. "Approximate state space modelling of unobserved fractional components," Econometric Reviews, Taylor & Francis Journals, vol. 41(1), pages 75-98, January.
    20. Hao Wu & Michael Browne, 2015. "Random Model Discrepancy: Interpretations and Technicalities (A Rejoinder)," Psychometrika, Springer;The Psychometric Society, vol. 80(3), pages 619-624, September.
    21. Tue Gørgens & Allan Würtz, 2012. "Testing a parametric function against a non‐parametric alternative in IV and GMM settings," Econometrics Journal, Royal Economic Society, vol. 15(3), pages 462-489, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:33:y:2018:i:2:d:10.1007_s00180-018-0797-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.