IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2408.03930.html
   My bibliography  Save this paper

Robust Estimation of Regression Models with Potentially Endogenous Outliers via a Modern Optimization Lens

Author

Listed:
  • Zhan Gao
  • Hyungsik Roger Moon

Abstract

This paper addresses the robust estimation of linear regression models in the presence of potentially endogenous outliers. Through Monte Carlo simulations, we demonstrate that existing $L_1$-regularized estimation methods, including the Huber estimator and the least absolute deviation (LAD) estimator, exhibit significant bias when outliers are endogenous. Motivated by this finding, we investigate $L_0$-regularized estimation methods. We propose systematic heuristic algorithms, notably an iterative hard-thresholding algorithm and a local combinatorial search refinement, to solve the combinatorial optimization problem of the \(L_0\)-regularized estimation efficiently. Our Monte Carlo simulations yield two key results: (i) The local combinatorial search algorithm substantially improves solution quality compared to the initial projection-based hard-thresholding algorithm while offering greater computational efficiency than directly solving the mixed integer optimization problem. (ii) The $L_0$-regularized estimator demonstrates superior performance in terms of bias reduction, estimation accuracy, and out-of-sample prediction errors compared to $L_1$-regularized alternatives. We illustrate the practical value of our method through an empirical application to stock return forecasting.

Suggested Citation

  • Zhan Gao & Hyungsik Roger Moon, 2024. "Robust Estimation of Regression Models with Potentially Endogenous Outliers via a Modern Optimization Lens," Papers 2408.03930, arXiv.org.
  • Handle: RePEc:arx:papers:2408.03930
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2408.03930
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ivo Welch & Amit Goyal, 2008. "A Comprehensive Look at The Empirical Performance of Equity Premium Prediction," The Review of Financial Studies, Society for Financial Studies, vol. 21(4), pages 1455-1508, July.
    2. Koo, Bonsoo & Anderson, Heather M. & Seo, Myung Hwan & Yao, Wenying, 2020. "High-dimensional predictive regression in the presence of cointegration," Journal of Econometrics, Elsevier, vol. 219(2), pages 456-477.
    3. She, Yiyuan & Owen, Art B., 2011. "Outlier Detection Using Nonconvex Penalized Regression," Journal of the American Statistical Association, American Statistical Association, vol. 106(494), pages 626-639.
    4. Ziwei Mei & Zhentao Shi, 2022. "On LASSO for High Dimensional Predictive Regression," Papers 2212.07052, arXiv.org, revised Jan 2024.
    5. Alexandros Kostakis & Tassos Magdalinos & Michalis P. Stamatogiannis, 2015. "Robust Econometric Inference for Stock Return Predictability," The Review of Financial Studies, Society for Financial Studies, vol. 28(5), pages 1506-1553.
    6. Thompson, Ryan, 2022. "Robust subset selection," Computational Statistics & Data Analysis, Elsevier, vol. 169(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tu, Yundong & Xie, Xinling, 2023. "Penetrating sporadic return predictability," Journal of Econometrics, Elsevier, vol. 237(1).
    2. Lee, Ji Hyung & Shi, Zhentao & Gao, Zhan, 2022. "On LASSO for predictive regression," Journal of Econometrics, Elsevier, vol. 229(2), pages 322-349.
    3. Zhou, Weilun & Gao, Jiti & Harris, David & Kew, Hsein, 2024. "Semi-parametric single-index predictive regression models with cointegrated regressors," Journal of Econometrics, Elsevier, vol. 238(1).
    4. Chaohua Dong & Jiti Gao & Bin Peng & Yundong Tu, 2023. "Robust M-Estimation for Additive Single-Index Cointegrating Time Series Models," Monash Econometrics and Business Statistics Working Papers 2/23, Monash University, Department of Econometrics and Business Statistics.
    5. Zhan Gao & Ji Hyung Lee & Ziwei Mei & Zhentao Shi, 2024. "Econometric Inference for High Dimensional Predictive Regressions," Papers 2409.10030, arXiv.org, revised Nov 2024.
    6. Zongwu Cai & Haiqiang Chen & Xiaosai Liao, 2020. "A New Robust Inference for Predictive Quantile Regression," WORKING PAPERS SERIES IN THEORETICAL AND APPLIED ECONOMICS 202002, University of Kansas, Department of Economics, revised Feb 2020.
    7. Dunbar, Kwamie, 2021. "Pricing the hedging factor in the cross-section of stock returns," The North American Journal of Economics and Finance, Elsevier, vol. 56(C).
    8. Bandi, Federico M. & Bretscher, Lorenzo & Tamoni, Andrea, 2023. "Return predictability with endogenous growth," Journal of Financial Economics, Elsevier, vol. 150(3).
    9. Geert Bekaert & Eric C. Engstrom & Nancy R. Xu, 2022. "The Time Variation in Risk Appetite and Uncertainty," Management Science, INFORMS, vol. 68(6), pages 3975-4004, June.
    10. Andersen, Torben G. & Varneskov, Rasmus T., 2021. "Consistent inference for predictive regressions in persistent economic systems," Journal of Econometrics, Elsevier, vol. 224(1), pages 215-244.
    11. Atanasov, Victoria, 2021. "Unemployment and aggregate stock returns," Journal of Banking & Finance, Elsevier, vol. 129(C).
    12. Lee, Ji Hyung, 2016. "Predictive quantile regression with persistent covariates: IVX-QR approach," Journal of Econometrics, Elsevier, vol. 192(1), pages 105-118.
    13. Narayan, Paresh Kumar & Liu, Ruipeng, 2018. "A new GARCH model with higher moments for stock return predictability," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 56(C), pages 93-103.
    14. Park, Dojoon & Hahn, Jaehoon & Eom, Young Ho, 2024. "Predicting the equity premium with financial ratios: A comprehensive look over a long period in Korea," Pacific-Basin Finance Journal, Elsevier, vol. 84(C).
    15. Demetrescu, Matei & Rodrigues, Paulo M.M. & Taylor, A.M. Robert, 2023. "Transformed regression-based long-horizon predictability tests," Journal of Econometrics, Elsevier, vol. 237(2).
    16. Demetrescu, Matei & Georgiev, Iliyan & Rodrigues, Paulo M.M. & Taylor, A.M. Robert, 2022. "Testing for episodic predictability in stock returns," Journal of Econometrics, Elsevier, vol. 227(1), pages 85-113.
    17. Yu, Deshui & Huang, Difang & Chen, Li & Li, Luyang, 2023. "Forecasting dividend growth: The role of adjusted earnings yield," Economic Modelling, Elsevier, vol. 120(C).
    18. Demetrescu, Matei & Rodrigues, Paulo M.M., 2022. "Residual-augmented IVX predictive regression," Journal of Econometrics, Elsevier, vol. 227(2), pages 429-460.
    19. Zhishui Hu & Ioannis Kasparis & Qiying Wang, 2020. "Locally trimmed least squares: conventional inference in possibly nonstationary models," Papers 2006.12595, arXiv.org.
    20. Chaohua Dong & Jiti Gao & Bin Peng & Yundong Tu, 2021. "Multiple-index Nonstationary Time Series Models: Robust Estimation Theory and Practice," Papers 2111.02023, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2408.03930. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.