IDEAS home Printed from https://ideas.repec.org/a/spr/orspec/v43y2021i3d10.1007_s00291-021-00627-y.html
   My bibliography  Save this article

Leveraged least trimmed absolute deviations

Author

Listed:
  • Nathan Sudermann-Merx

    (BASF SE)

  • Steffen Rebennack

    (Karlsruhe Institute of Technology (KIT))

Abstract

The design of regression models that are not affected by outliers is an important task which has been subject of numerous papers within the statistics community for the last decades. Prominent examples of robust regression models are least trimmed squares (LTS), where the k largest squared deviations are ignored, and least trimmed absolute deviations (LTA) which ignores the k largest absolute deviations. The numerical complexity of both models is driven by the number of binary variables and by the value k of ignored deviations. We introduce leveraged least trimmed absolute deviations (LLTA) which exploits that LTA is already immune against y-outliers. Therefore, LLTA has only to be guarded against outlying values in x, so-called leverage points, which can be computed beforehand, in contrast to y-outliers. Thus, while the mixed-integer formulations of LTS and LTA have as many binary variables as data points, LLTA only needs one binary variable per leverage point, resulting in a significant reduction of binary variables. Based on 11 data sets from the literature, we demonstrate that (1) LLTA’s prediction quality improves much faster than LTS and as fast as LTA for increasing values of k and (2) that LLTA solves the benchmark problems about 80 times faster than LTS and about five times faster than LTA, in median.

Suggested Citation

  • Nathan Sudermann-Merx & Steffen Rebennack, 2021. "Leveraged least trimmed absolute deviations," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 43(3), pages 809-834, September.
  • Handle: RePEc:spr:orspec:v:43:y:2021:i:3:d:10.1007_s00291-021-00627-y
    DOI: 10.1007/s00291-021-00627-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00291-021-00627-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00291-021-00627-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Steffen Rebennack & Vitaliy Krasko, 2020. "Piecewise Linear Function Fitting via Mixed-Integer Linear Programming," INFORMS Journal on Computing, INFORMS, vol. 32(2), pages 507-530, April.
    2. Steffen Rebennack & Josef Kallrath, 2015. "Continuous Piecewise Linear Delta-Approximations for Univariate Functions: Computing Minimal Breakpoint Systems," Journal of Optimization Theory and Applications, Springer, vol. 167(2), pages 617-643, November.
    3. Hawkins, Douglas M. & Olive, David, 1999. "Applications and algorithms for least trimmed sum of absolute deviations regression," Computational Statistics & Data Analysis, Elsevier, vol. 32(2), pages 119-134, December.
    4. Steffen Rebennack & Josef Kallrath, 2015. "Continuous Piecewise Linear Delta-Approximations for Bivariate and Multivariate Functions," Journal of Optimization Theory and Applications, Springer, vol. 167(1), pages 102-117, October.
    5. Lembke B., 1918. "√ a. p," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 111(1), pages 709-712, February.
    6. Roger Koenker & Kevin F. Hallock, 2001. "Quantile Regression," Journal of Economic Perspectives, American Economic Association, vol. 15(4), pages 143-156, Fall.
    7. Tableman, Mara, 1994. "The asymptotics of the least trimmed absolute deviations (LTAD) estimator," Statistics & Probability Letters, Elsevier, vol. 19(5), pages 387-398, April.
    8. C. Chatzinakos & L. Pitsoulis & G. Zioutas, 2016. "Optimization techniques for robust multivariate location and scatter estimation," Journal of Combinatorial Optimization, Springer, vol. 31(4), pages 1443-1460, May.
    9. Dodge, Yadolah, 1997. "LAD Regression for Detecting Outliers in Response and Explanatory Variables," Journal of Multivariate Analysis, Elsevier, vol. 61(1), pages 144-158, April.
    10. Bernholt, Thorsten, 2006. "Robust Estimators are Hard to Compute," Technical Reports 2005,52, Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen.
    11. Noam Goldberg & Steffen Rebennack & Youngdae Kim & Vitaliy Krasko & Sven Leyffer, 2021. "MINLP formulations for continuous piecewise linear function fitting," Computational Optimization and Applications, Springer, vol. 79(1), pages 223-233, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Barbato, Michele & Ceselli, Alberto, 2024. "Mathematical programming for simultaneous feature selection and outlier detection under l1 norm," European Journal of Operational Research, Elsevier, vol. 316(3), pages 1070-1084.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Steffen Rebennack, 2022. "Data-driven stochastic optimization for distributional ambiguity with integrated confidence region," Journal of Global Optimization, Springer, vol. 84(2), pages 255-293, October.
    2. Aloïs Duguet & Christian Artigues & Laurent Houssin & Sandra Ulrich Ngueveu, 2022. "Properties, Extensions and Application of Piecewise Linearization for Euclidean Norm Optimization in $$\mathbb {R}^2$$ R 2," Journal of Optimization Theory and Applications, Springer, vol. 195(2), pages 418-448, November.
    3. Čížek, Pavel, 2008. "General Trimmed Estimation: Robust Approach To Nonlinear And Limited Dependent Variable Models," Econometric Theory, Cambridge University Press, vol. 24(6), pages 1500-1529, December.
    4. Andreas Bärmann & Robert Burlacu & Lukas Hager & Thomas Kleinert, 2023. "On piecewise linear approximations of bilinear terms: structural comparison of univariate and bivariate mixed-integer programming formulations," Journal of Global Optimization, Springer, vol. 85(4), pages 789-819, April.
    5. Olive, David J. & Hawkins, Douglas M., 2003. "Robust regression with high coverage," Statistics & Probability Letters, Elsevier, vol. 63(3), pages 259-266, July.
    6. Neykov, N.M. & Čížek, P. & Filzmoser, P. & Neytchev, P.N., 2012. "The least trimmed quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1757-1770.
    7. C. Chatzinakos & L. Pitsoulis & G. Zioutas, 2016. "Optimization techniques for robust multivariate location and scatter estimation," Journal of Combinatorial Optimization, Springer, vol. 31(4), pages 1443-1460, May.
    8. Cizek, P., 2007. "General Trimmed Estimation : Robust Approach to Nonlinear and Limited Dependent Variable Models (Replaces DP 2007-1)," Other publications TiSEM eeccf622-dd18-41d4-a2f9-b, Tilburg University, School of Economics and Management.
    9. Kazda, Kody & Li, Xiang, 2024. "A linear programming approach to difference-of-convex piecewise linear approximation," European Journal of Operational Research, Elsevier, vol. 312(2), pages 493-511.
    10. Noam Goldberg & Steffen Rebennack & Youngdae Kim & Vitaliy Krasko & Sven Leyffer, 2021. "MINLP formulations for continuous piecewise linear function fitting," Computational Optimization and Applications, Springer, vol. 79(1), pages 223-233, May.
    11. Barbato, Michele & Ceselli, Alberto, 2024. "Mathematical programming for simultaneous feature selection and outlier detection under l1 norm," European Journal of Operational Research, Elsevier, vol. 316(3), pages 1070-1084.
    12. Steffen Rebennack & Vitaliy Krasko, 2020. "Piecewise Linear Function Fitting via Mixed-Integer Linear Programming," INFORMS Journal on Computing, INFORMS, vol. 32(2), pages 507-530, April.
    13. Aakil M. Caunhye & Douglas Alem, 2023. "Practicable robust stochastic optimization under divergence measures with an application to equitable humanitarian response planning," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 45(3), pages 759-806, September.
    14. John Alasdair Warwicker & Steffen Rebennack, 2022. "A Comparison of Two Mixed-Integer Linear Programs for Piecewise Linear Function Fitting," INFORMS Journal on Computing, INFORMS, vol. 34(2), pages 1042-1047, March.
    15. Shao, Yu & Zhou, Xinhong & Yu, Tingchao & Zhang, Tuqiao & Chu, Shipeng, 2024. "Pump scheduling optimization in water distribution system based on mixed integer linear programming," European Journal of Operational Research, Elsevier, vol. 313(3), pages 1140-1151.
    16. Akosah, Nana Kwame & Alagidede, Imhotep Paul & Schaling, Eric, 2020. "Testing for asymmetry in monetary policy rule for small-open developing economies: Multiscale Bayesian quantile evidence from Ghana," The Journal of Economic Asymmetries, Elsevier, vol. 22(C).
    17. Paul Hewson & Keming Yu, 2008. "Quantile regression for binary performance indicators," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 24(5), pages 401-418, September.
    18. Sergei Rogosin & Maryna Dubatovskaya, 2017. "Letnikov vs. Marchaud: A Survey on Two Prominent Constructions of Fractional Derivatives," Mathematics, MDPI, vol. 6(1), pages 1-15, December.
    19. , Aisdl, 2019. "What Citizenship for What Transition?: Contradictions, Ambivalence, and Promises in Post-Socialist Citizenship Education in Vietnam," OSF Preprints jyqp5, Center for Open Science.
    20. Héctor Manuel Zárate S., 2005. "Cambios en la estructura salarial: una historia desde la regresión cuanfílica," Monetaria, CEMLA, vol. 0(4), pages 339-364, octubre-d.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:orspec:v:43:y:2021:i:3:d:10.1007_s00291-021-00627-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.