
Regression with Highly Correlated Predictors: Variable Omission Is Not the Solution

Authors

Listed:
  • Mariella Gregorich

    (Section for Clinical Biometrics, Center for Medical Statistics, Informatics and Intelligent Systems, Medical University of Vienna, 1090 Vienna, Austria)

  • Susanne Strohmaier

    (Section for Clinical Biometrics, Center for Medical Statistics, Informatics and Intelligent Systems, Medical University of Vienna, 1090 Vienna, Austria; Center for Public Health, Department of Epidemiology, Medical University of Vienna, 1090 Vienna, Austria)

  • Daniela Dunkler

    (Section for Clinical Biometrics, Center for Medical Statistics, Informatics and Intelligent Systems, Medical University of Vienna, 1090 Vienna, Austria)

  • Georg Heinze

    (Section for Clinical Biometrics, Center for Medical Statistics, Informatics and Intelligent Systems, Medical University of Vienna, 1090 Vienna, Austria)

Abstract

Regression models have been in use for decades to explore and quantify the association between a dependent response and several independent variables in environmental sciences, epidemiology and public health. However, researchers often encounter situations in which some independent variables exhibit high bivariate correlation, or may even be collinear. Improper statistical handling of this situation will most certainly generate models of little or no practical use and misleading interpretations. By means of two example studies, we demonstrate how diagnostic tools for collinearity or near-collinearity may fail to guide the analyst. Instead, the most appropriate way of handling collinearity should be driven by the research question at hand and, in particular, by the distinction between predictive and explanatory aims.
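
As a rough illustration of what a collinearity diagnostic looks like in practice, the sketch below computes variance inflation factors (VIFs) for simulated predictors. The VIF is one commonly used diagnostic; the abstract does not say which diagnostics the article examines, so the choice of VIF, the function name variance_inflation_factors, and the simulated data are all assumptions made for illustration only.

```python
import numpy as np

def variance_inflation_factors(X):
    """Return the VIF of each column of X (rows are observations).

    Hypothetical helper for illustration. It uses the identity that the
    j-th diagonal entry of the inverse correlation matrix of the
    predictors equals 1 / (1 - R_j^2), where R_j^2 comes from regressing
    predictor j on all the other predictors.
    """
    X = np.asarray(X, dtype=float)
    corr = np.corrcoef(X, rowvar=False)
    return np.diag(np.linalg.inv(corr))

# Simulated data (an assumption, not data from the article):
# x2 is almost a copy of x1, x3 is independent of both.
rng = np.random.default_rng(0)
n = 500
x1 = rng.normal(size=n)
x2 = x1 + 0.1 * rng.normal(size=n)
x3 = rng.normal(size=n)

vifs = variance_inflation_factors(np.column_stack([x1, x2, x3]))
print(vifs)  # large values for x1 and x2, a value near 1 for x3
```

A large VIF only signals that a coefficient is estimated imprecisely given the other predictors. In line with the abstract's argument, it does not by itself say whether a variable should be dropped; that depends on whether the aim is prediction or explanation.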

Suggested Citation

  • Mariella Gregorich & Susanne Strohmaier & Daniela Dunkler & Georg Heinze, 2021. "Regression with Highly Correlated Predictors: Variable Omission Is Not the Solution," IJERPH, MDPI, vol. 18(8), pages 1-12, April.
  • Handle: RePEc:gam:jijerp:v:18:y:2021:i:8:p:4259-:d:537958

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/18/8/4259/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/18/8/4259/
    Download Restriction: no

    References listed on IDEAS

    1. Andrews, Donald W K, 1991. "Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimation," Econometrica, Econometric Society, vol. 59(3), pages 817-858, May.
    2. Zeileis, Achim, 2004. "Econometric Computing with HC and HAC Covariance Matrix Estimators," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 11(i10).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ball, Laurence & Carvalho, Carlos & Evans, Christopher & Antonio Ricci, Luca, 2024. "Weighted Median Inflation Around the World: A Measure of Core Inflation," Journal of International Money and Finance, Elsevier, vol. 142(C).
    2. Timo Dimitriadis & Xiaochun Liu & Julie Schnaitmann, 2020. "Encompassing Tests for Value at Risk and Expected Shortfall Multi-Step Forecasts based on Inference on the Boundary," Papers 2009.07341, arXiv.org.
    3. Malte Knüppel & Fabian Krüger, 2022. "Forecast uncertainty, disagreement, and the linear pool," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(1), pages 23-41, January.
    4. Zeileis, Achim, 2006. "Implementing a class of structural change tests: An econometric computing approach," Computational Statistics & Data Analysis, Elsevier, vol. 50(11), pages 2987-3008, July.
    5. Lupi, Claudio, 2009. "Unit Root CADF Testing with R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i02).
    6. Fruehwirt, Wolfgang & Hochfilzer, Leonhard & Weydemann, Leonard & Roberts, Stephen, 2021. "Cumulation, crash, coherency: A cryptocurrency bubble wavelet analysis," Finance Research Letters, Elsevier, vol. 40(C).
    7. Harry J. Paarsch & Alberto M. Segre & John P. Roberts & Jeffrey B. Halldorson, 2011. "Competition and Post-Transplant Outcomes in Cadaveric Liver Transplantation under the MELD Scoring System," Carlo Alberto Notebooks 213, Collegio Carlo Alberto.
    8. Ding, Peng, 2021. "The Frisch–Waugh–Lovell theorem for standard errors," Statistics & Probability Letters, Elsevier, vol. 168(C).
    9. Krüger, Jens J. & Hoss, Julian, 2012. "German business cycle forecasts, asymmetric loss and financial variables," Economics Letters, Elsevier, vol. 114(3), pages 284-287.
    10. Gavalas, Dimitris, 2015. "How do banks perform under Basel III? Tracing lending rates and loan quantity," Journal of Economics and Business, Elsevier, vol. 81(C), pages 21-37.
    11. Hartigan, Luke, 2018. "Alternative HAC covariance matrix estimators with improved finite sample properties," Computational Statistics & Data Analysis, Elsevier, vol. 119(C), pages 55-73.
    12. Kajal Lahiri & Liu Yang, 2018. "Confidence Bands for ROC Curves With Serially Dependent Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(1), pages 115-130, January.
    13. Zeileis, Achim, 2006. "Object-oriented Computation of Sandwich Estimators," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 16(i09).
    14. Lupi, Claudio, 2009. "Covariate Augmented Dickey-Fuller Tests with R," Economics & Statistics Discussion Papers esdp09051, University of Molise, Department of Economics.
    15. Agosto, Arianna & Cerchiello, Paola & Pagnottoni, Paolo, 2022. "Sentiment, Google queries and explosivity in the cryptocurrency market," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 605(C).
    16. Jochen Heberle & Cristina Sattarhoff, 2017. "A Fast Algorithm for the Computation of HAC Covariance Matrix Estimators," Econometrics, MDPI, vol. 5(1), pages 1-16, January.
    17. Ekvall, Karl Oskar, 2022. "Targeted principal components regression," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
    18. Lucio Capitani & Leo Pasquazzi, 2015. "Inference for performance measures for financial assets," METRON, Springer;Sapienza Università di Roma, vol. 73(1), pages 73-98, April.
    19. Preinerstorfer, David, 2014. "Finite Sample Properties of Tests Based on Prewhitened Nonparametric Covariance Estimators," MPRA Paper 58333, University Library of Munich, Germany.
    20. Stephen Taylor & Ming Fang, 2018. "Unbiased weighted variance and skewness estimators for overlapping returns," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 154(1), pages 1-8, December.
