IDEAS home Printed from https://ideas.repec.org/p/iza/izadps/dp12584.html
   My bibliography  Save this paper

Inference with Arbitrary Clustering

Author

Listed:
  • Colella, Fabrizio

    (University of Svizzera Italiana)

  • Lalive, Rafael

    (University of Lausanne)

  • Sakalli, Seyhun Orcan

    (Université de Lausanne)

  • Thoenig, Mathias

    (University of Lausanne)

Abstract

Analyses of spatial or network data are now very common. Nevertheless, statistical inference is challenging since unobserved heterogeneity can be correlated across neighboring observational units. We develop an estimator for the variance-covariance matrix (VCV) of OLS and 2SLS that allows for arbitrary dependence of the errors across observations in space or network structure and across time periods. As a proof of concept, we conduct Monte Carlo simulations in a geospatial setting based on U.S. metropolitan areas. Tests based on our estimator of the VCV asymptotically correctly reject the null hypothesis, whereas conventional inference methods, e.g., those without clusters or with clusters based on administrative units, reject the null hypothesis too often. We also provide simulations in a network setting based on the IDEAS structure of coauthorship and real-life data on scientific performance. The Monte Carlo results again show that our estimator yields inference at the correct significance level even in moderately sized samples and that it dominates other commonly used approaches to inference in networks. We provide guidance to the applied researcher with respect to (i) whether or not to include potentially correlated regressors and (ii) the choice of cluster bandwidth. Finally, we provide a companion statistical package (acreg) enabling users to adjust the OLS and 2SLS coefficient's standard errors to account for arbitrary dependence.

Suggested Citation

  • Colella, Fabrizio & Lalive, Rafael & Sakalli, Seyhun Orcan & Thoenig, Mathias, 2019. "Inference with Arbitrary Clustering," IZA Discussion Papers 12584, Institute of Labor Economics (IZA).
  • Handle: RePEc:iza:izadps:dp12584
    as

    Download full text from publisher

    File URL: https://docs.iza.org/dp12584.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Cameron, A. Colin & Gelbach, Jonah B. & Miller, Douglas L., 2011. "Robust Inference With Multiway Clustering," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(2), pages 238-249.
    2. A. Colin Cameron & Douglas L. Miller, 2015. "A Practitioner’s Guide to Cluster-Robust Inference," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 317-372.
    3. Stelios Michalopoulos & Elias Papaioannou, 2018. "Spatial Patterns of Development: A Meso Approach," Annual Review of Economics, Annual Reviews, vol. 10(1), pages 383-410, August.
    4. White, Halbert, 1980. "A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity," Econometrica, Econometric Society, vol. 48(4), pages 817-838, May.
    5. Marianne Bertrand & Esther Duflo & Sendhil Mullainathan, 2004. "How Much Should We Trust Differences-In-Differences Estimates?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 119(1), pages 249-275.
    6. Kelly, Morgan, 2019. "The Standard Errors of Persistence," CEPR Discussion Papers 13783, C.E.P.R. Discussion Papers.
    7. Kelejian, Harry H & Prucha, Ingmar R, 1998. "A Generalized Spatial Two-Stage Least Squares Procedure for Estimating a Spatial Autoregressive Model with Autoregressive Disturbances," The Journal of Real Estate Finance and Economics, Springer, vol. 17(1), pages 99-121, July.
    8. Kelejian, Harry H & Prucha, Ingmar R, 1999. "A Generalized Moments Estimator for the Autoregressive Parameter in a Spatial Model," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 40(2), pages 509-533, May.
    9. Morgan Kelly, 2019. "The Standard Errors of Persistence," Working Papers 201913, School of Economics, University College Dublin.
    10. Conley, T. G., 1999. "GMM estimation with cross sectional dependence," Journal of Econometrics, Elsevier, vol. 92(1), pages 1-45, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. James G. MacKinnon & Matthew D. Webb, 2020. "When and How to Deal with Clustered Errors in Regression Models," Working Paper 1421, Economics Department, Queen's University.
    2. Bruno Ferman, 2023. "Inference in difference‐in‐differences: How much should we trust in independent clusters?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 38(3), pages 358-369, April.
    3. Tiago Sequeira & Hugo Morão, 2020. "Growth accounting and regressions: New approach and results," International Economics, CEPII research center, issue 162, pages 67-79.
    4. Hansen, Bruce E. & Lee, Seojeong, 2019. "Asymptotic theory for clustered samples," Journal of Econometrics, Elsevier, vol. 210(2), pages 268-290.
    5. Kim, Min Seong & Sun, Yixiao, 2013. "Heteroskedasticity and spatiotemporal dependence robust inference for linear panel models with fixed effects," Journal of Econometrics, Elsevier, vol. 177(1), pages 85-108.
    6. Gibbons, Steve & Overman, Henry G. & Patacchini, Eleonora, 2015. "Spatial Methods," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 115-168, Elsevier.
    7. Rodolfo Metulini & Paolo Sgrignoli & Stefano Schiavo & Massimo Riccaboni, 2018. "The network of migrants and international trade," Economia Politica: Journal of Analytical and Institutional Economics, Springer;Fondazione Edison, vol. 35(3), pages 763-787, December.
    8. MacKinnon, James G. & Nielsen, Morten Ørregaard & Webb, Matthew D., 2023. "Cluster-robust inference: A guide to empirical practice," Journal of Econometrics, Elsevier, vol. 232(2), pages 272-299.
    9. Christian Helmers & Manasa Patnam, 2014. "Does the rotten child spoil his companion? Spatial peer effects among children in rural India," Quantitative Economics, Econometric Society, vol. 5, pages 67-121, March.
    10. Jeffrey D. Michler & Anna Josephson, 2022. "Recent developments in inference: practicalities for applied economics," Chapters, in: A Modern Guide to Food Economics, chapter 11, pages 235-268, Edward Elgar Publishing.
    11. A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 318, University of California, Davis, Department of Economics.
    12. MacKinnon, James G. & Nielsen, Morten Ørregaard & Webb, Matthew D., 2023. "Testing for the appropriate level of clustering in linear regression models," Journal of Econometrics, Elsevier, vol. 235(2), pages 2027-2056.
    13. Remi Jedwab & Felix Meier zu Selhausen & Alexander Moradi, 2022. "The economics of missionary expansion: evidence from Africa and implications for development," Journal of Economic Growth, Springer, vol. 27(2), pages 149-192, June.
    14. A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 106, University of California, Davis, Department of Economics.
    15. Paik, Christopher & Shahi, Keshar, 2023. "Ancient nomadic corridors and long-run development in the highlands of Asia," Explorations in Economic History, Elsevier, vol. 89(C).
    16. Moscone, F. & Tosetti, Elisa, 2015. "Robust estimation under error cross section dependence," Economics Letters, Elsevier, vol. 133(C), pages 100-104.
    17. Christian Helmers & Manasa Patnam, 2014. "Does the rotten child spoil his companion? Spatial peer effects among children in rural India," Quantitative Economics, Econometric Society, vol. 5, pages 67-121, 03.

    More about this item

    Keywords

    spatial correlation; cluster; network data; clustering; arbitrary; geospatial data; instrumental variables;
    All these keywords.

    JEL classification:

    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C23 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Models with Panel Data; Spatio-temporal Models
    • C26 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Instrumental Variables (IV) Estimation

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:iza:izadps:dp12584. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Holger Hinte (email available below). General contact details of provider: https://edirc.repec.org/data/izaaade.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.