IDEAS home Printed from https://ideas.repec.org/p/cda/wpaper/318.html
   My bibliography  Save this paper

Robust Inference with Clustered Data

Author

Listed:
  • A. Colin Cameron
  • Douglas L. Miller

    (Department of Economics, University of California Davis)

Abstract

In this paper we survey methods to control for regression model error that is correlated within groups or clusters, but is uncorrelated across groups or clusters. Then failure to control for the clustering can lead to understatement of standard errors and overstatement of statistical significance, as emphasized most notably in empirical studies by Moulton (1990) and Bertrand, Duflo and Mullainathan (2004). We emphasize OLS estimation with statistical inference based on minimal assumptions regarding the error correlation process. Complications we consider include cluster-specific fixed effects, few clusters, multi-way clustering, more efficient feasible GLS estimation, and adaptation to nonlinear and instrumental variables estimators.

Suggested Citation

  • A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 318, University of California, Davis, Department of Economics.
  • Handle: RePEc:cda:wpaper:318
    as

    Download full text from publisher

    File URL: https://repec.dss.ucdavis.edu/files/T29sWVnXspMuzCnsfhRU5HGV/10-6.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Keith Finlay & Leandro M. Magnusson, 2009. "Implementing weak-instrument robust tests for a general class of instrumental-variables models," Stata Journal, StataCorp LP, vol. 9(3), pages 398-421, September.
    2. James H. Stock & Mark W. Watson, 2008. "Heteroskedasticity-Robust Standard Errors for Fixed Effects Panel Data Regression," Econometrica, Econometric Society, vol. 76(1), pages 155-174, January.
    3. Lara Shore-Sheppard, 1996. "The Precision of Instrumental Variables Estimates With Grouped Data," Working Papers 753, Princeton University, Department of Economics, Industrial Relations Section..
    4. A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
    5. Pepper, John V., 2002. "Robust inferences from random clustered samples: an application using data from the panel study of income dynamics," Economics Letters, Elsevier, vol. 75(3), pages 341-345, May.
    6. A. Colin Cameron & Natalia Golotvina, 2005. "Estimation of Country-Pair Data Models Controlling for Clustered Errors: with International Trade Applications," Working Papers 182, University of California, Davis, Department of Economics.
    7. Hersch, Joni, 1998. "Compensating Differentials for Gender-Specific Job Injury Risks," American Economic Review, American Economic Association, vol. 88(3), pages 598-627, June.
    8. A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2011. "Robust Inference With Multiway Clustering," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 29(2), pages 238-249, April.
    9. White, Halbert, 1980. "A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity," Econometrica, Econometric Society, vol. 48(4), pages 817-838, May.
    10. Caroline Hoxby & M. Daniele Paserman, 1998. "Overidentification Tests with Grouped Data," NBER Technical Working Papers 0223, National Bureau of Economic Research, Inc.
    11. Chernozhukov, Victor & Hansen, Christian, 2008. "The reduced form: A simple approach to inference with weak instruments," Economics Letters, Elsevier, vol. 100(1), pages 68-71, July.
    12. Arellano, M, 1987. "Computing Robust Standard Errors for Within-Groups Estimators," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 49(4), pages 431-434, November.
    13. Blundell,Richard & Newey,Whitney K. & Persson,Torsten (ed.), 2007. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9780521871532.
    14. Mitchell A. Petersen, 2009. "Estimating Standard Errors in Finance Panel Data Sets: Comparing Approaches," The Review of Financial Studies, Society for Financial Studies, vol. 22(1), pages 435-480, January.
    15. Davis, Peter, 2002. "Estimating multi-way error components models with unbalanced data structures," Journal of Econometrics, Elsevier, vol. 106(1), pages 67-95, January.
    16. MacKinnon, James G. & White, Halbert, 1985. "Some heteroskedasticity-consistent covariance matrix estimators with improved finite sample properties," Journal of Econometrics, Elsevier, vol. 29(3), pages 305-325, September.
    17. Jeffrey M. Wooldridge, 2003. "Cluster-Sample Methods in Applied Econometrics," American Economic Review, American Economic Association, vol. 93(2), pages 133-138, May.
    18. Hausman, Jerry & Kuersteiner, Guido, 2008. "Difference in difference meets generalized least squares: Higher order properties of hypotheses tests," Journal of Econometrics, Elsevier, vol. 144(2), pages 371-391, June.
    19. Lara D. Shore-Sheppard, 1996. "The Precision of Instrumental Variables Estimates With Grouped Data," Working Papers 753, Princeton University, Department of Economics, Industrial Relations Section..
    20. John C. Driscoll & Aart C. Kraay, 1998. "Consistent Covariance Matrix Estimation With Spatially Dependent Panel Data," The Review of Economics and Statistics, MIT Press, vol. 80(4), pages 549-560, November.
    21. Fafchamps, Marcel & Gubert, Flore, 2007. "The formation of risk sharing networks," Journal of Development Economics, Elsevier, vol. 83(2), pages 326-350, July.
    22. Stephen G. Donald & Kevin Lang, 2007. "Inference with Difference-in-Differences and Other Panel Data," The Review of Economics and Statistics, MIT Press, vol. 89(2), pages 221-233, May.
    23. Bhattacharya, Debopam, 2005. "Asymptotic inference from multi-stage samples," Journal of Econometrics, Elsevier, vol. 126(1), pages 145-171, May.
    24. Hansen, Christian B., 2007. "Asymptotic properties of a robust variance matrix estimator for panel data when T is large," Journal of Econometrics, Elsevier, vol. 141(2), pages 597-620, December.
    25. Hansen, Christian B., 2007. "Generalized least squares inference in panel and multilevel models with serial correlation and fixed effects," Journal of Econometrics, Elsevier, vol. 140(2), pages 670-694, October.
    26. A. Colin Cameron & Natalia Golotvina, 2005. "Estimation of Country-Pair Data Models Controlling for Clustered Errors: with International Trade Applications," Working Papers 613, University of California, Davis, Department of Economics.
    27. Blundell,Richard & Newey,Whitney & Persson,Torsten (ed.), 2007. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9780521871549.
    28. Marianne Bertrand & Esther Duflo & Sendhil Mullainathan, 2004. "How Much Should We Trust Differences-In-Differences Estimates?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 119(1), pages 249-275.
    29. Kiefer, Nicholas M., 1980. "Estimation of fixed effect models for time series of cross-sections with arbitrary intertemporal covariance," Journal of Econometrics, Elsevier, vol. 14(2), pages 195-202, October.
    30. Blundell,Richard & Newey,Whitney & Persson,Torsten (ed.), 2007. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9780521692106.
    31. Greenwald, Bruce C., 1983. "A general analysis of bias in the estimated standard errors of least squares coefficients," Journal of Econometrics, Elsevier, vol. 22(3), pages 323-338, August.
    32. Conley, T. G., 1999. "GMM estimation with cross sectional dependence," Journal of Econometrics, Elsevier, vol. 92(1), pages 1-45, September.
    33. Christopher L. Foote, 2007. "Space and time in macroeconomic panel data: young workers and state-level unemployment revisited," Working Papers 07-10, Federal Reserve Bank of Boston.
    34. Ibragimov, Rustam & Müller, Ulrich K., 2010. "t-Statistic Based Correlation and Heterogeneity Robust Inference," Journal of Business & Economic Statistics, American Statistical Association, vol. 28(4), pages 453-468.
    35. White, Halbert & Domowitz, Ian, 1984. "Nonlinear Regression with Dependent Observations," Econometrica, Econometric Society, vol. 52(1), pages 143-161, January.
    36. repec:dau:papers:123456789/4392 is not listed on IDEAS
    37. Moulton, Brent R., 1986. "Random group effects and the precision of regression estimates," Journal of Econometrics, Elsevier, vol. 32(3), pages 385-397, August.
    38. Moulton, Brent R, 1990. "An Illustration of a Pitfall in Estimating the Effects of Aggregate Variables on Micro Unit," The Review of Economics and Statistics, MIT Press, vol. 72(2), pages 334-338, May.
    39. repec:fth:prinin:374 is not listed on IDEAS
    40. Blundell,Richard & Newey,Whitney K. & Persson,Torsten (ed.), 2007. "Advances in Economics and Econometrics," Cambridge Books, Cambridge University Press, number 9780521692090.
    41. Kloek, T, 1981. "OLS Estimation in a Model Where a Microvariable Is Explained by Aggregates and Contemporaneous Disturbances Are Equicorrelated," Econometrica, Econometric Society, vol. 49(1), pages 205-207, January.
    42. White, Halbert, 1982. "Maximum Likelihood Estimation of Misspecified Models," Econometrica, Econometric Society, vol. 50(1), pages 1-25, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. A. Colin Cameron & Douglas L. Miller, 2010. "Robust Inference with Clustered Data," Working Papers 106, University of California, Davis, Department of Economics.
    2. A. Colin Cameron & Douglas L. Miller, 2015. "A Practitioner’s Guide to Cluster-Robust Inference," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 317-372.
    3. Jonah B. Gelbach & Doug Miller, 2009. "Robust Inference with Multi-way Clustering," Working Papers 226, University of California, Davis, Department of Economics.
    4. Hansen, Bruce E. & Lee, Seojeong, 2019. "Asymptotic theory for clustered samples," Journal of Econometrics, Elsevier, vol. 210(2), pages 268-290.
    5. Pakel, Cavit, 2019. "Bias reduction in nonlinear and dynamic panels in the presence of cross-section dependence," Journal of Econometrics, Elsevier, vol. 213(2), pages 459-492.
    6. Alberto Abadie & Susan Athey & Guido W Imbens & Jeffrey M Wooldridge, 2023. "When Should You Adjust Standard Errors for Clustering?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 138(1), pages 1-35.
    7. A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
    8. Cameron, A. Colin & Gelbach, Jonah B. & Miller, Douglas L., 2011. "Robust Inference With Multiway Clustering," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(2), pages 238-249.
    9. Rok Spruk, 2019. "The rise and fall of Argentina," Latin American Economic Review, Springer;Centro de Investigaciòn y Docencia Económica (CIDE), vol. 28(1), pages 1-40, December.
    10. James G. MacKinnon & Matthew D. Webb, 2020. "When and How to Deal with Clustered Errors in Regression Models," Working Paper 1421, Economics Department, Queen's University.
    11. Rok Spruk & Mitja Kovac, 2018. "Inefficient Growth," Review of Economics and Institutions, Università di Perugia, vol. 9(2).
    12. Vikström, Johan, 2009. "Cluster sample inference using sensitivity analysis: the case with few groups," Working Paper Series 2009:15, IFAU - Institute for Evaluation of Labour Market and Education Policy.
    13. Mitja Kovac & Salvini Datta & Rok Spruk, 2021. "Pharmaceutical Product Liability, Litigation Regimes, and the Propensity to Patent: An Empirical Firm-Level Investigation," SAGE Open, , vol. 11(2), pages 21582440211, April.
    14. Miroslav Verbič & Rok Spruk, 2019. "Political economy of pension reforms: an empirical investigation," European Journal of Law and Economics, Springer, vol. 47(2), pages 171-232, April.
    15. Kim, Min Seong & Sun, Yixiao, 2013. "Heteroskedasticity and spatiotemporal dependence robust inference for linear panel models with fixed effects," Journal of Econometrics, Elsevier, vol. 177(1), pages 85-108.
    16. A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
    17. Timothy Conley & Silvia Gonçalves & Christian Hansen, 2018. "Inference with Dependent Data in Accounting and Finance Applications," Journal of Accounting Research, Wiley Blackwell, vol. 56(4), pages 1139-1203, September.
    18. Rok Spruk & Mitja Kovac, 2019. "Transaction costs and economic growth under common legal system: State‐level evidence from Mexico," Economics and Politics, Wiley Blackwell, vol. 31(2), pages 240-292, July.
    19. Guido W. Imbens & Jeffrey M. Wooldridge, 2009. "Recent Developments in the Econometrics of Program Evaluation," Journal of Economic Literature, American Economic Association, vol. 47(1), pages 5-86, March.
    20. MacKinnon, James G. & Nielsen, Morten Ørregaard & Webb, Matthew D., 2023. "Cluster-robust inference: A guide to empirical practice," Journal of Econometrics, Elsevier, vol. 232(2), pages 272-299.

    More about this item

    Keywords

    Cluster robust; random e ects; xed e ects; di erences in di erences; cluster bootstrap; few clusters; multi-way clusters.;
    All these keywords.

    JEL classification:

    • C12 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Hypothesis Testing: General
    • C21 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models
    • C23 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Models with Panel Data; Spatio-temporal Models

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cda:wpaper:318. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Letters and Science IT Services Unit (email available below). General contact details of provider: https://edirc.repec.org/data/educdus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.