IDEAS home Printed from https://ideas.repec.org/a/taf/jnlbes/v41y2023i3p737-751.html
   My bibliography  Save this article

Testing for Unobserved Heterogeneity via k-means Clustering

Author

Listed:
  • Andrew J. Patton
  • Brian M. Weller

Abstract

Clustering methods such as k-means have found widespread use in a variety of applications. This article proposes a split-sample testing procedure to determine whether a null hypothesis of a single cluster, indicating homogeneity of the data, can be rejected in favor of multiple clusters. The test is simple to implement, valid under mild conditions (including nonnormality, and heterogeneity of the data in aspects beyond those in the clustering analysis), and applicable in a range of contexts (including clustering when the time series dimension is small, or clustering on parameters other than the mean). We verify that the test has good size control in finite samples, and we illustrate the test in applications to clustering vehicle manufacturers and U.S. mutual funds.

Suggested Citation

  • Andrew J. Patton & Brian M. Weller, 2023. "Testing for Unobserved Heterogeneity via k-means Clustering," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(3), pages 737-751, July.
  • Handle: RePEc:taf:jnlbes:v:41:y:2023:i:3:p:737-751
    DOI: 10.1080/07350015.2022.2061983
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/07350015.2022.2061983
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/07350015.2022.2061983?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Stéphane Bonhomme & Elena Manresa, 2015. "Grouped Patterns of Heterogeneity in Panel Data," Econometrica, Econometric Society, vol. 83(3), pages 1147-1184, May.
    2. Hansen, Christian B., 2007. "Asymptotic properties of a robust variance matrix estimator for panel data when T is large," Journal of Econometrics, Elsevier, vol. 141(2), pages 597-620, December.
    3. Pesaran, M. Hashem, 2015. "Time Series and Panel Data Econometrics," OUP Catalogue, Oxford University Press, number 9780198759980.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hansen, Christian & Liao, Yuan, 2019. "The Factor-Lasso And K-Step Bootstrap Approach For Inference In High-Dimensional Economic Applications," Econometric Theory, Cambridge University Press, vol. 35(3), pages 465-509, June.
    2. Denis Chetverikov & Elena Manresa, 2022. "Spectral and post-spectral estimators for grouped panel data models," Papers 2212.13324, arXiv.org, revised Dec 2022.
    3. Ryo Okui & Takahide Yanagi, 2020. "Kernel estimation for panel data with heterogeneous dynamics," The Econometrics Journal, Royal Economic Society, vol. 23(1), pages 156-175.
    4. Zhan Gao & M. Hashem Pesaran, 2023. "Identification and estimation of categorical random coefficient models," Empirical Economics, Springer, vol. 64(6), pages 2543-2588, June.
    5. Kopp, Thomas & Nabernegg, Markus & Lange, Steffen, 2023. "The net climate effect of digitalization, differentiating between firms and households," Energy Economics, Elsevier, vol. 126(C).
    6. Claudia García-García & Catalina B. García-García & Román Salmerón, 2021. "Confronting collinearity in environmental regression models: evidence from world data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(3), pages 895-926, September.
    7. Ghosh, Soumya Kanti & Nath, Hiranya K., 2023. "What determines private and household savings in India?," International Review of Economics & Finance, Elsevier, vol. 86(C), pages 639-651.
    8. Chudik, Alexander & Pesaran, M. Hashem, 2019. "Mean group estimation in presence of weakly cross-correlated estimators," Economics Letters, Elsevier, vol. 175(C), pages 101-105.
    9. Francisco Javier Forcadell & Fernando Úbeda, 2022. "Individual entrepreneurial orientation and performance: the mediating role of international entrepreneurship," International Entrepreneurship and Management Journal, Springer, vol. 18(2), pages 875-900, June.
    10. Emmanuel Anyigbah & Yusheng Kong & Bless Kofi Edziah & Ahotovi Thomas Ahoto & Wilhelmina Seyome Ahiaku, 2023. "Board Characteristics and Corporate Sustainability Reporting: Evidence from Chinese Listed Companies," Sustainability, MDPI, vol. 15(4), pages 1-26, February.
    11. Nikolov, Plamen & Adelman, Alan, 2019. "Do private household transfers to the elderly respond to public pension benefits? Evidence from rural China," The Journal of the Economics of Ageing, Elsevier, vol. 14(C).
    12. Andrii Babii & Ryan T. Ball & Eric Ghysels & Jonas Striaukas, 2024. "Panel data nowcasting: The case of price–earnings ratios," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(2), pages 292-307, March.
    13. Rok Spruk & Mitja Kovac, 2018. "Inefficient Growth," Review of Economics and Institutions, Università di Perugia, vol. 9(2).
    14. James G. MacKinnon & Matthew D. Webb & Morten Ø. Nielsen, 2017. "Bootstrap And Asymptotic Inference With Multiway Clustering," Working Paper 1386, Economics Department, Queen's University.
    15. Bowei Guo & Giorgio Castagneto Gissey, 2019. "Cost Pass-through in the British Wholesale Electricity Market: Implications of Brexit and the ETS reform," Working Papers EPRG1937, Energy Policy Research Group, Cambridge Judge Business School, University of Cambridge.
    16. Hagemann, Andreas, 2019. "Placebo inference on treatment effects when the number of clusters is small," Journal of Econometrics, Elsevier, vol. 213(1), pages 190-209.
    17. Polemis, Michael & Tselekounis, Markos, 2019. "Does deregulation drive innovation intensity? Lessons learned from the OECD telecommunications sector," MPRA Paper 92770, University Library of Munich, Germany.
    18. Nibbering, D. & Paap, R., 2019. "Panel Forecasting with Asymmetric Grouping," Econometric Institute Research Papers EI-2019-30, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
    19. Manuel Arellano & Stéphane Bonhomme & Micole De Vera & Laura Hospido & Siqi Wei, 2022. "Income risk inequality: Evidence from Spanish administrative records," Quantitative Economics, Econometric Society, vol. 13(4), pages 1747-1801, November.
    20. Arestis, Philip & Ferreiro, Jesus & Gomez, Carmen, 2023. "Does employment protection legislation affect employment and unemployment?11We acknowledge the comments of an editor and an associate editor of the journal and three reviewers. Their suggestions and r," Economic Modelling, Elsevier, vol. 126(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:jnlbes:v:41:y:2023:i:3:p:737-751. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/UBES20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.