IDEAS home Printed from https://ideas.repec.org/p/ifs/cemmap/08-18.html
   My bibliography  Save this paper

Multiscale clustering of nonparametric regression curves

Author

Listed:
  • Michael Vogt

    (Institute for Fiscal Studies)

  • Oliver Linton

    (Institute for Fiscal Studies and University of Cambridge)

Abstract

We study a longitudinal data model with nonparametric regression functions that may vary across the observed subjects. In a wide range of applications, it is natural to assume that not every subject has a completely different regression function. We may rather suppose that the observed subjects can be grouped into a small number of classes whose members share the same regression curve. We develop a bandwidth-free clustering method to estimate the unknown group structure from the data. More speci cally, we construct estimators of the unknown classes and their unknown number which are free of classical bandwidth or smoothing parameters. In the theoretical part of the paper, we analyze the statistical properties of our estimators. The technical analysis is complemented by a simulation study and an application to temperature anomaly data.

Suggested Citation

  • Michael Vogt & Oliver Linton, 2018. "Multiscale clustering of nonparametric regression curves," CeMMAP working papers CWP08/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
  • Handle: RePEc:ifs:cemmap:08/18
    as

    Download full text from publisher

    File URL: https://www.ifs.org.uk/uploads/CWP081818.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Jeng‐Min Chiou & Pai‐Ling Li, 2007. "Functional clustering and identifying substructures of longitudinal data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(4), pages 679-699, September.
    2. Stéphane Bonhomme & Elena Manresa, 2015. "Grouped Patterns of Heterogeneity in Panel Data," Econometrica, Econometric Society, vol. 83(3), pages 1147-1184, May.
    3. Armstrong, Timothy B. & Chan, Hock Peng, 2016. "Multiscale adaptive inference on conditional moment inequalities," Journal of Econometrics, Elsevier, vol. 194(1), pages 24-43.
    4. Wuyi Wang & Peter C. B. Phillips & Liangjun Su, 2018. "Homogeneity pursuit in panel data models: Theory and application," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 33(6), pages 797-815, September.
    5. Michael Vogt & Oliver Linton, 2014. "Nonparametric estimation of a periodic sequence in the presence of a smooth trend," Biometrika, Biometrika Trust, vol. 101(1), pages 121-140.
    6. Liangjun Su & Zhentao Shi & Peter C. B. Phillips, 2016. "Identifying Latent Structures in Panel Data," Econometrica, Econometric Society, vol. 84, pages 2215-2264, November.
    7. Andrews, Donald W K, 1991. "Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimation," Econometrica, Econometric Society, vol. 59(3), pages 817-858, May.
    8. Lena Boneva & Oliver Linton & Michael Vogt, 2016. "The Effect of Fragmentation in Trading on Market Quality in the UK Equity Market," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 31(1), pages 192-213, January.
    9. Robert M. De Jong & James Davidson, 2000. "Consistency of Kernel Estimators of Heteroscedastic and Autocorrelated Covariance Matrices," Econometrica, Econometric Society, vol. 68(2), pages 407-424, March.
    10. Hannig, J. & Marron, J.S., 2006. "Advanced Distribution Theory for SiZer," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 484-499, June.
    11. Boneva, Lena & Linton, Oliver & Vogt, Michael, 2015. "A semiparametric model for heterogeneous panel data with fixed effects," Journal of Econometrics, Elsevier, vol. 188(2), pages 327-345.
    12. Horowitz, Joel L & Spokoiny, Vladimir G, 2001. "An Adaptive, Rate-Optimal Test of a Parametric Mean-Regression Model against a Nonparametric Alternative," Econometrica, Econometric Society, vol. 69(3), pages 599-631, May.
    13. Julien Jacques & Cristian Preda, 2014. "Functional data clustering: a survey," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 231-255, September.
    14. C. Abraham & P. A. Cornillon & E. Matzner‐Løber & N. Molinari, 2003. "Unsupervised Curve Clustering using B‐Splines," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 30(3), pages 581-595, September.
    15. Hansen, Bruce E., 2008. "Uniform Convergence Rates For Kernel Estimation With Dependent Data," Econometric Theory, Cambridge University Press, vol. 24(3), pages 726-748, June.
    16. Tarpey, Thaddeus, 2007. "Linear Transformations and the k-Means Clustering Algorithm: Applications to Clustering Curves," The American Statistician, American Statistical Association, vol. 61, pages 34-40, February.
    17. Su, Liangjun & Ju, Gaosheng, 2018. "Identifying latent grouped patterns in panel data models with interactive fixed effects," Journal of Econometrics, Elsevier, vol. 206(2), pages 554-573.
    18. Michael Vogt & Oliver Linton, 2017. "Classification of non-parametric regression functions in longitudinal data models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 5-27, January.
    19. James G.M. & Sugar C.A., 2003. "Clustering for Sparsely Sampled Functional Data," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 397-408, January.
    20. Shubhankar Ray & Bani Mallick, 2006. "Functional clustering by Bayesian wavelet methods," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(2), pages 305-332, April.
    21. Degryse, H.A. & de Jong, F.C.J.M. & van Kervel, V.L., 2014. "The impact of dark trading and visible fragmentation on market quality," Other publications TiSEM a51b5d9e-2687-4972-930f-4, Tilburg University, School of Economics and Management.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Xiaorong Yang & Jia Chen & Degui Li & Runze Li, 2024. "Functional-Coefficient Quantile Regression for Panel Data with Latent Group Structure," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(3), pages 1026-1040, July.
    2. Dong Hwan Oh & Andrew J. Patton, 2021. "Dynamic Factor Copula Models with Estimated Cluster Assignments," Finance and Economics Discussion Series 2021-029r1, Board of Governors of the Federal Reserve System (U.S.), revised 06 May 2022.
    3. Degui Li & Bin Peng & Songqiao Tang & Weibiao Wu, 2023. "Inference of Grouped Time-Varying Network Vector Autoregression Models," Monash Econometrics and Business Statistics Working Papers 5/23, Monash University, Department of Econometrics and Business Statistics.
    4. Jia Chen, 2019. "Estimating latent group structure in time-varying coefficient panel data models," The Econometrics Journal, Royal Economic Society, vol. 22(3), pages 223-240.
    5. Zhentao Shi & Liangjun Su & Tian Xie, 2020. "L2-Relaxation: With Applications to Forecast Combination and Portfolio Analysis," Papers 2010.09477, arXiv.org, revised Aug 2022.
    6. Oh, Dong Hwan & Patton, Andrew J., 2023. "Dynamic factor copula models with estimated cluster assignments," Journal of Econometrics, Elsevier, vol. 237(2).
    7. Su, Liangjun & Wang, Wuyi & Xu, Xingbai, 2023. "Identifying latent group structures in spatial dynamic panels," Journal of Econometrics, Elsevier, vol. 235(2), pages 1955-1980.
    8. Degui Li & Bin Peng & Songqiao Tang & Weibiao Wu, 2023. "Estimation of Grouped Time-Varying Network Vector Autoregression Models," Papers 2303.10117, arXiv.org, revised Mar 2024.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael Vogt & Oliver Linton, 2015. "Classification of nonparametric regression functions in heterogeneous panels," CeMMAP working papers CWP06/15, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    2. Michael Vogt & Oliver Linton, 2015. "Classification of nonparametric regression functions in heterogeneous panels," CeMMAP working papers 06/15, Institute for Fiscal Studies.
    3. Michael Vogt & Oliver Linton, 2017. "Classification of non-parametric regression functions in longitudinal data models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(1), pages 5-27, January.
    4. Li, Pai-Ling & Chiou, Jeng-Min, 2011. "Identifying cluster number for subspace projected functional data clustering," Computational Statistics & Data Analysis, Elsevier, vol. 55(6), pages 2090-2103, June.
    5. Miao, Ke & Su, Liangjun & Wang, Wendun, 2020. "Panel threshold regressions with latent group structures," Journal of Econometrics, Elsevier, vol. 214(2), pages 451-481.
    6. Wang, Yiren & Phillips, Peter C.B. & Su, Liangjun, 2024. "Panel data models with time-varying latent group structures," Journal of Econometrics, Elsevier, vol. 240(1).
    7. Su, Liangjun & Wang, Wuyi & Xu, Xingbai, 2023. "Identifying latent group structures in spatial dynamic panels," Journal of Econometrics, Elsevier, vol. 235(2), pages 1955-1980.
    8. Yiren Wang & Liangjun Su & Yichong Zhang, 2022. "Low-rank Panel Quantile Regression: Estimation and Inference," Papers 2210.11062, arXiv.org.
    9. Wang, Wuyi & Su, Liangjun, 2021. "Identifying latent group structures in nonlinear panels," Journal of Econometrics, Elsevier, vol. 220(2), pages 272-295.
    10. Denis Chetverikov & Elena Manresa, 2022. "Spectral and post-spectral estimators for grouped panel data models," Papers 2212.13324, arXiv.org, revised Dec 2022.
    11. Gao, Jiti & Xia, Kai & Zhu, Huanjun, 2020. "Heterogeneous panel data models with cross-sectional dependence," Journal of Econometrics, Elsevier, vol. 219(2), pages 329-353.
    12. Mehrabani, Ali, 2023. "Estimation and identification of latent group structures in panel data," Journal of Econometrics, Elsevier, vol. 235(2), pages 1464-1482.
    13. Zhentao Shi & Liangjun Su & Tian Xie, 2020. "L2-Relaxation: With Applications to Forecast Combination and Portfolio Analysis," Papers 2010.09477, arXiv.org, revised Aug 2022.
    14. Boyuan Zhang, 2022. "Incorporating Prior Knowledge of Latent Group Structure in Panel Data Models," Papers 2211.16714, arXiv.org, revised Oct 2023.
    15. Carlos Barrera-Causil & Juan Carlos Correa & Andrew Zamecnik & Francisco Torres-Avilés & Fernando Marmolejo-Ramos, 2021. "An FDA-Based Approach for Clustering Elicited Expert Knowledge," Stats, MDPI, vol. 4(1), pages 1-21, March.
    16. Jacques, Julien & Preda, Cristian, 2014. "Model-based clustering for multivariate functional data," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 92-106.
    17. Nibbering, D. & Paap, R., 2019. "Panel Forecasting with Asymmetric Grouping," Econometric Institute Research Papers EI-2019-30, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
    18. Yaeji Lim & Hee-Seok Oh & Ying Kuen Cheung, 2019. "Multiscale Clustering for Functional Data," Journal of Classification, Springer;The Classification Society, vol. 36(2), pages 368-391, July.
    19. Xiaorong Yang & Jia Chen & Degui Li & Runze Li, 2024. "Functional-Coefficient Quantile Regression for Panel Data with Latent Group Structure," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(3), pages 1026-1040, July.
    20. Saptorshee Kanto Chakraborty & Massimiliano Mazzanti, 2021. "Revisiting the literature on the dynamic Environmental Kuznets Curves using a latent structure approach," Economia Politica: Journal of Analytical and Institutional Economics, Springer;Fondazione Edison, vol. 38(3), pages 923-941, October.

    More about this item

    Keywords

    Clustering of nonparametric curves; nonparametric regression; multiscale statistics; longitudinal/panel data;
    All these keywords.

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C38 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Classification Methdos; Cluster Analysis; Principal Components; Factor Analysis
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ifs:cemmap:08/18. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Emma Hyman (email available below). General contact details of provider: https://edirc.repec.org/data/cmifsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.