IDEAS home Printed from https://ideas.repec.org/a/spr/stmapp/v24y2015i2p177-202.html
   My bibliography  Save this article

Multivariate functional outlier detection

Author

Listed:
  • Mia Hubert
  • Peter Rousseeuw
  • Pieter Segaert

Abstract

Functional data are occurring more and more often in practice, and various statistical techniques have been developed to analyze them. In this paper we consider multivariate functional data, where for each curve and each time point a $$p$$ p -dimensional vector of measurements is observed. For functional data the study of outlier detection has started only recently, and was mostly limited to univariate curves $$(p=1)$$ ( p = 1 ) . In this paper we set up a taxonomy of functional outliers, and construct new numerical and graphical techniques for the detection of outliers in multivariate functional data, with univariate curves included as a special case. Our tools include statistical depth functions and distance measures derived from them. The methods we study are affine invariant in $$p$$ p -dimensional space, and do not assume elliptical or any other symmetry. Copyright Springer-Verlag Berlin Heidelberg 2015

Suggested Citation

  • Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2015. "Multivariate functional outlier detection," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 177-202, July.
  • Handle: RePEc:spr:stmapp:v:24:y:2015:i:2:p:177-202
    DOI: 10.1007/s10260-015-0297-8
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s10260-015-0297-8
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10260-015-0297-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gerda Claeskens & Mia Hubert & Leen Slaets & Kaveh Vakili, 2014. "Multivariate Functional Halfspace Depth," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 411-423, March.
    2. Struyf, Anja & Rousseeuw, Peter J., 2000. "High-dimensional computation of the deepest location," Computational Statistics & Data Analysis, Elsevier, vol. 34(4), pages 415-426, October.
    3. López-Pintado, Sara & Romo, Juan, 2009. "On the Concept of Depth for Functional Data," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 718-734.
    4. Cuevas, Antonio & Febrero, Manuel & Fraiman, Ricardo, 2006. "On the use of the bootstrap for estimating functions with functional data," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 1063-1074, November.
    5. Marc Hallin & Davy Paindaveine & Miroslav Siman, 2008. "Multivariate quantiles and multiple-output regression quantiles: from L1 optimization to halfspace depth," Working Papers ECARES 2008_042, ULB -- Universite Libre de Bruxelles.
    6. Masse, J. C. & Theodorescu, R., 1994. "Halfplane Trimming for Bivariate Distributions," Journal of Multivariate Analysis, Elsevier, vol. 48(2), pages 188-202, February.
    7. Davy Paindaveine & Miroslav Šiman, 2012. "Computing multiple-output regression quantile regions from projection quantiles," Computational Statistics, Springer, vol. 27(1), pages 29-49, March.
    8. Berrendero, J.R. & Justel, A. & Svarc, M., 2011. "Principal components for multivariate functional data," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2619-2634, September.
    9. Peter J. Rousseeuw & Ida Ruts, 1996. "Bivariate Location Depth," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 45(4), pages 516-526, December.
    10. López-Pintado, Sara & Romo, Juan, 2011. "A half-region depth for functional data," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1679-1695, April.
    11. Struyf, Anja J. & Rousseeuw, Peter J., 1999. "Halfspace Depth and Regression Depth Characterize the Empirical Distribution," Journal of Multivariate Analysis, Elsevier, vol. 69(1), pages 135-153, April.
    12. Romanazzi, Mario, 2001. "Influence Function of Halfspace Depth," Journal of Multivariate Analysis, Elsevier, vol. 77(1), pages 138-161, April.
    13. Rob J. Hyndman & Han Lin Shang, 2008. "Rainbow plots, Bagplots and Boxplots for Functional Data," Monash Econometrics and Business Statistics Working Papers 9/08, Monash University, Department of Econometrics and Business Statistics.
    14. Mia Hubert & Stephan Van der Veeken, 2010. "Robust classification for skewed data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 4(4), pages 239-254, December.
    15. Sara López-Pintado & Ying Sun & Juan Lin & Marc Genton, 2014. "Simplicial band depth for multivariate functional data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 321-338, September.
    16. Ruts, Ida & Rousseeuw, Peter J., 1996. "Computing depth contours of bivariate point clouds," Computational Statistics & Data Analysis, Elsevier, vol. 23(1), pages 153-168, November.
    17. Hubert, M. & Vandervieren, E., 2008. "An adjusted boxplot for skewed distributions," Computational Statistics & Data Analysis, Elsevier, vol. 52(12), pages 5186-5201, August.
    18. Antonio Cuevas & Manuel Febrero & Ricardo Fraiman, 2007. "Robust estimation and classification for functional data via projection-based depth notions," Computational Statistics, Springer, vol. 22(3), pages 481-496, September.
    19. Ricardo Fraiman & Graciela Muniz, 2001. "Trimmed means for functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 10(2), pages 419-440, December.
    20. Pigoli, Davide & Sangalli, Laura M., 2012. "Wavelets in functional data analysis: Estimation of multidimensional curves and their derivatives," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1482-1498.
    21. Mizera, Ivan & Volauf, Milos, 2002. "Continuity of Halfspace Depth Contours and Maximum Depth Estimators: Diagnostics of Depth-Related Methods," Journal of Multivariate Analysis, Elsevier, vol. 83(2), pages 365-388, November.
    22. Paindaveine, Davy & Šiman, Miroslav, 2012. "Computing multiple-output regression quantile regions," Computational Statistics & Data Analysis, Elsevier, vol. 56(4), pages 840-853.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2017. "Multivariate and functional classification using depth and distance," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 11(3), pages 445-466, September.
    2. Nagy, Stanislav & Ferraty, Frédéric, 2019. "Data depth for measurable noisy random functions," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 95-114.
    3. Francesca Ieva & Anna Maria Paganoni, 2020. "Component-wise outlier detection methods for robustifying multivariate functional samples," Statistical Papers, Springer, vol. 61(2), pages 595-614, April.
    4. repec:cte:wsrepe:24606 is not listed on IDEAS
    5. Dai, Wenlin & Genton, Marc G., 2019. "Directional outlyingness for multivariate functional data," Computational Statistics & Data Analysis, Elsevier, vol. 131(C), pages 50-65.
    6. Dyckerhoff, Rainer & Mozharovskyi, Pavlo, 2016. "Exact computation of the halfspace depth," Computational Statistics & Data Analysis, Elsevier, vol. 98(C), pages 19-30.
    7. Alba M. Franco-Pereira & Rosa E. Lillo, 2020. "Rank tests for functional data based on the epigraph, the hypograph and associated graphical representations," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 14(3), pages 651-676, September.
    8. repec:cte:wsrepe:ws133329 is not listed on IDEAS
    9. Cleveland, Jason & Zhao, Weilong & Wu, Wei, 2018. "Robust template estimation for functional data with phase variability using band depth," Computational Statistics & Data Analysis, Elsevier, vol. 125(C), pages 10-26.
    10. Fraiman, Ricardo & Pateiro-López, Beatriz, 2012. "Quantiles for finite and infinite dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 108(C), pages 1-14.
    11. Gijbels, Irène & Nagy, Stanislav, 2015. "Consistency of non-integrated depths for functional data," Journal of Multivariate Analysis, Elsevier, vol. 140(C), pages 259-282.
    12. Zhuo Qu & Wenlin Dai & Marc G. Genton, 2021. "Robust functional multivariate analysis of variance with environmental applications," Environmetrics, John Wiley & Sons, Ltd., vol. 32(1), February.
    13. Xiaohui Liu & Karl Mosler & Pavlo Mozharovskyi, 2017. "Fast computation of Tukey trimmed regions and median in dimension p > 2," Working Papers 2017-71, Center for Research in Economics and Statistics.
    14. Kuhnt, Sonja & Rehage, André, 2016. "An angle-based multivariate functional pseudo-depth for shape outlier detection," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 325-340.
    15. Tian, Yahui & Gel, Yulia R., 2019. "Fusing data depth with complex networks: Community detection with prior information," Computational Statistics & Data Analysis, Elsevier, vol. 139(C), pages 99-116.
    16. Oluwasegun Taiwo Ojo & Antonio Fernández Anta & Rosa E. Lillo & Carlo Sguera, 2022. "Detecting and classifying outliers in big functional data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(3), pages 725-760, September.
    17. Elías, Antonio & Jiménez, Raúl & Shang, Han Lin, 2022. "On projection methods for functional time series forecasting," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    18. Nieto-Reyes, Alicia & Battey, Heather, 2021. "A topologically valid construction of depth for functional data," Journal of Multivariate Analysis, Elsevier, vol. 184(C).
    19. repec:cte:wsrepe:ws140101 is not listed on IDEAS
    20. Serfling, Robert & Wijesuriya, Uditha, 2017. "Depth-based nonparametric description of functional data, with emphasis on use of spatial depth," Computational Statistics & Data Analysis, Elsevier, vol. 105(C), pages 24-45.
    21. Han Shang, 2014. "A survey of functional principal component analysis," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 98(2), pages 121-142, April.
    22. Nagy, Stanislav & Gijbels, Irène & Hlubinka, Daniel, 2016. "Weak convergence of discretely observed functional data with applications," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 46-62.
    23. Sara López-Pintado, 2015. "Discussion of Multivariate functional outlier detection by M. Hubert, P. Rousseeuw and P. Segaert," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 253-256, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stmapp:v:24:y:2015:i:2:p:177-202. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.