IDEAS home Printed from https://ideas.repec.org/a/taf/jnlasa/v112y2017i519p979-993.html
   My bibliography  Save this article

A Geometric Approach to Visualization of Variability in Functional Data

Author

Listed:
  • Weiyi Xie
  • Sebastian Kurtek
  • Karthik Bharath
  • Ying Sun

Abstract

We propose a new method for the construction and visualization of boxplot-type displays for functional data. We use a recent functional data analysis framework, based on a representation of functions called square-root slope functions, to decompose observed variation in functional data into three main components: amplitude, phase, and vertical translation. We then construct separate displays for each component, using the geometry and metric of each representation space, based on a novel definition of the median, the two quartiles, and extreme observations. The outlyingness of functional data is a very complex concept. Thus, we propose to identify outliers based on any of the three main components after decomposition. We provide a variety of visualization tools for the proposed boxplot-type displays including surface plots. We evaluate the proposed method using extensive simulations and then focus our attention on three real data applications including exploratory data analysis of sea surface temperature functions, electrocardiogram functions, and growth curves. Supplementary materials for this article are available online.

Suggested Citation

  • Weiyi Xie & Sebastian Kurtek & Karthik Bharath & Ying Sun, 2017. "A Geometric Approach to Visualization of Variability in Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 979-993, July.
  • Handle: RePEc:taf:jnlasa:v:112:y:2017:i:519:p:979-993
    DOI: 10.1080/01621459.2016.1256813
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/01621459.2016.1256813
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/01621459.2016.1256813?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Tucker, J. Derek & Wu, Wei & Srivastava, Anuj, 2013. "Generative models for functional data using phase and amplitude separation," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 50-66.
    2. Gerda Claeskens & Mia Hubert & Leen Slaets & Kaveh Vakili, 2014. "Multivariate Functional Halfspace Depth," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 411-423, March.
    3. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2015. "Multivariate functional outlier detection," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 177-202, July.
    4. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2015. "Rejoinder to ‘multivariate functional outlier detection’," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 269-277, July.
    5. Ricardo Fraiman & Jean Meloche & Luis García-Escudero & Alfonso Gordaliza & Xuming He & Ricardo Maronna & Víctor Yohai & Simon Sheather & Joseph McKean & Christopher Small & Andrew Wood & R. Fraiman &, 1999. "Multivariate L-estimation," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 8(2), pages 255-317, December.
    6. Rob J. Hyndman & Han Lin Shang, 2008. "Rainbow plots, Bagplots and Boxplots for Functional Data," Monash Econometrics and Business Statistics Working Papers 9/08, Monash University, Department of Econometrics and Business Statistics.
    7. Rong Tang & Hans-Georg Müller, 2008. "Pairwise curve synchronization for functional data," Biometrika, Biometrika Trust, vol. 95(4), pages 875-889.
    8. Kneip, Alois & Ramsay, James O, 2008. "Combining Registration and Fitting for Functional Models," Journal of the American Statistical Association, American Statistical Association, vol. 103(483), pages 1155-1165.
    9. Sara López-Pintado & Ying Sun & Juan Lin & Marc Genton, 2014. "Simplicial band depth for multivariate functional data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 321-338, September.
    10. Manuel Febrero & Pedro Galeano & Wenceslao González-Manteiga, 2007. "A functional analysis of NOx levels: location and scale estimation and outlier detection," Computational Statistics, Springer, vol. 22(3), pages 411-427, September.
    11. Oja, Hannu, 1983. "Descriptive statistics for multivariate distributions," Statistics & Probability Letters, Elsevier, vol. 1(6), pages 327-332, October.
    12. Ricardo Fraiman & Graciela Muniz, 2001. "Trimmed means for functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 10(2), pages 419-440, December.
    13. Fraiman, Ricardo & Pateiro-López, Beatriz, 2012. "Quantiles for finite and infinite dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 108(C), pages 1-14.
    14. Sebastian Kurtek & Wei Wu & Gary E. Christensen & Anuj Srivastava, 2013. "Segmentation, alignment and statistical analysis of biosignals with application to disease classification," Journal of Applied Statistics, Taylor & Francis Journals, vol. 40(6), pages 1270-1288, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Moritz Herrmann & Fabian Scheipl, 2021. "A Geometric Perspective on Functional Outlier Detection," Stats, MDPI, vol. 4(4), pages 1-41, November.
    2. Marc G. Genton & Ying Sun, 2019. "Comments on: Data science, big data and statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(2), pages 338-341, June.
    3. Dai, Wenlin & Mrkvička, Tomáš & Sun, Ying & Genton, Marc G., 2020. "Functional outlier detection and taxonomy by sequential transformations," Computational Statistics & Data Analysis, Elsevier, vol. 149(C).
    4. Cleveland, Jason & Zhao, Weilong & Wu, Wei, 2018. "Robust template estimation for functional data with phase variability using band depth," Computational Statistics & Data Analysis, Elsevier, vol. 125(C), pages 10-26.
    5. Jorge R. Sosa Donoso & Miguel Flores & Salvador Naya & Javier Tarrío-Saavedra, 2023. "Local Correlation Integral Approach for Anomaly Detection Using Functional Data," Mathematics, MDPI, vol. 11(4), pages 1-18, February.
    6. Ma, Yijia & Zhou, Xinyu & Wu, Wei, 2024. "A stochastic process representation for time warping functions," Computational Statistics & Data Analysis, Elsevier, vol. 194(C).
    7. Derek Tucker, J. & Shand, Lyndsay & Chowdhary, Kenny, 2021. "Multimodal Bayesian registration of noisy functions using Hamiltonian Monte Carlo," Computational Statistics & Data Analysis, Elsevier, vol. 163(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Francesca Ieva & Anna Maria Paganoni, 2020. "Component-wise outlier detection methods for robustifying multivariate functional samples," Statistical Papers, Springer, vol. 61(2), pages 595-614, April.
    2. Kuhnt, Sonja & Rehage, André, 2016. "An angle-based multivariate functional pseudo-depth for shape outlier detection," Journal of Multivariate Analysis, Elsevier, vol. 146(C), pages 325-340.
    3. Dai, Wenlin & Genton, Marc G., 2019. "Directional outlyingness for multivariate functional data," Computational Statistics & Data Analysis, Elsevier, vol. 131(C), pages 50-65.
    4. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2015. "Multivariate functional outlier detection," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 177-202, July.
    5. Francesca Ieva & Anna Paganoni, 2015. "Discussion of “multivariate functional outlier detection” by M. Hubert, P. Rousseeuw and P. Segaert," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 217-221, July.
    6. Dai, Wenlin & Mrkvička, Tomáš & Sun, Ying & Genton, Marc G., 2020. "Functional outlier detection and taxonomy by sequential transformations," Computational Statistics & Data Analysis, Elsevier, vol. 149(C).
    7. Cleveland, Jason & Zhao, Weilong & Wu, Wei, 2018. "Robust template estimation for functional data with phase variability using band depth," Computational Statistics & Data Analysis, Elsevier, vol. 125(C), pages 10-26.
    8. Qiu, Zhiping & Fan, Jiangyuan & Zhang, Jin-Ting & Chen, Jianwei, 2024. "Tests for equality of several covariance matrix functions for multivariate functional data," Journal of Multivariate Analysis, Elsevier, vol. 199(C).
    9. Gijbels, Irène & Nagy, Stanislav, 2015. "Consistency of non-integrated depths for functional data," Journal of Multivariate Analysis, Elsevier, vol. 140(C), pages 259-282.
    10. Zhuo Qu & Wenlin Dai & Marc G. Genton, 2021. "Robust functional multivariate analysis of variance with environmental applications," Environmetrics, John Wiley & Sons, Ltd., vol. 32(1), February.
    11. Boente, Graciela & Parada, Daniela, 2023. "Robust estimation for functional quadratic regression models," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    12. Carlo Sguera & Sara López-Pintado, 2021. "A notion of depth for sparse functional data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(3), pages 630-649, September.
    13. Graciela Boente & Matías Salibián-Barrera, 2021. "Robust functional principal components for sparse longitudinal data," METRON, Springer;Sapienza Università di Roma, vol. 79(2), pages 159-188, August.
    14. Qiu, Zhiping & Chen, Jianwei & Zhang, Jin-Ting, 2021. "Two-sample tests for multivariate functional data with applications," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    15. Zhu, Tianming & Zhang, Jin-Ting & Cheng, Ming-Yen, 2022. "One-way MANOVA for functional data via Lawley–Hotelling trace test," Journal of Multivariate Analysis, Elsevier, vol. 192(C).
    16. Oluwasegun Taiwo Ojo & Antonio Fernández Anta & Rosa E. Lillo & Carlo Sguera, 2022. "Detecting and classifying outliers in big functional data," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(3), pages 725-760, September.
    17. Nagy, Stanislav & Ferraty, Frédéric, 2019. "Data depth for measurable noisy random functions," Journal of Multivariate Analysis, Elsevier, vol. 170(C), pages 95-114.
    18. P. Navarro-Esteban & J. A. Cuesta-Albertos, 2021. "High-dimensional outlier detection using random projections," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(4), pages 908-934, December.
    19. Ana Arribas-Gil & Juan Romo, 2015. "Discussion of “Multivariate functional outlier detection”," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 263-267, July.
    20. Fraiman, Ricardo & Pateiro-López, Beatriz, 2012. "Quantiles for finite and infinite dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 108(C), pages 1-14.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:jnlasa:v:112:y:2017:i:519:p:979-993. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/UASA20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.