
Rainbow plots, Bagplots and Boxplots for Functional Data

Authors

  • Rob J. Hyndman
  • Han Lin Shang

Abstract

We propose new tools for visualizing large numbers of functional data in the form of smooth curves or surfaces. The proposed tools include functional versions of the bagplot and boxplot, and make use of the first two robust principal component scores, Tukey's data depth and highest density regions. By-products of our graphical displays are outlier detection methods for functional data. We compare these new outlier detection methods with existing methods for detecting outliers in functional data and show that our methods are better able to identify the outliers.
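
The depth ordering mentioned in the abstract can be illustrated with a minimal sketch. It assumes each curve has already been reduced to a pair of principal component scores and approximates Tukey's halfspace depth of each pair by scanning a finite set of directions. The function name `halfspace_depth`, the `scores` values, and the number of directions are all assumptions made for illustration; none of them come from the paper, and this is not the authors' implementation.

```python
import math

def halfspace_depth(point, cloud, n_directions=360, tol=1e-12):
    """Approximate Tukey halfspace depth of a 2D point within a point cloud.

    For each scanned direction, count the cloud points lying in the closed
    halfplane through `point` facing that direction; the depth is the
    smallest such fraction. Scanning a finite set of directions can only
    overestimate the exact depth.
    """
    px, py = point
    smallest = len(cloud)
    for k in range(n_directions):
        angle = 2.0 * math.pi * k / n_directions
        ux, uy = math.cos(angle), math.sin(angle)
        inside = sum(
            1 for x, y in cloud
            if (x - px) * ux + (y - py) * uy >= -tol
        )
        smallest = min(smallest, inside)
    return smallest / len(cloud)

# Hypothetical (PC1, PC2) scores, one pair per curve; not data from the paper.
scores = [(-1.0, -1.0), (-1.0, 1.0), (1.0, -1.0), (1.0, 1.0),
          (0.0, 0.0), (0.5, 0.2), (-0.3, 0.4), (6.0, 6.0)]

depths = {p: halfspace_depth(p, scores) for p in scores}
deepest = max(depths, key=depths.get)  # plays the role of the bivariate median

# List the score pairs from deepest to shallowest.
for p in sorted(scores, key=depths.get, reverse=True):
    print(p, round(depths[p], 3))
```

In this toy cloud the distant pair (6.0, 6.0) receives the minimum possible depth of 1/8, but so can any other point on the convex hull. Depth alone therefore orders the curves from the centre outward without separating outliers; a bagplot additionally builds a bag around the deepest points and inflates it into a fence, which this sketch does not attempt.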

Suggested Citation

  • Rob J. Hyndman & Han Lin Shang, 2008. "Rainbow plots, Bagplots and Boxplots for Functional Data," Monash Econometrics and Business Statistics Working Papers 9/08, Monash University, Department of Econometrics and Business Statistics.
  • Handle: RePEc:msh:ebswps:2008-9

    Download full text from publisher

    File URL: http://www.buseco.monash.edu.au/ebs/pubs/wpapers/2008/wp9-08.pdf
    Download Restriction: no

    Citations

    Cited by:

    1. Shang, Han Lin & Hyndman, Rob.J., 2011. "Nonparametric time series forecasting with dynamic updating," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 81(7), pages 1310-1324.
    2. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2015. "Multivariate functional outlier detection," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 177-202, July.
    3. Weiyi Xie & Sebastian Kurtek & Karthik Bharath & Ying Sun, 2017. "A Geometric Approach to Visualization of Variability in Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 979-993, July.
    4. Rob Hyndman & Heather Booth & Farah Yasmeen, 2013. "Coherent Mortality Forecasting: The Product-Ratio Method With Functional Time Series Models," Demography, Springer;Population Association of America (PAA), vol. 50(1), pages 261-283, February.
    5. Barrow, Devon & Kourentzes, Nikolaos, 2018. "The impact of special days in call arrivals forecasting: A neural network approach to modelling special days," European Journal of Operational Research, Elsevier, vol. 264(3), pages 967-977.
    6. Francesca Ieva & Anna Maria Paganoni, 2020. "Component-wise outlier detection methods for robustifying multivariate functional samples," Statistical Papers, Springer, vol. 61(2), pages 595-614, April.
    7. repec:cte:wsrepe:24606 is not listed on IDEAS
    8. Han Lin Shang & Yang Yang & Fearghal Kearney, 2019. "Intraday forecasts of a volatility index: functional time series methods with dynamic updating," Annals of Operations Research, Springer, vol. 282(1), pages 331-354, November.
    9. Farah Yasmeen & Rob J Hyndman & Bircan Erbas, 2010. "Forecasting age-related changes in breast cancer mortality among white and black US women: A functional approach," Monash Econometrics and Business Statistics Working Papers 9/10, Monash University, Department of Econometrics and Business Statistics.
    10. Han Shang, 2014. "A survey of functional principal component analysis," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 98(2), pages 121-142, April.
    11. Epifanio, Irene & Ventura-Campos, Noelia, 2011. "Functional data analysis in shape analysis," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2758-2773, September.
    12. Ana Arribas-Gil & Juan Romo, 2015. "Discussion of “Multivariate functional outlier detection”," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 263-267, July.
    13. Zafar, Raja Fawad & Qayyum, Abdul & Ghouri, Saghir Pervaiz, 2015. "Forecasting Inflation using Functional Time Series Analysis," MPRA Paper 67208, University Library of Munich, Germany.
    14. Graciela Boente & Matías Salibian-Barrera, 2015. "S -Estimators for Functional Principal Component Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1100-1111, September.
    15. Fraiman, Ricardo & Pateiro-López, Beatriz, 2012. "Quantiles for finite and infinite dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 108(C), pages 1-14.
    16. Montes, Francisco & Sala, Ramón, 2012. "Equilibrio competitivo en Liga española de futbol de Primera División: Un test de Montecarlo basado en datos funcionales/Competitive Balance in the First Division Spanish Soccer League: A Montecarlo T," Estudios de Economia Aplicada, Estudios de Economia Aplicada, vol. 30, pages 513-526, Agosto.
    17. Yuan Gao & Han Lin Shang, 2017. "Multivariate Functional Time Series Forecasting: Application to Age-Specific Mortality Rates," Risks, MDPI, vol. 5(2), pages 1-18, March.
    18. Yuan Yan & Marc Genton, 2015. "Discussion of “Multivariate functional outlier detection” by Mia Hubert, Peter Rousseeuw and Pieter Segaert," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 245-251, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Graciela Boente & Matías Salibian-Barrera, 2015. "S -Estimators for Functional Principal Component Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1100-1111, September.
    2. Han Shang, 2014. "A survey of functional principal component analysis," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 98(2), pages 121-142, April.
    3. Montes, Francisco & Sala, Ramón, 2012. "Equilibrio competitivo en Liga española de futbol de Primera División: Un test de Montecarlo basado en datos funcionales/Competitive Balance in the First Division Spanish Soccer League: A Montecarlo T," Estudios de Economia Aplicada, Estudios de Economia Aplicada, vol. 30, pages 513-526, Agosto.
    4. Goia, Aldo & May, Caterina & Fusai, Gianluca, 2010. "Functional clustering and linear regression for peak load forecasting," International Journal of Forecasting, Elsevier, vol. 26(4), pages 700-711, October.
    5. Haixu Wang & Jiguo Cao, 2023. "Nonlinear prediction of functional time series," Environmetrics, John Wiley & Sons, Ltd., vol. 34(5), August.
    6. Atefeh Zamani & Hossein Haghbin & Maryam Hashemi & Rob J. Hyndman, 2022. "Seasonal functional autoregressive models," Journal of Time Series Analysis, Wiley Blackwell, vol. 43(2), pages 197-218, March.
    7. P. Navarro-Esteban & J. A. Cuesta-Albertos, 2021. "High-dimensional outlier detection using random projections," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(4), pages 908-934, December.
    8. Boente, Graciela & Pires, Ana M. & Rodrigues, Isabel M., 2010. "Detecting influential observations in principal components and common principal components," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 2967-2975, December.
    9. Laha, A. K. & Rathi, Poonam, 2017. "New Approaches to Prediction using Functional Data Analysis," IIMA Working Papers WP 2017-08-02, Indian Institute of Management Ahmedabad, Research and Publication Department.
    10. Fraiman, Ricardo & Pateiro-López, Beatriz, 2012. "Quantiles for finite and infinite dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 108(C), pages 1-14.
    11. Kalogridis, Ioannis & Van Aelst, Stefan, 2019. "Robust functional regression based on principal components," Journal of Multivariate Analysis, Elsevier, vol. 173(C), pages 393-415.
    12. Hyndman, Rob J. & Shahid Ullah, Md., 2007. "Robust forecasting of mortality and fertility rates: A functional data approach," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4942-4956, June.
    13. Hervé Cardot & Antoine Godichon-Baggioni, 2017. "Fast estimation of the median covariation matrix with application to online robust principal components analysis," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(3), pages 461-480, September.
    14. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    15. Bali, Juan Lucas & Boente, Graciela, 2017. "Robust estimators under a functional common principal components model," Computational Statistics & Data Analysis, Elsevier, vol. 113(C), pages 424-440.
    16. Shang, Han Lin, 2017. "Functional time series forecasting with dynamic updating: An application to intraday particulate matter concentration," Econometrics and Statistics, Elsevier, vol. 1(C), pages 184-200.
    17. Bali, Juan Lucas & Boente, Graciela, 2015. "Influence function of projection-pursuit principal components for functional data," Journal of Multivariate Analysis, Elsevier, vol. 133(C), pages 173-199.
    18. Boente, Graciela & Parada, Daniela, 2023. "Robust estimation for functional quadratic regression models," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    19. Bali, Juan Lucas & Boente, Graciela, 2014. "Consistency of a numerical approximation to the first principal component projection pursuit estimator," Statistics & Probability Letters, Elsevier, vol. 94(C), pages 181-191.
    20. Shang, Han Lin & Hyndman, Rob.J., 2011. "Nonparametric time series forecasting with dynamic updating," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 81(7), pages 1310-1324.

    More about this item

    Keywords

    Highest density regions; Robust principal component analysis; Kernel density estimation; Outlier detection; Tukey's halfspace depth

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C80 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - General

