IDEAS home Printed from https://ideas.repec.org/p/msh/ebswps/2008-9.html
   My bibliography  Save this paper

Rainbow plots, Bagplots and Boxplots for Functional Data

Author

Listed:
  • Rob J. Hyndman
  • Han Lin Shang

Abstract

We propose new tools for visualizing large numbers of functional data in the form of smooth curves or surfaces. The proposed tools include functional versions of the bagplot and boxplot, and make use of the first two robust principal component scores, Tukey's data depth and highest density regions. By-products of our graphical displays are outlier detection methods for functional data. We compare these new outlier detection methods with exiting methods for detecting outliers in functional data and show that our methods are better able to identify the outliers.

Suggested Citation

  • Rob J. Hyndman & Han Lin Shang, 2008. "Rainbow plots, Bagplots and Boxplots for Functional Data," Monash Econometrics and Business Statistics Working Papers 9/08, Monash University, Department of Econometrics and Business Statistics.
  • Handle: RePEc:msh:ebswps:2008-9
    as

    Download full text from publisher

    File URL: http://www.buseco.monash.edu.au/ebs/pubs/wpapers/2008/wp9-08.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Struyf, Anja & Rousseeuw, Peter J., 2000. "High-dimensional computation of the deepest location," Computational Statistics & Data Analysis, Elsevier, vol. 34(4), pages 415-426, October.
    2. Becker, Claudia & Gather, Ursula, 2001. "The largest nonidentifiable outlier: a comparison of multivariate simultaneous outlier identification rules," Computational Statistics & Data Analysis, Elsevier, vol. 36(1), pages 119-127, March.
    3. Hyde, Valerie & Jank, Wolfgang & Shmueli, Galit, 2006. "Investigating Concurrency in Online Auctions Through Visualization," The American Statistician, American Statistical Association, vol. 60, pages 241-250, August.
    4. Manuel Febrero & Pedro Galeano & Wenceslao González-Manteiga, 2007. "A functional analysis of NOx levels: location and scale estimation and outlier detection," Computational Statistics, Springer, vol. 22(3), pages 411-427, September.
    5. Filzmoser, Peter & Maronna, Ricardo & Werner, Mark, 2008. "Outlier identification in high dimensions," Computational Statistics & Data Analysis, Elsevier, vol. 52(3), pages 1694-1711, January.
    6. Ramsay, James O. & Ramsey, James B., 2002. "Functional data analysis of the dynamics of the monthly index of nondurable goods production," Journal of Econometrics, Elsevier, vol. 107(1-2), pages 327-344, March.
    7. Kargin, V. & Onatski, A., 2008. "Curve forecasting by functional autoregression," Journal of Multivariate Analysis, Elsevier, vol. 99(10), pages 2508-2526, November.
    8. Tarn Duong & Martin L. Hazelton, 2005. "Cross‐validation Bandwidth Matrices for Multivariate Kernel Density Estimation," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 32(3), pages 485-506, September.
    9. Ashish Sood & Gareth M. James & Gerard J. Tellis, 2009. "Functional Regression: A New Model for Predicting Market Penetration of New Products," Marketing Science, INFORMS, vol. 28(1), pages 36-51, 01-02.
    10. Reiss, Philip T. & Ogden, R. Todd, 2007. "Functional Principal Component Regression and Functional Partial Least Squares," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 984-996, September.
    11. Hyndman, Rob J. & Shahid Ullah, Md., 2007. "Robust forecasting of mortality and fertility rates: A functional data approach," Computational Statistics & Data Analysis, Elsevier, vol. 51(10), pages 4942-4956, June.
    12. Croux, Christophe & Ruiz-Gazen, Anne, 2005. "High breakdown estimators for principal components: the projection-pursuit approach revisited," Journal of Multivariate Analysis, Elsevier, vol. 95(1), pages 206-226, July.
    13. López Pintado, Sara, 2006. "On the concept of depth for functional data," DES - Working Papers. Statistics and Econometrics. WS ws063012, Universidad Carlos III de Madrid. Departamento de Estadística.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shang, Han Lin & Hyndman, Rob.J., 2011. "Nonparametric time series forecasting with dynamic updating," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 81(7), pages 1310-1324.
    2. Mia Hubert & Peter Rousseeuw & Pieter Segaert, 2015. "Multivariate functional outlier detection," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 177-202, July.
    3. Weiyi Xie & Sebastian Kurtek & Karthik Bharath & Ying Sun, 2017. "A Geometric Approach to Visualization of Variability in Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 979-993, July.
    4. Rob Hyndman & Heather Booth & Farah Yasmeen, 2013. "Coherent Mortality Forecasting: The Product-Ratio Method With Functional Time Series Models," Demography, Springer;Population Association of America (PAA), vol. 50(1), pages 261-283, February.
    5. Barrow, Devon & Kourentzes, Nikolaos, 2018. "The impact of special days in call arrivals forecasting: A neural network approach to modelling special days," European Journal of Operational Research, Elsevier, vol. 264(3), pages 967-977.
    6. Ana Arribas-Gil & Juan Romo, 2015. "Discussion of “Multivariate functional outlier detection”," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 263-267, July.
    7. Zafar, Raja Fawad & Qayyum, Abdul & Ghouri, Saghir Pervaiz, 2015. "Forecasting Inflation using Functional Time Series Analysis," MPRA Paper 67208, University Library of Munich, Germany.
    8. Francesca Ieva & Anna Maria Paganoni, 2020. "Component-wise outlier detection methods for robustifying multivariate functional samples," Statistical Papers, Springer, vol. 61(2), pages 595-614, April.
    9. Graciela Boente & Matías Salibian-Barrera, 2015. "S -Estimators for Functional Principal Component Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1100-1111, September.
    10. Fraiman, Ricardo & Pateiro-López, Beatriz, 2012. "Quantiles for finite and infinite dimensional data," Journal of Multivariate Analysis, Elsevier, vol. 108(C), pages 1-14.
    11. repec:cte:wsrepe:24606 is not listed on IDEAS
    12. Montes, Francisco & Sala, Ramón, 2012. "Equilibrio competitivo en Liga española de futbol de Primera División: Un test de Montecarlo basado en datos funcionales/Competitive Balance in the First Division Spanish Soccer League: A Montecarlo T," Estudios de Economia Aplicada, Estudios de Economia Aplicada, vol. 30, pages 513-526, Agosto.
    13. Yuan Gao & Han Lin Shang, 2017. "Multivariate Functional Time Series Forecasting: Application to Age-Specific Mortality Rates," Risks, MDPI, vol. 5(2), pages 1-18, March.
    14. Han Lin Shang & Yang Yang & Fearghal Kearney, 2019. "Intraday forecasts of a volatility index: functional time series methods with dynamic updating," Annals of Operations Research, Springer, vol. 282(1), pages 331-354, November.
    15. Yuan Yan & Marc Genton, 2015. "Discussion of “Multivariate functional outlier detection” by Mia Hubert, Peter Rousseeuw and Pieter Segaert," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 24(2), pages 245-251, July.
    16. Han Shang, 2014. "A survey of functional principal component analysis," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 98(2), pages 121-142, April.
    17. Farah Yasmeen & Rob J Hyndman & Bircan Erbas, 2010. "Forecasting age-related changes in breast cancer mortality among white and black US women: A functional approach," Monash Econometrics and Business Statistics Working Papers 9/10, Monash University, Department of Econometrics and Business Statistics.
    18. Epifanio, Irene & Ventura-Campos, Noelia, 2011. "Functional data analysis in shape analysis," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2758-2773, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Graciela Boente & Matías Salibian-Barrera, 2015. "S -Estimators for Functional Principal Component Analysis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1100-1111, September.
    2. Han Shang, 2014. "A survey of functional principal component analysis," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 98(2), pages 121-142, April.
    3. Goia, Aldo & May, Caterina & Fusai, Gianluca, 2010. "Functional clustering and linear regression for peak load forecasting," International Journal of Forecasting, Elsevier, vol. 26(4), pages 700-711, October.
    4. Montes, Francisco & Sala, Ramón, 2012. "Equilibrio competitivo en Liga española de futbol de Primera División: Un test de Montecarlo basado en datos funcionales/Competitive Balance in the First Division Spanish Soccer League: A Montecarlo T," Estudios de Economia Aplicada, Estudios de Economia Aplicada, vol. 30, pages 513-526, Agosto.
    5. Bali, Juan Lucas & Boente, Graciela, 2015. "Influence function of projection-pursuit principal components for functional data," Journal of Multivariate Analysis, Elsevier, vol. 133(C), pages 173-199.
    6. Boente, Graciela & Parada, Daniela, 2023. "Robust estimation for functional quadratic regression models," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    7. Haixu Wang & Jiguo Cao, 2023. "Nonlinear prediction of functional time series," Environmetrics, John Wiley & Sons, Ltd., vol. 34(5), August.
    8. Atefeh Zamani & Hossein Haghbin & Maryam Hashemi & Rob J. Hyndman, 2022. "Seasonal functional autoregressive models," Journal of Time Series Analysis, Wiley Blackwell, vol. 43(2), pages 197-218, March.
    9. Bali, Juan Lucas & Boente, Graciela, 2014. "Consistency of a numerical approximation to the first principal component projection pursuit estimator," Statistics & Probability Letters, Elsevier, vol. 94(C), pages 181-191.
    10. Shang, Han Lin & Hyndman, Rob.J., 2011. "Nonparametric time series forecasting with dynamic updating," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 81(7), pages 1310-1324.
    11. P. Navarro-Esteban & J. A. Cuesta-Albertos, 2021. "High-dimensional outlier detection using random projections," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(4), pages 908-934, December.
    12. Canale, Antonio & Vantini, Simone, 2016. "Constrained functional time series: Applications to the Italian gas market," International Journal of Forecasting, Elsevier, vol. 32(4), pages 1340-1351.
    13. Graciela Boente & Matías Salibián-Barrera, 2021. "Robust functional principal components for sparse longitudinal data," METRON, Springer;Sapienza Università di Roma, vol. 79(2), pages 159-188, August.
    14. Boente, Graciela & Pires, Ana M. & Rodrigues, Isabel M., 2010. "Detecting influential observations in principal components and common principal components," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 2967-2975, December.
    15. Alexander Aue & Diogo Dubart Norinho & Siegfried Hörmann, 2015. "On the Prediction of Stationary Functional Time Series," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 378-392, March.
    16. Laha, A. K. & Rathi, Poonam, 2017. "New Approaches to Prediction using Functional Data Analysis," IIMA Working Papers WP 2017-08-02, Indian Institute of Management Ahmedabad, Research and Publication Department.
    17. Sven Otto & Nazarii Salish, 2022. "Approximate Factor Models for Functional Time Series," Papers 2201.02532, arXiv.org, revised May 2024.
    18. Philip Nadler & Alessio Sancetta, 2023. "Empirical Asset Pricing with Functional Factors," Journal of Financial Econometrics, Oxford University Press, vol. 21(4), pages 1258-1281.
    19. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    20. Pallavi Sawant & Nedret Billor & Hyejin Shin, 2012. "Functional outlier detection with robust functional principal component analysis," Computational Statistics, Springer, vol. 27(1), pages 83-102, March.

    More about this item

    Keywords

    Highest density regions; Robust principal component analysis; Kernel density estimation; Outlier detection; Tukey's halfspace depth;
    All these keywords.

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C80 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:msh:ebswps:2008-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Professor Xibin Zhang (email available below). General contact details of provider: https://edirc.repec.org/data/dxmonau.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.