IDEAS home Printed from https://ideas.repec.org/p/mol/ecsdps/esdp03013.html
   My bibliography  Save this paper

A least squares approach to Principal Component Analysis for interval valued data

Author

Listed:
  • D'Urso, Pierpaolo
  • Giordani, Paolo

Abstract

Principal Component Analysis (PCA) is a well known technique the aim of which is to synthesize huge amounts of numerical data by means of a low number of unobserved variables, called components. In this paper, an extension of PCA to deal with interval valued data is proposed. The method, called Midpoint Radius Principal Component Analysis (MR-PCA) recovers the underlying structure of interval valued data by using both the midpoints (or centers) and the radii (a measure of the interval width) information. In order to analyze how MR-PCA works, the results of a simulation study and two applications on chemical data are proposed.

Suggested Citation

  • D'Urso, Pierpaolo & Giordani, Paolo, 2003. "A least squares approach to Principal Component Analysis for interval valued data," Economics & Statistics Discussion Papers esdp03013, University of Molise, Department of Economics.
  • Handle: RePEc:mol:ecsdps:esdp03013
    as

    Download full text from publisher

    File URL: http://web.unimol.it/progetti/repec/mol/ecsdps/ESDP03013.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Timmerman, Marieke E. & Kiers, Henk A. L., 2002. "Three-way component analysis with smoothness constraints," Computational Statistics & Data Analysis, Elsevier, vol. 40(3), pages 447-470, September.
    2. Roger Millsap & William Meredith, 1988. "Component analysis in cross-sectional and longitudinal data," Psychometrika, Springer;The Psychometric Society, vol. 53(1), pages 123-134, March.
    3. D'Urso, Pierpaolo & Gastaldi, Tommaso, 2000. "A least-squares approach to fuzzy linear regression analysis," Computational Statistics & Data Analysis, Elsevier, vol. 34(4), pages 427-440, October.
    4. Henk Kiers & Jos Berge, 1989. "Alternating least squares algorithms for simultaneous components analysis with equal component weight matrices in two or more populations," Psychometrika, Springer;The Psychometric Society, vol. 54(3), pages 467-473, September.
    5. Giordani, Paolo & Kiers, Henk A. L., 2004. "Principal Component Analysis of symmetric fuzzy data," Computational Statistics & Data Analysis, Elsevier, vol. 45(3), pages 519-548, April.
    6. Ledyard Tucker, 1966. "Some mathematical notes on three-mode factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 31(3), pages 279-311, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Pierpaolo D'Urso & Paolo Giordani, 2006. "A robust fuzzy k-means clustering model for interval valued data," Computational Statistics, Springer, vol. 21(2), pages 251-269, June.
    2. Antonio Irpino & Valentino Tontodonato, 2006. "Clustering reduced interval data using Hausdorff distance," Computational Statistics, Springer, vol. 21(2), pages 271-288, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rizzi, Alfredo & Vichi, Maurizio, 1995. "Representation, synthesis, variability and data preprocessing of a three-way data set," Computational Statistics & Data Analysis, Elsevier, vol. 19(2), pages 203-222, February.
    2. Giordani, Paolo, 2010. "Three-way analysis of imprecise data," Journal of Multivariate Analysis, Elsevier, vol. 101(3), pages 568-582, March.
    3. Takane, Yoshio & Hwang, Heungsun, 2005. "An extended redundancy analysis and its applications to two practical examples," Computational Statistics & Data Analysis, Elsevier, vol. 49(3), pages 785-808, June.
    4. Michel Velden & Tammo Bijmolt, 2006. "Generalized canonical correlation analysis of matrices with missing rows: a simulation study," Psychometrika, Springer;The Psychometric Society, vol. 71(2), pages 323-331, June.
    5. Tom Wilderjans & E. Ceulemans & I. Mechelen, 2012. "The SIMCLAS Model: Simultaneous Analysis of Coupled Binary Data Matrices with Noise Heterogeneity Between and Within Data Blocks," Psychometrika, Springer;The Psychometric Society, vol. 77(4), pages 724-740, October.
    6. Ji Yeh Choi & Heungsun Hwang & Marieke E. Timmerman, 2018. "Functional Parallel Factor Analysis for Functions of One- and Two-dimensional Arguments," Psychometrika, Springer;The Psychometric Society, vol. 83(1), pages 1-20, March.
    7. Kohei Adachi, 2013. "Generalized joint Procrustes analysis," Computational Statistics, Springer, vol. 28(6), pages 2449-2464, December.
    8. Coppi, Renato & D'Urso, Pierpaolo & Giordani, Paolo & Santoro, Adriana, 2006. "Least squares estimation of a linear regression model with LR fuzzy response," Computational Statistics & Data Analysis, Elsevier, vol. 51(1), pages 267-286, November.
    9. Mariela González-Narváez & María José Fernández-Gómez & Susana Mendes & José-Luis Molina & Omar Ruiz-Barzola & Purificación Galindo-Villardón, 2021. "Study of Temporal Variations in Species–Environment Association through an Innovative Multivariate Method: MixSTATICO," Sustainability, MDPI, vol. 13(11), pages 1-25, May.
    10. Henk Kiers, 1995. "Maximization of sums of quotients of quadratic forms and some generalizations," Psychometrika, Springer;The Psychometric Society, vol. 60(2), pages 221-245, June.
    11. Meyners, Michael & Qannari, El Mostafa, 2001. "Relating principal component analysis on merged data sets to a regression approach," Technical Reports 2001,47, Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen.
    12. Yuefeng Han & Rong Chen & Dan Yang & Cun-Hui Zhang, 2020. "Tensor Factor Model Estimation by Iterative Projection," Papers 2006.02611, arXiv.org, revised Jul 2024.
    13. DELL'ANNO, Roberto & VILLA, Stefania, 2012. "Growth in Transition Countries: Big Bang versus Gradualism," CELPE Discussion Papers 122, CELPE - CEnter for Labor and Political Economics, University of Salerno, Italy.
    14. Henk Kiers, 1991. "Hierarchical relations among three-way methods," Psychometrika, Springer;The Psychometric Society, vol. 56(3), pages 449-470, September.
    15. Willem Kloot & Pieter Kroonenberg, 1985. "External analysis with three-mode principal component models," Psychometrika, Springer;The Psychometric Society, vol. 50(4), pages 479-494, December.
    16. Pieter M. Kroonenberg & Cornelis J. Lammers & Ineke Stoop, 1985. "Three-Mode Principal Component Analysis of Multivariate Longitudinal Organizational Data," Sociological Methods & Research, , vol. 14(2), pages 99-136, November.
    17. Elisa Frutos-Bernal & Ángel Martín del Rey & Irene Mariñas-Collado & María Teresa Santos-Martín, 2022. "An Analysis of Travel Patterns in Barcelona Metro Using Tucker3 Decomposition," Mathematics, MDPI, vol. 10(7), pages 1-17, March.
    18. Xinhai Liu & Wolfgang Glänzel & Bart De Moor, 2011. "Hybrid clustering of multi-view data via Tucker-2 model and its application," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(3), pages 819-839, September.
    19. Yoshio Takane & Forrest Young & Jan Leeuw, 1977. "Nonmetric individual differences multidimensional scaling: An alternating least squares method with optimal scaling features," Psychometrika, Springer;The Psychometric Society, vol. 42(1), pages 7-67, March.
    20. J. Le-Rademacher & L. Billard, 2013. "Principal component histograms from interval-valued observations," Computational Statistics, Springer, vol. 28(5), pages 2117-2138, October.

    More about this item

    Keywords

    Principal Component Analysis; Least squares approach; Interval valued data; Chemical data;
    All these keywords.

    JEL classification:

    • C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:mol:ecsdps:esdp03013. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Claudio Lupi (email available below). General contact details of provider: https://edirc.repec.org/data/dsmolit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.