IDEAS home Printed from https://ideas.repec.org/p/hal/wpaper/hal-02340386.html
   My bibliography  Save this paper

Principal Component Analysis : A Generalized Gini Approach

Author

Listed:
  • Arthur Charpentier

    (CREM - Centre de recherche en économie et management - UNICAEN - Université de Caen Normandie - NU - Normandie Université - UR - Université de Rennes - CNRS - Centre National de la Recherche Scientifique)

  • Stéphane Mussard

    (CHROME - Détection, évaluation, gestion des risques CHROniques et éMErgents (CHROME) / Université de Nîmes - UNIMES - Université de Nîmes)

  • Tea Ouraga

    (CHROME - Détection, évaluation, gestion des risques CHROniques et éMErgents (CHROME) / Université de Nîmes - UNIMES - Université de Nîmes)

Abstract

A principal component analysis based on the generalized Gini correlation index is proposed (Gini PCA). The Gini PCA generalizes the standard PCA based on the variance. It is shown, in the Gaussian case, that the standard PCA is equivalent to the Gini PCA. It is also proven that the dimensionality reduction based on the generalized Gini correlation matrix, that relies on city-block distances, is robust to outliers. Monte Carlo simulations and an application on cars data (with outliers) show the robustness of the Gini PCA and provide different interpretations of the results compared with the variance PCA.
(This abstract was borrowed from another version of this item.)

Suggested Citation

  • Arthur Charpentier & Stéphane Mussard & Tea Ouraga, 2019. "Principal Component Analysis : A Generalized Gini Approach," Working Papers hal-02340386, HAL.
  • Handle: RePEc:hal:wpaper:hal-02340386
    Note: View the original document on HAL open archive server: https://hal.science/hal-02340386
    as

    Download full text from publisher

    File URL: https://hal.science/hal-02340386/document
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Yaari, Menahem E, 1987. "The Dual Theory of Choice under Risk," Econometrica, Econometric Society, vol. 55(1), pages 95-115, January.
    2. Thibault Gajdos & John Weymark, 2005. "Multidimensional generalized Gini indices," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 26(3), pages 471-496, October.
    3. Brooks, J.P. & Dulá, J.H. & Boone, E.L., 2013. "A pure L1-norm principal component analysis," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 83-98.
    4. Laurini, Márcio Poletti & Ohashi, Alberto, 2015. "A noisy principal component analysis for forward rate curves," European Journal of Operational Research, Elsevier, vol. 246(1), pages 140-153.
    5. Charpentier, Arthur & Segers, Johan, 2009. "Tails of multivariate Archimedean copulas," Journal of Multivariate Analysis, Elsevier, vol. 100(7), pages 1521-1537, August.
    6. List, C., 1999. "Multidimensional Inequality Measurement: a Proposal," Economics Papers 9927, Economics Group, Nuffield College, University of Oxford.
    7. Hofert, Marius, 2011. "Efficiently sampling nested Archimedean copulas," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 57-70, January.
    8. Furman, Edward & Zitikis, Ričardas, 2017. "Beyond The Pearson Correlation: Heavy-Tailed Risks, Weighted Gini Correlations, And A Gini-Type Weighted Insurance Pricing Model," ASTIN Bulletin, Cambridge University Press, vol. 47(3), pages 919-942, September.
    9. Koen Decancq & María Ana Lugo, 2013. "Weights in Multidimensional Indices of Wellbeing: An Overview," Econometric Reviews, Taylor & Francis Journals, vol. 32(1), pages 7-34, January.
    10. Yaari, Menahem E., 1988. "A controversial proposal concerning inequality measurement," Journal of Economic Theory, Elsevier, vol. 44(2), pages 381-397, April.
    11. Giovanni Giorgi, 2013. "Back to the future: some considerations on Shlomo Yitzhaki and Edna Schechtman’s book “The Gini Methodology: A Primer on a Statistical Methodology”," METRON, Springer;Sapienza Università di Roma, vol. 71(2), pages 189-195, September.
    12. Yitzhaki, Shlomo, 1991. "Calculating Jackknife Variance Estimators for Parameters of the Gini Method," Journal of Business & Economic Statistics, American Statistical Association, vol. 9(2), pages 235-239, April.
    13. Marcel Carcea & Robert Serfling, 2015. "A Gini Autocovariance Function for Time Series Modelling," Journal of Time Series Analysis, Wiley Blackwell, vol. 36(6), pages 817-838, November.
    14. Korhonen, Pekka & Siljamaki, Aapo, 1998. "Ordinal principal component analysis theory and an application," Computational Statistics & Data Analysis, Elsevier, vol. 26(4), pages 411-424, February.
    15. Shlomo Yitzhaki, 2003. "Gini’s Mean difference: a superior measure of variability for non-normal distributions," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(2), pages 285-316.
    16. Carl Eckart & Gale Young, 1936. "The approximation of one matrix by another of lower rank," Psychometrika, Springer;The Psychometric Society, vol. 1(3), pages 211-218, September.
    17. Banerjee, Asis Kumar, 2010. "A multidimensional Gini index," Mathematical Social Sciences, Elsevier, vol. 60(2), pages 87-93, September.
    18. E. Schechtman & S. Yitzhaki, 2003. "A Family of Correlation Coefficients Based on the Extended Gini Index," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 1(2), pages 129-146, August.
    19. Shelef, Amit, 2016. "A Gini-based unit root test," Computational Statistics & Data Analysis, Elsevier, vol. 100(C), pages 763-772.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Charles Condevaux & Stéphane Mussard & Téa Ouraga & Guillaume Zambrano, 2020. "Generalized Gini linear and quadratic discriminant analyses," METRON, Springer;Sapienza Università di Roma, vol. 78(2), pages 219-236, August.
    2. Vasile Preda & Luigi-Ionut Catana, 2021. "Tsallis Log-Scale-Location Models. Moments, Gini Index and Some Stochastic Orders," Mathematics, MDPI, vol. 9(11), pages 1-22, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Charles Condevaux & Stéphane Mussard & Téa Ouraga & Guillaume Zambrano, 2020. "Generalized Gini linear and quadratic discriminant analyses," METRON, Springer;Sapienza Università di Roma, vol. 78(2), pages 219-236, August.
    2. Koen Decancq & María Ana Lugo, 2009. "Measuring inequality of well-being with a correlation-sensitive multidimensional Gini index," Working Papers 124, ECINEQ, Society for the Study of Economic Inequality.
    3. Sudheesh K. Kattumannil & N. Sreelakshmi & N. Balakrishnan, 2022. "Non-Parametric Inference for Gini Covariance and its Variants," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 84(2), pages 790-807, August.
    4. Philipp Poppitz, 2019. "Multidimensional Inequality and Divergence: The Eurozone Crisis in Retrospect," Working Papers V-420-19, University of Oldenburg, Department of Economics, revised Feb 2019.
    5. Asis Banerjee, 2014. "A multidimensional Lorenz dominance relation," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 42(1), pages 171-191, January.
    6. Thi Kim Thanh Bui & Guido Erreygers, 2020. "Multidimensional Inequality in Vietnam, 2002–2012," Economies, MDPI, vol. 8(2), pages 1-31, April.
    7. Banerjee, Asis Kumar, 2010. "A multidimensional Gini index," Mathematical Social Sciences, Elsevier, vol. 60(2), pages 87-93, September.
    8. Thibault Gajdos & John Weymark, 2005. "Multidimensional generalized Gini indices," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 26(3), pages 471-496, October.
    9. Casilda Lasso de la Vega & Ana Urrutia & Amaia Sarachu, 2010. "Characterizing multidimensional inequality measures which fulfil the Pigou–Dalton bundle principle," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 35(2), pages 319-329, July.
    10. Elisa Pagani, 2015. "Certainty Equivalent: Many Meanings of a Mean," Working Papers 24/2015, University of Verona, Department of Economics.
    11. Andrea Brandolini, 2008. "On applying synthetic indices of multidimensional well-being: health and income inequalities in selected EU countries," Temi di discussione (Economic working papers) 668, Bank of Italy, Economic Research and International Relations Area.
    12. Nicholas Rohde & Ross Guest, 2013. "Multidimensional Racial Inequality in the United States," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 114(2), pages 591-605, November.
    13. Francesca Greselin & Ričardas Zitikis, 2018. "From the Classical Gini Index of Income Inequality to a New Zenga-Type Relative Measure of Risk: A Modeller’s Perspective," Econometrics, MDPI, vol. 6(1), pages 1-20, January.
    14. John A. Weymark, 2003. "The Normative Approach to the Measurement of Multidimensional Inequality," Vanderbilt University Department of Economics Working Papers 0314, Vanderbilt University Department of Economics, revised Jan 2004.
    15. Henar Diez & Mª Casilda Lasso de la Vega & Ana Marta Urrutia, 2007. "Unit-Consistent Aggregative Multidimensional Inequality Measures: A Characterization," Working Papers 66, ECINEQ, Society for the Study of Economic Inequality.
    16. Alain Chateauneuf & Patrick Moyes, 2005. "Lorenz non-consistent welfare and inequality measurement," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 2(2), pages 61-87, January.
    17. Stephen P. Jenkins & Philippe Van Kerm, 2016. "Assessing Individual Income Growth," Economica, London School of Economics and Political Science, vol. 83(332), pages 679-703, October.
    18. Peter J. Lambert & Helen T. Naughton, 2009. "The Equal Absolute Sacrifice Principle Revisited," Journal of Economic Surveys, Wiley Blackwell, vol. 23(2), pages 328-349, April.
    19. Stephen Bazen & Patrick Moyes, 2012. "Elitism and stochastic dominance," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 39(1), pages 207-251, June.
    20. Chateauneuf, Alain & Cohen, Michele & Meilijson, Isaac, 2004. "Four notions of mean-preserving increase in risk, risk attitudes and applications to the rank-dependent expected utility model," Journal of Mathematical Economics, Elsevier, vol. 40(5), pages 547-571, August.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:wpaper:hal-02340386. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.