IDEAS home Printed from https://ideas.repec.org/a/taf/japsta/v40y2013i3p583-599.html
   My bibliography  Save this article

A clustering approach to interpretable principal components

Author

Listed:
  • Doyo Enki
  • Nickolay Trendafilov
  • Ian Jolliffe

Abstract

A new method for constructing interpretable principal components is proposed. The method first clusters the variables, and then interpretable (sparse) components are constructed from the correlation matrices of the clustered variables. For the first step of the method, a new weighted-variances method for clustering variables is proposed. It reflects the nature of the problem that the interpretable components should maximize the explained variance and thus provide sparse dimension reduction. An important feature of the new clustering procedure is that the optimal number of clusters (and components) can be determined in a non-subjective manner. The new method is illustrated using well-known simulated and real data sets. It clearly outperforms many existing methods for sparse principal component analysis in terms of both explained variance and sparseness.

Suggested Citation

  • Doyo Enki & Nickolay Trendafilov & Ian Jolliffe, 2013. "A clustering approach to interpretable principal components," Journal of Applied Statistics, Taylor & Francis Journals, vol. 40(3), pages 583-599.
  • Handle: RePEc:taf:japsta:v:40:y:2013:i:3:p:583-599
    DOI: 10.1080/02664763.2012.749846
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/02664763.2012.749846
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/02664763.2012.749846?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Nickolay Trendafilov, 2014. "From simple structure to sparse components: a review," Computational Statistics, Springer, vol. 29(3), pages 431-454, June.
    2. Adelaide Freitas & Eloísa Macedo & Maurizio Vichi, 2021. "An empirical comparison of two approaches for CDPCA in high-dimensional data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(3), pages 1007-1031, September.
    3. Ni, Bin & Spatareanu, Mariana & Manole, Vlad & Otsuki, Tsunehiro & Yamada, Hiroyuki, 2017. "The origin of FDI and domestic firms’ productivity—Evidence from Vietnam," Journal of Asian Economics, Elsevier, vol. 52(C), pages 56-76.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:japsta:v:40:y:2013:i:3:p:583-599. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/CJAS20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.