IDEAS home Printed from https://ideas.repec.org/a/spr/sankha/v81y2019i1d10.1007_s13171-018-00160-1.html
   My bibliography  Save this article

Gaussian Asymptotic Limits for the α-transformation in the Analysis of Compositional Data

Author

Listed:
  • Yannis Pantazis

    (Foundation for Research and Technology - Hellas)

  • Michail Tsagris

    (University of Crete)

  • Andrew T. A. Wood

    (University of Nottingham)

Abstract

Compositional data consists of vectors of proportions whose components sum to 1. Such vectors lie in the standard simplex, which is a manifold with boundary. One issue that has been rather controversial within the field of compositional data analysis is the choice of metric on the simplex. One popular possibility has been to use the metric implied by log-transforming the data, as proposed by Aitchison (Biometrika70, 57–65, 1983, 1986) and another popular approach has been to use the standard Euclidean metric inherited from the ambient space. Tsagris et al. (2011) proposed a one-parameter family of power transformations, the α-transformations, which include both the metric implied by Aitchison’s transformation and the Euclidean metric as particular cases. Our underlying philosophy is that, with many datasets, it may make sense to use the data to help us determine a suitable metric. A related possibility is to apply the α-transformations to a parametric family of distributions, and then estimate α along with the other parameters. However, as we shall see, when one follows this last approach with the Dirichlet family, some care is needed in a certain limiting case which arises (α → 0), as we found out when fitting this model to real and simulated data. Specifically, when the maximum likelihood estimator of α is close to 0, the other parameters tend to be large. The main purpose of the paper is to study this limiting case both theoretically and numerically and to provide insight into these numerical findings.

Suggested Citation

  • Yannis Pantazis & Michail Tsagris & Andrew T. A. Wood, 2019. "Gaussian Asymptotic Limits for the α-transformation in the Analysis of Compositional Data," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(1), pages 63-82, February.
  • Handle: RePEc:spr:sankha:v:81:y:2019:i:1:d:10.1007_s13171-018-00160-1
    DOI: 10.1007/s13171-018-00160-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13171-018-00160-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13171-018-00160-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2016. "Improved classi cation for compositional data using the $\alpha$-transformation," MPRA Paper 67657, University Library of Munich, Germany.
    2. T. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2011. "A data-based power transformation for compositional data," MPRA Paper 53068, University Library of Munich, Germany.
    3. Michail Tsagris & Simon Preston & Andrew T. A. Wood, 2016. "Improved Classification for Compositional Data Using the α-transformation," Journal of Classification, Springer;The Classification Society, vol. 33(2), pages 243-261, July.
    4. T. Hotz & S. Huckemann, 2015. "Intrinsic means on the circle: uniqueness, locus and asymptotics," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 67(1), pages 177-193, February.
    5. M. J. Baxter, 1995. "Standardization and Transformation in Principal Component Analysis, with Applications to Archaeometry," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 44(4), pages 513-527, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Dieter & Andrée, Bo Pieter Johannes & Chamorro, Andres Fernando & Spencer, Phoebe Girouard, 2022. "Transitions into and out of food insecurity: A probabilistic approach with panel data evidence from 15 countries," World Development, Elsevier, vol. 159(C).
    2. Wang,Dieter & Andree,Bo Pieter Johannes & Chamorro Elizondo,Andres Fernando & Spencer,Phoebe Girouard, 2020. "Stochastic Modeling of Food Insecurity," Policy Research Working Paper Series 9413, The World Bank.
    3. Rabi Bhattacharya & Rachel Oliver, 2019. "Nonparametric Analysis of Non-Euclidean Data on Shapes and Images," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(1), pages 1-36, February.
    4. Chong, Alain Yee-Loong & Ooi, Keng-Boon & Sohal, Amrik, 2009. "The relationship between supply chain factors and adoption of e-Collaboration tools: An empirical examination," International Journal of Production Economics, Elsevier, vol. 122(1), pages 150-160, November.
    5. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2016. "Improved classi cation for compositional data using the $\alpha$-transformation," MPRA Paper 67657, University Library of Munich, Germany.
    6. Arthur Pewsey & Eduardo García-Portugués, 2021. "Recent advances in directional statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(1), pages 1-58, March.
    7. Stephan F. Huckemann, 2021. "Comments on: Recent advances in directional statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(1), pages 71-75, March.
    8. Chrys Caroni & Nedret Billor, 2007. "Robust Detection of Multiple Outliers in Grouped Multivariate Data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 34(10), pages 1241-1250.
    9. Michail Tsagris & Simon Preston & Andrew T. A. Wood, 2016. "Improved Classification for Compositional Data Using the α-transformation," Journal of Classification, Springer;The Classification Society, vol. 33(2), pages 243-261, July.
    10. Chae, Seong S. & Warde, William D., 2006. "Effect of using principal coordinates and principal components on retrieval of clusters," Computational Statistics & Data Analysis, Elsevier, vol. 50(6), pages 1407-1417, March.
    11. Rajesh Barik & Sanjaya Kumar Lenka, 2023. "Does financial inclusion control corruption in upper-middle and lower-middle income countries?," Asia-Pacific Journal of Regional Science, Springer, vol. 7(1), pages 69-92, March.
    12. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2016. "Nonparametric hypothesis testing for equality of means on the simplex," MPRA Paper 72771, University Library of Munich, Germany.
    13. T. Tsagris, Michail & Preston, Simon & T.A. Wood, Andrew, 2011. "A data-based power transformation for compositional data," MPRA Paper 53068, University Library of Munich, Germany.
    14. Ceyhun Elgin & Gökçer Özgür & Kerem Cantekin, 2023. "Measuring green technology adoption across countries," Sustainable Development, John Wiley & Sons, Ltd., vol. 31(1), pages 1-11, February.
    15. Tsagris, Michail, 2015. "Regression analysis with compositional data containing zero values," MPRA Paper 67868, University Library of Munich, Germany.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:sankha:v:81:y:2019:i:1:d:10.1007_s13171-018-00160-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.