IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0056859.html
   My bibliography  Save this article

Edge Principal Components and Squash Clustering: Using the Special Structure of Phylogenetic Placement Data for Sample Comparison

Author

Listed:
  • Frederick A Matsen IV
  • Steven N Evans

Abstract

Principal components analysis (PCA) and hierarchical clustering are two of the most heavily used techniques for analyzing the differences between nucleic acid sequence samples taken from a given environment. They have led to many insights regarding the structure of microbial communities. We have developed two new complementary methods that leverage how this microbial community data sits on a phylogenetic tree. Edge principal components analysis enables the detection of important differences between samples that contain closely related taxa. Each principal component axis is a collection of signed weights on the edges of the phylogenetic tree, and these weights are easily visualized by a suitable thickening and coloring of the edges. Squash clustering outputs a (rooted) clustering tree in which each internal node corresponds to an appropriate “average” of the original samples at the leaves below the node. Moreover, the length of an edge is a suitably defined distance between the averaged samples associated with the two incident nodes, rather than the less interpretable average of distances produced by UPGMA, the most widely used hierarchical clustering method in this context. We present these methods and illustrate their use with data from the human microbiome.

Suggested Citation

  • Frederick A Matsen IV & Steven N Evans, 2013. "Edge Principal Components and Squash Clustering: Using the Special Structure of Phylogenetic Placement Data for Sample Comparison," PLOS ONE, Public Library of Science, vol. 8(3), pages 1-15, March.
  • Handle: RePEc:plo:pone00:0056859
    DOI: 10.1371/journal.pone.0056859
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0056859
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0056859&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0056859?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Steven N. Evans & Frederick A. Matsen, 2012. "The phylogenetic Kantorovich–Rubinstein metric for environmental sequence samples," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 74(3), pages 569-592, June.
    2. Ruth E. Ley & Peter J. Turnbaugh & Samuel Klein & Jeffrey I. Gordon, 2006. "Human gut microbes associated with obesity," Nature, Nature, vol. 444(7122), pages 1022-1023, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Aurélien Saghaï & Grace Pold & Christopher M. Jones & Sara Hallin, 2023. "Phyloecology of nitrate ammonifiers and their importance relative to denitrifiers in global terrestrial biomes," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    2. Antoine Bichat & Christophe Ambroise & Mahendra Mariadassou, 2022. "Hierarchical correction of p-values via an ultrametric tree running Ornstein-Uhlenbeck process," Computational Statistics, Springer, vol. 37(3), pages 995-1013, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qian Zhang & Lijuan Zhang & Yang Lyu & Yutao Shi & Liangyun Zhu & Min Zhang & Yuyan Zhao & Di Zhao & Lei Wang & Dan Yi & Yongqing Hou & Tao Wu, 2022. "Dietary supplementation of Lactobacillus zeae regulated the gut microbiome in piglets infected with enterotoxigenic Escherichia coli," Czech Journal of Animal Science, Czech Academy of Agricultural Sciences, vol. 67(1), pages 27-38.
    2. Kiran Konain & Sadia & Turfa Nadeem & Adeed Khan & Warda Iqbal & Arsalan & Amir Javed & Ruby Khan & Kainat Jamil & Kainat Jamil, 2018. "Importance of Probiotics in Gastrointestinal Tract," Journal of Asian Scientific Research, Asian Economic and Social Society, vol. 8(3), pages 128-143, March.
    3. Pirjo Wacklin & Harri Mäkivuokko & Noora Alakulppi & Janne Nikkilä & Heli Tenkanen & Jarkko Räbinä & Jukka Partanen & Kari Aranko & Jaana Mättö, 2011. "Secretor Genotype (FUT2 gene) Is Strongly Associated with the Composition of Bifidobacteria in the Human Intestine," PLOS ONE, Public Library of Science, vol. 6(5), pages 1-10, May.
    4. Frisha Abkar & Sajjad ur Rahman & Ahsan Naveed & Hira Rasheed & Syed Ashar Mehfooz, 2019. "Evaluation of Oral Microflora in Obese and Non- Obese Humans from District Faisalabad, Pakistan," Current Research in Diabetes & Obesity Journal, Juniper Publishers Inc., vol. 10(1), pages 12-16, March.
    5. Mariana F. Fernández & Iris Reina-Pérez & Juan Manuel Astorga & Andrea Rodríguez-Carrillo & Julio Plaza-Díaz & Luis Fontana, 2018. "Breast Cancer and Its Relationship with the Microbiota," IJERPH, MDPI, vol. 15(8), pages 1-20, August.
    6. Vinod Nikhra, 2019. "The Novel Dimensions of Cardio-Metabolic Health Gut Microbiota, Dysbiosis and its Fallouts," Current Research in Diabetes & Obesity Journal, Juniper Publishers Inc., vol. 11(1), pages 28-37, June.
    7. Maja Czerwińska-Rogowska & Karolina Skonieczna-Żydecka & Krzysztof Kaseja & Karolina Jakubczyk & Joanna Palma & Marta Bott-Olejnik & Sławomir Brzozowski & Ewa Stachowska, 2022. "Kitchen Diet vs. Industrial Diets—Impact on Intestinal Barrier Parameters among Stroke Patients," IJERPH, MDPI, vol. 19(10), pages 1-11, May.
    8. Zemin Zheng & Jinchi Lv & Wei Lin, 2021. "Nonsparse Learning with Latent Variables," Operations Research, INFORMS, vol. 69(1), pages 346-359, January.
    9. Elio L Herzog & Melania Wäfler & Irene Keller & Sebastian Wolf & Martin S Zinkernagel & Denise C Zysset-Burri, 2021. "The importance of age in compositional and functional profiling of the human intestinal microbiome," PLOS ONE, Public Library of Science, vol. 16(10), pages 1-13, October.
    10. Pratheepa Jeganathan & Susan P. Holmes, 2021. "A Statistical Perspective on the Challenges in Molecular Microbial Biology," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 26(2), pages 131-160, June.
    11. Michael DiMarzio & Brigida Rusconi & Neela H Yennawar & Mark Eppinger & Andrew D Patterson & Edward G Dudley, 2017. "Identification of a mouse Lactobacillus johnsonii strain with deconjugase activity against the FXR antagonist T-β-MCA," PLOS ONE, Public Library of Science, vol. 12(9), pages 1-15, September.
    12. Tamar Ringel-Kulka & Jing Cheng & Yehuda Ringel & Jarkko Salojärvi & Ian Carroll & Airi Palva & Willem M de Vos & Reetta Satokari, 2013. "Intestinal Microbiota in Healthy U.S. Young Children and Adults—A High Throughput Microarray Analysis," PLOS ONE, Public Library of Science, vol. 8(5), pages 1-10, May.
    13. Ming Yang & Xiaoqiang Qi & Nan Li & Jussuf T. Kaifi & Shiyou Chen & Andrew A. Wheeler & Eric T. Kimchi & Aaron C. Ericsson & R. Scott Rector & Kevin F. Staveley-O’Carroll & Guangfu Li, 2023. "Western diet contributes to the pathogenesis of non-alcoholic steatohepatitis in male mice via remodeling gut microbiota and increasing production of 2-oleoylglycerol," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    14. Hannah Lees & Jonathan Swann & Simon M Poucher & Jeremy K Nicholson & Elaine Holmes & Ian D Wilson & Julian R Marchesi, 2014. "Age and Microenvironment Outweigh Genetic Influence on the Zucker Rat Microbiome," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-11, September.
    15. Itay Daybog & Oren Kolodny, 2023. "A computational framework for resolving the microbiome diversity conundrum," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    16. Pamela N Luna & Jonathan M Mansbach & Chad A Shaw, 2020. "A joint modeling approach for longitudinal microbiome data improves ability to detect microbiome associations with disease," PLOS Computational Biology, Public Library of Science, vol. 16(12), pages 1-17, December.
    17. Edoardo Pasolli & Duy Tin Truong & Faizan Malik & Levi Waldron & Nicola Segata, 2016. "Machine Learning Meta-analysis of Large Metagenomic Datasets: Tools and Biological Insights," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-26, July.
    18. Sarah L Hagerty & Kent E Hutchison & Christopher A Lowry & Angela D Bryan, 2020. "An empirically derived method for measuring human gut microbiome alpha diversity: Demonstrated utility in predicting health-related outcomes among a human clinical sample," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-21, March.
    19. Daniel Beck & James A Foster, 2014. "Machine Learning Techniques Accurately Classify Microbial Communities by Bacterial Vaginosis Characteristics," PLOS ONE, Public Library of Science, vol. 9(2), pages 1-8, February.
    20. Seung Jin Han & Kyoung Hwa Ha & Ja Young Jeon & Hae Jin Kim & Kwan Woo Lee & Dae Jung Kim, 2015. "Impact of Cadmium Exposure on the Association between Lipopolysaccharide and Metabolic Syndrome," IJERPH, MDPI, vol. 12(9), pages 1-14, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0056859. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.