IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0053253.html
   My bibliography  Save this article

Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data

Author

Listed:
  • Zhenqiu Liu
  • Dechang Chen
  • Li Sheng
  • Amy Y Liu

Abstract

The amount of metagenomic data is growing rapidly while the computational methods for metagenome analysis are still in their infancy. It is important to develop novel statistical learning tools for the prediction of associations between bacterial communities and disease phenotypes and for the detection of differentially abundant features. In this study, we presented a novel statistical learning method for simultaneous association prediction and feature selection with metagenomic samples from two or multiple treatment populations on the basis of count data. We developed a linear programming based support vector machine with and joint penalties for binary and multiclass classifications with metagenomic count data (metalinprog). We evaluated the performance of our method on several real and simulation datasets. The proposed method can simultaneously identify features and predict classes with the metagenomic count data.

Suggested Citation

  • Zhenqiu Liu & Dechang Chen & Li Sheng & Amy Y Liu, 2013. "Class Prediction and Feature Selection with Linear Optimization for Metagenomic Count Data," PLOS ONE, Public Library of Science, vol. 8(3), pages 1-7, March.
  • Handle: RePEc:plo:pone00:0053253
    DOI: 10.1371/journal.pone.0053253
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0053253
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0053253&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0053253?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. James Robert White & Niranjan Nagarajan & Mihai Pop, 2009. "Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples," PLOS Computational Biology, Public Library of Science, vol. 5(4), pages 1-11, April.
    2. Peter J. Turnbaugh & Ruth E. Ley & Micah Hamady & Claire M. Fraser-Liggett & Rob Knight & Jeffrey I. Gordon, 2007. "The Human Microbiome Project," Nature, Nature, vol. 449(7164), pages 804-810, October.
    3. Onur Dagliyan & Fadime Uney-Yuksektepe & I Halil Kavakli & Metin Turkay, 2011. "Optimization Based Tumor Classification from Microarray Gene Expression Data," PLOS ONE, Public Library of Science, vol. 6(2), pages 1-10, February.
    4. G. Al-Jeiroudi & J. Gondzio, 2009. "Convergence Analysis of the Inexact Infeasible Interior-Point Method for Linear Optimization," Journal of Optimization Theory and Applications, Springer, vol. 141(2), pages 231-247, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shilan Li & Jianxin Shi & Paul Albert & Hong-Bin Fang, 2022. "Dependence Structure Analysis and Its Application in Human Microbiome," Mathematics, MDPI, vol. 11(1), pages 1-14, December.
    2. Amirhossein Shamsaddini & Kimia Dadkhah & Patrick M Gillevet, 2020. "BiomMiner: An advanced exploratory microbiome analysis and visualization pipeline," PLOS ONE, Public Library of Science, vol. 15(6), pages 1-13, June.
    3. Gregor Gorkiewicz & Gerhard G Thallinger & Slave Trajanoski & Stefan Lackner & Gernot Stocker & Thomas Hinterleitner & Christian Gülly & Christoph Högenauer, 2013. "Alterations in the Colonic Microbiota in Response to Osmotic Diarrhea," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-17, February.
    4. Patricio S La Rosa & Berkley Shands & Elena Deych & Yanjiao Zhou & Erica Sodergren & George Weinstock & William D Shannon, 2012. "Statistical Object Data Analysis of Taxonomic Trees from Human Microbiome Data," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-12, November.
    5. Pengfei Song & Wen Qin & YanGan Huang & Lei Wang & Zhenyuan Cai & Tongzuo Zhang, 2020. "Grazing Management Influences Gut Microbial Diversity of Livestock in the Same Area," Sustainability, MDPI, vol. 12(10), pages 1-12, May.
    6. Allison G. White & George S. Watts & Zhenqiang Lu & Maria M. Meza-Montenegro & Eric A. Lutz & Philip Harber & Jefferey L. Burgess, 2014. "Environmental Arsenic Exposure and Microbiota in Induced Sputum," IJERPH, MDPI, vol. 11(2), pages 1-15, February.
    7. Jae-Chang Cho, 2021. "Human microbiome privacy risks associated with summary statistics," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-11, April.
    8. Pirjo Wacklin & Harri Mäkivuokko & Noora Alakulppi & Janne Nikkilä & Heli Tenkanen & Jarkko Räbinä & Jukka Partanen & Kari Aranko & Jaana Mättö, 2011. "Secretor Genotype (FUT2 gene) Is Strongly Associated with the Composition of Bifidobacteria in the Human Intestine," PLOS ONE, Public Library of Science, vol. 6(5), pages 1-10, May.
    9. Disha Tandon & Mohammed Monzoorul Haque & Sharmila S Mande, 2016. "Inferring Intra-Community Microbial Interaction Patterns from Metagenomic Datasets Using Associative Rule Mining Techniques," PLOS ONE, Public Library of Science, vol. 11(4), pages 1-16, April.
    10. Yong Li & Jiejie Zhang & Jianqiang Zhang & Wenlai Xu & Zishen Mou, 2019. "Microbial Community Structure in the Sediments and Its Relation to Environmental Factors in Eutrophicated Sancha Lake," IJERPH, MDPI, vol. 16(11), pages 1-15, May.
    11. Gondzio, Jacek, 2012. "Interior point methods 25 years later," European Journal of Operational Research, Elsevier, vol. 218(3), pages 587-601.
    12. Paul J McMurdie & Susan Holmes, 2014. "Waste Not, Want Not: Why Rarefying Microbiome Data Is Inadmissible," PLOS Computational Biology, Public Library of Science, vol. 10(4), pages 1-12, April.
    13. Ran Li & Yongming Wang & Han Hu & Yan Tan & Yingfei Ma, 2022. "Metagenomic analysis reveals unexplored diversity of archaeal virome in the human gut," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    14. Benedetta Morini & Valeria Simoncini, 2017. "Stability and Accuracy of Inexact Interior Point Methods for Convex Quadratic Programming," Journal of Optimization Theory and Applications, Springer, vol. 175(2), pages 450-477, November.
    15. Mohammadhossein Mohammadisiahroudi & Ramin Fakhimi & Tamás Terlaky, 2024. "Efficient Use of Quantum Linear System Algorithms in Inexact Infeasible IPMs for Linear Optimization," Journal of Optimization Theory and Applications, Springer, vol. 202(1), pages 146-183, July.
    16. Barbara Emmenegger & Julien Massoni & Christine M. Pestalozzi & Miriam Bortfeld-Miller & Benjamin A. Maier & Julia A. Vorholt, 2023. "Identifying microbiota community patterns important for plant protection using synthetic communities and machine learning," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    17. Margaret Coleman & Christopher Elkins & Bradford Gutting & Emmanuel Mongodin & Gloria Solano‐Aguilar & Isabel Walls, 2018. "Microbiota and Dose Response: Evolving Paradigm of Health Triangle," Risk Analysis, John Wiley & Sons, vol. 38(10), pages 2013-2028, October.
    18. Liangliang Zhang & Yushu Shi & Robert R. Jenq & Kim‐Anh Do & Christine B. Peterson, 2021. "Bayesian compositional regression with structured priors for microbiome feature selection," Biometrics, The International Biometric Society, vol. 77(3), pages 824-838, September.
    19. Bin Wang, 2020. "A Zipf-plot based normalization method for high-throughput RNA-seq data," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-15, April.
    20. Sarah L Hagerty & Kent E Hutchison & Christopher A Lowry & Angela D Bryan, 2020. "An empirically derived method for measuring human gut microbiome alpha diversity: Demonstrated utility in predicting health-related outcomes among a human clinical sample," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-21, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0053253. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.