IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0013066.html
   My bibliography  Save this article

Ontology-Based Meta-Analysis of Global Collections of High-Throughput Public Data

Author

Listed:
  • Ilya Kupershmidt
  • Qiaojuan Jane Su
  • Anoop Grewal
  • Suman Sundaresh
  • Inbal Halperin
  • James Flynn
  • Mamatha Shekar
  • Helen Wang
  • Jenny Park
  • Wenwu Cui
  • Gregory D Wall
  • Robert Wisotzkey
  • Satnam Alag
  • Saeid Akhtari
  • Mostafa Ronaghi

Abstract

Background: The investigation of the interconnections between the molecular and genetic events that govern biological systems is essential if we are to understand the development of disease and design effective novel treatments. Microarray and next-generation sequencing technologies have the potential to provide this information. However, taking full advantage of these approaches requires that biological connections be made across large quantities of highly heterogeneous genomic datasets. Leveraging the increasingly huge quantities of genomic data in the public domain is fast becoming one of the key challenges in the research community today. Methodology/Results: We have developed a novel data mining framework that enables researchers to use this growing collection of public high-throughput data to investigate any set of genes or proteins. The connectivity between molecular states across thousands of heterogeneous datasets from microarrays and other genomic platforms is determined through a combination of rank-based enrichment statistics, meta-analyses, and biomedical ontologies. We address data quality concerns through dataset replication and meta-analysis and ensure that the majority of the findings are derived using multiple lines of evidence. As an example of our strategy and the utility of this framework, we apply our data mining approach to explore the biology of brown fat within the context of the thousands of publicly available gene expression datasets. Conclusions: Our work presents a practical strategy for organizing, mining, and correlating global collections of large-scale genomic data to explore normal and disease biology. Using a hypothesis-free approach, we demonstrate how a data-driven analysis across very large collections of genomic data can reveal novel discoveries and evidence to support existing hypothesis.

Suggested Citation

  • Ilya Kupershmidt & Qiaojuan Jane Su & Anoop Grewal & Suman Sundaresh & Inbal Halperin & James Flynn & Mamatha Shekar & Helen Wang & Jenny Park & Wenwu Cui & Gregory D Wall & Robert Wisotzkey & Satnam , 2010. "Ontology-Based Meta-Analysis of Global Collections of High-Throughput Public Data," PLOS ONE, Public Library of Science, vol. 5(9), pages 1-13, September.
  • Handle: RePEc:plo:pone00:0013066
    DOI: 10.1371/journal.pone.0013066
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0013066
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0013066&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0013066?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Andrea H. Bild & Guang Yao & Jeffrey T. Chang & Quanli Wang & Anil Potti & Dawn Chasse & Mary-Beth Joshi & David Harpole & Johnathan M. Lancaster & Andrew Berchuck & John A. Olson & Jeffrey R. Marks &, 2006. "Oncogenic pathway signatures in human cancers as a guide to targeted therapies," Nature, Nature, vol. 439(7074), pages 353-357, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Thomas J Crisman & Ivette Zelaya & Dan R Laks & Yining Zhao & Riki Kawaguchi & Fuying Gao & Harley I Kornblum & Giovanni Coppola, 2016. "Identification of an Efficient Gene Expression Panel for Glioblastoma Classification," PLOS ONE, Public Library of Science, vol. 11(11), pages 1-19, November.
    2. Hong-Tao Li & Liya Xu & Daniel J. Weisenberger & Meng Li & Wanding Zhou & Chen-Ching Peng & Kevin Stachelek & David Cobrinik & Gangning Liang & Jesse L. Berry, 2022. "Characterizing DNA methylation signatures of retinoblastoma using aqueous humor liquid biopsy," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    3. José Caldas & Susana Vinga, 2014. "Global Meta-Analysis of Transcriptomics Studies," PLOS ONE, Public Library of Science, vol. 9(2), pages 1-11, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Junjie Su & Byung-Jun Yoon & Edward R Dougherty, 2009. "Accurate and Reliable Cancer Classification Based on Probabilistic Inference of Pathway Activity," PLOS ONE, Public Library of Science, vol. 4(12), pages 1-10, December.
    2. Carey K Anders & Chaitanya R Acharya & David S Hsu & Gloria Broadwater & Katherine Garman & John A Foekens & Yi Zhang & Yixin Wang & Kelly Marcom & Jeffrey R Marks & Sayan Mukherjee & Joseph R Nevins , 2008. "Age-Specific Differences in Oncogenic Pathway Deregulation Seen in Human Breast Tumors," PLOS ONE, Public Library of Science, vol. 3(1), pages 1-8, January.
    3. Verena Jabs & Karolina Edlund & Helena König & Marianna Grinberg & Katrin Madjar & Jörg Rahnenführer & Simon Ekman & Michael Bergkvist & Lars Holmberg & Katja Ickstadt & Johan Botling & Jan G Hengstle, 2017. "Integrative analysis of genome-wide gene copy number changes and gene expression in non-small cell lung cancer," PLOS ONE, Public Library of Science, vol. 12(11), pages 1-23, November.
    4. Eun Sung Park & Ju-Seog Lee & Hyun Goo Woo & Fenghuang Zhan & Joanna H Shih & John D Shaughnessy Jr. & J Frederic Mushinski, 2007. "Heterologous Tissue Culture Expression Signature Predicts Human Breast Cancer Prognosis," PLOS ONE, Public Library of Science, vol. 2(1), pages 1-16, January.
    5. David Lindgren & Gottfrid Sjödahl & Martin Lauss & Johan Staaf & Gunilla Chebil & Kristina Lövgren & Sigurdur Gudjonsson & Fredrik Liedberg & Oliver Patschan & Wiking Månsson & Mårten Fernö & Mattias , 2012. "Integrated Genomic and Gene Expression Profiling Identifies Two Major Genomic Circuits in Urothelial Carcinoma," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-11, June.
    6. Matthias Weber & Martin Schumacher & Harald Binder, 2014. "Regularized Regression Incorporating Network Information: Simultaneous Estimation of Covariate Coefficients and Connection Signs," Tinbergen Institute Discussion Papers 14-089/I, Tinbergen Institute.
    7. Hu, Jianwei & Chai, Hao, 2013. "Adjusted regularized estimation in the accelerated failure time model with high dimensional covariates," Journal of Multivariate Analysis, Elsevier, vol. 122(C), pages 96-114.
    8. Xuan Bich Trinh & Wiebren A A Tjalma & Luc Y Dirix & Peter B Vermeulen & Dieter J Peeters & Dimcho Bachvarov & Marie Plante & Els M Berns & Jozien Helleman & Steven J Van Laere & Peter A van Dam, 2011. "Microarray-Based Oncogenic Pathway Profiling in Advanced Serous Papillary Ovarian Carcinoma," PLOS ONE, Public Library of Science, vol. 6(7), pages 1-9, July.
    9. Lucas Joseph & Carvalho Carlos & West Mike, 2009. "A Bayesian Analysis Strategy for Cross-Study Translation of Gene Expression Biomarkers," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-28, February.
    10. Peter Langfelder & Paul S Mischel & Steve Horvath, 2013. "When Is Hub Gene Selection Better than Standard Meta-Analysis?," PLOS ONE, Public Library of Science, vol. 8(4), pages 1-16, April.
    11. Andrew E Teschendorff & Michel Journée & Pierre A Absil & Rodolphe Sepulchre & Carlos Caldas, 2007. "Elucidating the Altered Transcriptional Programs in Breast Cancer using Independent Component Analysis," PLOS Computational Biology, Public Library of Science, vol. 3(8), pages 1-16, August.
    12. Brian D Bennett & Qing Xiong & Sayan Mukherjee & Terrence S Furey, 2012. "A Predictive Framework for Integrating Disparate Genomic Data Types Using Sample-Specific Gene Set Enrichment Analysis and Multi-Task Learning," PLOS ONE, Public Library of Science, vol. 7(9), pages 1-13, September.
    13. Haleh Yasrebi & Peter Sperisen & Viviane Praz & Philipp Bucher, 2009. "Can Survival Prediction Be Improved By Merging Gene Expression Data Sets?," PLOS ONE, Public Library of Science, vol. 4(10), pages 1-14, October.
    14. Hung-Chia Chen & Wen Zou & Tzu-Pin Lu & James J Chen, 2014. "A Composite Model for Subgroup Identification and Prediction via Bicluster Analysis," PLOS ONE, Public Library of Science, vol. 9(10), pages 1-14, October.
    15. Dennis Kostka & Rainer Spang, 2008. "Microarray Based Diagnosis Profits from Better Documentation of Gene Expression Signatures," PLOS Computational Biology, Public Library of Science, vol. 4(2), pages 1-6, February.
    16. Kuang Du & Shiyou Wei & Zhi Wei & Dennie T. Frederick & Benchun Miao & Tabea Moll & Tian Tian & Eric Sugarman & Dmitry I. Gabrilovich & Ryan J. Sullivan & Lunxu Liu & Keith T. Flaherty & Genevieve M. , 2021. "Pathway signatures derived from on-treatment tumor specimens predict response to anti-PD1 blockade in metastatic melanoma," Nature Communications, Nature, vol. 12(1), pages 1-16, December.
    17. Ruoqi Peng & Sriram Sridhar & Gaurav Tyagi & Jonathan E Phillips & Rosario Garrido & Paul Harris & Lisa Burns & Lorena Renteria & John Woods & Leena Chen & John Allard & Palanikumar Ravindran & Hans B, 2013. "Bleomycin Induces Molecular Changes Directly Relevant to Idiopathic Pulmonary Fibrosis: A Model for “Active” Disease," PLOS ONE, Public Library of Science, vol. 8(4), pages 1-15, April.
    18. Dario Zimmerli & Chiara S. Brambillasca & Francien Talens & Jinhyuk Bhin & Renske Linstra & Lou Romanens & Arkajyoti Bhattacharya & Stacey E. P. Joosten & Ana Moises Silva & Nuno Padrao & Max D. Welle, 2022. "MYC promotes immune-suppression in triple-negative breast cancer via inhibition of interferon signaling," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    19. Balázs Győrffy & Pawel Surowiak & Jan Budczies & András Lánczky, 2013. "Online Survival Analysis Software to Assess the Prognostic Value of Biomarkers Using Transcriptomic Data in Non-Small-Cell Lung Cancer," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-8, December.
    20. Mariëlle I Gallegos Ruiz & Karijn Floor & Paul Roepman & José A Rodriguez & Gerrit A Meijer & Wolter J Mooi & Ewa Jassem & Jacek Niklinski & Thomas Muley & Nico van Zandwijk & Egbert F Smit & Kristin , 2008. "Integration of Gene Dosage and Gene Expression in Non-Small Cell Lung Cancer, Identification of HSP90 as Potential Target," PLOS ONE, Public Library of Science, vol. 3(3), pages 1-8, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0013066. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.