IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0012776.html
   My bibliography  Save this article

Inferring Regulatory Networks from Expression Data Using Tree-Based Methods

Author

Listed:
  • Vân Anh Huynh-Thu
  • Alexandre Irrthum
  • Louis Wehenkel
  • Pierre Geurts

Abstract

One of the pressing open problems of computational systems biology is the elucidation of the topology of genetic regulatory networks (GRNs) using high throughput genomic data, in particular microarray gene expression data. The Dialogue for Reverse Engineering Assessments and Methods (DREAM) challenge aims to evaluate the success of GRN inference algorithms on benchmarks of simulated data. In this article, we present GENIE3, a new algorithm for the inference of GRNs that was best performer in the DREAM4 In Silico Multifactorial challenge. GENIE3 decomposes the prediction of a regulatory network between p genes into p different regression problems. In each of the regression problems, the expression pattern of one of the genes (target gene) is predicted from the expression patterns of all the other genes (input genes), using tree-based ensemble methods Random Forests or Extra-Trees. The importance of an input gene in the prediction of the target gene expression pattern is taken as an indication of a putative regulatory link. Putative regulatory links are then aggregated over all genes to provide a ranking of interactions from which the whole network is reconstructed. In addition to performing well on the DREAM4 In Silico Multifactorial challenge simulated data, we show that GENIE3 compares favorably with existing algorithms to decipher the genetic regulatory network of Escherichia coli. It doesn't make any assumption about the nature of gene regulation, can deal with combinatorial and non-linear interactions, produces directed GRNs, and is fast and scalable. In conclusion, we propose a new algorithm for GRN inference that performs well on both synthetic and real gene expression data. The algorithm, based on feature selection with tree-based ensemble methods, is simple and generic, making it adaptable to other types of genomic data and interactions.

Suggested Citation

  • Vân Anh Huynh-Thu & Alexandre Irrthum & Louis Wehenkel & Pierre Geurts, 2010. "Inferring Regulatory Networks from Expression Data Using Tree-Based Methods," PLOS ONE, Public Library of Science, vol. 5(9), pages 1-10, September.
  • Handle: RePEc:plo:pone00:0012776
    DOI: 10.1371/journal.pone.0012776
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0012776
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0012776&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0012776?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Natalie M. Clark & Trevor M. Nolan & Ping Wang & Gaoyuan Song & Christian Montes & Conner T. Valentine & Hongqing Guo & Rosangela Sozzani & Yanhai Yin & Justin W. Walley, 2021. "Integrated omics networks reveal the temporal signaling events of brassinosteroid response in Arabidopsis," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    2. Li, Jiawen & Meng, Lu & Zhang, Zelin & Yang, Kejia, 2023. "Low-frequency, high-impact: Discovering important rare events from UGC," Journal of Retailing and Consumer Services, Elsevier, vol. 70(C).
    3. Mingyi Wang & Jerome Verdier & Vagner A Benedito & Yuhong Tang & Jeremy D Murray & Yinbing Ge & Jörg D Becker & Helena Carvalho & Christian Rogers & Michael Udvardi & Ji He, 2013. "LegumeGRN: A Gene Regulatory Network Prediction Server for Functional and Comparative Studies," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-7, July.
    4. Holger Weishaupt & Patrik Johansson & Christopher Engström & Sven Nelander & Sergei Silvestrov & Fredrik J Swartling, 2017. "Loss of Conservation of Graph Centralities in Reverse-engineered Transcriptional Regulatory Networks," Methodology and Computing in Applied Probability, Springer, vol. 19(4), pages 1089-1105, December.
    5. Cecilia Pessoa Rodrigues & Aindrila Chatterjee & Meike Wiese & Thomas Stehle & Witold Szymanski & Maria Shvedunova & Asifa Akhtar, 2021. "Histone H4 lysine 16 acetylation controls central carbon metabolism and diet-induced obesity in mice," Nature Communications, Nature, vol. 12(1), pages 1-21, December.
    6. Fei Wang & Peiwen Ding & Xue Liang & Xiangning Ding & Camilla Blunk Brandt & Evelina Sjöstedt & Jiacheng Zhu & Saga Bolund & Lijing Zhang & Laura P. M. H. Rooij & Lihua Luo & Yanan Wei & Wandong Zhao , 2022. "Endothelial cell heterogeneity and microglia regulons revealed by a pig cell landscape at single-cell level," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    7. Qiao Wen Tan & Peng Ken Lim & Zhong Chen & Asher Pasha & Nicholas Provart & Marius Arend & Zoran Nikoloski & Marek Mutwil, 2023. "Cross-stress gene expression atlas of Marchantia polymorpha reveals the hierarchy and regulatory principles of abiotic stress responses," Nature Communications, Nature, vol. 14(1), pages 1-19, December.
    8. Jie Xiong & Tong Zhou, 2012. "Gene Regulatory Network Inference from Multifactorial Perturbation Data Using both Regression and Correlation Analyses," PLOS ONE, Public Library of Science, vol. 7(9), pages 1-13, September.
    9. Alfonso Monaco & Nicola Amoroso & Loredana Bellantuono & Eufemia Lella & Angela Lombardi & Anna Monda & Andrea Tateo & Roberto Bellotti & Sabina Tangaro, 2019. "Shannon entropy approach reveals relevant genes in Alzheimer’s disease," PLOS ONE, Public Library of Science, vol. 14(12), pages 1-29, December.
    10. Marco Grimaldi & Roberto Visintainer & Giuseppe Jurman, 2011. "RegnANN: Reverse Engineering Gene Networks Using Artificial Neural Networks," PLOS ONE, Public Library of Science, vol. 6(12), pages 1-19, December.
    11. Yichi Niu & Jiayi Luo & Chenghang Zong, 2024. "Single-cell total-RNA profiling unveils regulatory hubs of transcription factors," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    12. Bastien Lextrait, 2021. "Scaling up SME's credit scoring scope with LightGBM," EconomiX Working Papers 2021-25, University of Paris Nanterre, EconomiX.
    13. Meichen Dong & Yiping He & Yuchao Jiang & Fei Zou, 2023. "Joint gene network construction by single‐cell RNA sequencing data," Biometrics, The International Biometric Society, vol. 79(2), pages 915-925, June.
    14. Fei Liu & Shao-Wu Zhang & Wei-Feng Guo & Ze-Gang Wei & Luonan Chen, 2016. "Inference of Gene Regulatory Network Based on Local Bayesian Networks," PLOS Computational Biology, Public Library of Science, vol. 12(8), pages 1-17, August.
    15. Maghsoodi, Masoume, 2016. "A New Method to Build Gene Regulation Network Based on Fuzzy Hierarchical Clustering Methods," MPRA Paper 79743, University Library of Munich, Germany.
    16. Marius Arend & Yizhong Yuan & M. Águila Ruiz-Sola & Nooshin Omranian & Zoran Nikoloski & Dimitris Petroutsos, 2023. "Widening the landscape of transcriptional regulation of green algal photoprotection," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    17. Takeshi Hase & Samik Ghosh & Ryota Yamanaka & Hiroaki Kitano, 2013. "Harnessing Diversity towards the Reconstructing of Large Scale Gene Regulatory Networks," PLOS Computational Biology, Public Library of Science, vol. 9(11), pages 1-16, November.
    18. Ze Yan & Ji Yang & Wen-Tian Wei & Ming-Liang Zhou & Dong-Xin Mo & Xing Wan & Rui Ma & Mei-Ming Wu & Jia-Hui Huang & Ya-Jing Liu & Feng-Hua Lv & Meng-Hua Li, 2024. "A time-resolved multi-omics atlas of transcriptional regulation in response to high-altitude hypoxia across whole-body tissues," Nature Communications, Nature, vol. 15(1), pages 1-22, December.
    19. Hyunje Yang & Honggeun Lim & Haewon Moon & Qiwen Li & Sooyoun Nam & Jaehoon Kim & Hyung Tae Choi, 2022. "Simple Optimal Sampling Algorithm to Strengthen Digital Soil Mapping Using the Spatial Distribution of Machine Learning Predictive Uncertainty: A Case Study for Field Capacity Prediction," Land, MDPI, vol. 11(11), pages 1-18, November.
    20. Natalie Jane de Vries & Rodrigo Reis & Pablo Moscato, 2015. "Clustering Consumers Based on Trust, Confidence and Giving Behaviour: Data-Driven Model Building for Charitable Involvement in the Australian Not-For-Profit Sector," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-28, April.
    21. Lingfei Wang & Tom Michoel, 2017. "Efficient and accurate causal inference with hidden confounders from genome-transcriptome variation data," PLOS Computational Biology, Public Library of Science, vol. 13(8), pages 1-26, August.
    22. Ruonan Wu & Michelle R. Davison & William C. Nelson & Montana L. Smith & Mary S. Lipton & Janet K. Jansson & Ryan S. McClure & Jason E. McDermott & Kirsten S. Hofmockel, 2023. "Hi-C metagenome sequencing reveals soil phage–host interactions," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    23. Rachael M. Zemek & Wee Loong Chin & Vanessa S. Fear & Ben Wylie & Thomas H. Casey & Cath Forbes & Caitlin M. Tilsed & Louis Boon & Belinda B. Guo & Anthony Bosco & Alistair R. R. Forrest & Michael J. , 2022. "Temporally restricted activation of IFNβ signaling underlies response to immune checkpoint therapy in mice," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    24. Dongsheng Chen & Jian Sun & Jiacheng Zhu & Xiangning Ding & Tianming Lan & Xiran Wang & Weiying Wu & Zhihua Ou & Linnan Zhu & Peiwen Ding & Haoyu Wang & Lihua Luo & Rong Xiang & Xiaoling Wang & Jiayin, 2021. "Single cell atlas for 11 non-model mammals, reptiles and birds," Nature Communications, Nature, vol. 12(1), pages 1-17, December.
    25. Kinzy Tyler G. & Starr Timothy K. & Tseng George C. & Ho Yen-Yi, 2019. "Meta-analytic framework for modeling genetic coexpression dynamics," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 18(1), pages 1-13, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0012776. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.