IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0092709.html
   My bibliography  Save this article

NIMEFI: Gene Regulatory Network Inference using Multiple Ensemble Feature Importance Algorithms

Author

Listed:
  • Joeri Ruyssinck
  • Vân Anh Huynh-Thu
  • Pierre Geurts
  • Tom Dhaene
  • Piet Demeester
  • Yvan Saeys

Abstract

One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms) and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made publicly available.

Suggested Citation

  • Joeri Ruyssinck & Vân Anh Huynh-Thu & Pierre Geurts & Tom Dhaene & Piet Demeester & Yvan Saeys, 2014. "NIMEFI: Gene Regulatory Network Inference using Multiple Ensemble Feature Importance Algorithms," PLOS ONE, Public Library of Science, vol. 9(3), pages 1-13, March.
  • Handle: RePEc:plo:pone00:0092709
    DOI: 10.1371/journal.pone.0092709
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0092709
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0092709&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0092709?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Jeremiah J Faith & Boris Hayete & Joshua T Thaden & Ilaria Mogno & Jamey Wierzbowski & Guillaume Cottarel & Simon Kasif & James J Collins & Timothy S Gardner, 2007. "Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles," PLOS Biology, Public Library of Science, vol. 5(1), pages 1-13, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Vinny Davies & Richard Reeve & William T. Harvey & Francois F. Maree & Dirk Husmeier, 2017. "A sparse hierarchical Bayesian model for detecting relevant antigenic sites in virus evolution," Computational Statistics, Springer, vol. 32(3), pages 803-843, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hossein Zare & Mostafa Kaveh & Arkady Khodursky, 2011. "Inferring a Transcriptional Regulatory Network from Gene Expression Data Using Nonlinear Manifold Embedding," PLOS ONE, Public Library of Science, vol. 6(8), pages 1-7, August.
    2. Diambra, L., 2011. "Coarse-grain reconstruction of genetic networks from expression levels," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(11), pages 2198-2207.
    3. Marco Grimaldi & Roberto Visintainer & Giuseppe Jurman, 2011. "RegnANN: Reverse Engineering Gene Networks Using Artificial Neural Networks," PLOS ONE, Public Library of Science, vol. 6(12), pages 1-19, December.
    4. Ruonan Wu & Michelle R. Davison & William C. Nelson & Montana L. Smith & Mary S. Lipton & Janet K. Jansson & Ryan S. McClure & Jason E. McDermott & Kirsten S. Hofmockel, 2023. "Hi-C metagenome sequencing reveals soil phage–host interactions," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    5. repec:jss:jstsof:37:i01 is not listed on IDEAS
    6. Tom Wilderjans & Dirk Depril & Iven Van Mechelen, 2013. "Additive Biclustering: A Comparison of One New and Two Existing ALS Algorithms," Journal of Classification, Springer;The Classification Society, vol. 30(1), pages 56-74, April.
    7. Shuhei Kimura & Masanao Sato & Mariko Okada-Hatakeyama, 2013. "Inference of Vohradský's Models of Genetic Networks by Solving Two-Dimensional Function Optimization Problems," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-11, December.
    8. Xiaomeng Zhang & Bin Shao & Yangle Wu & Ouyang Qi, 2013. "A Reverse Engineering Approach to Optimize Experiments for the Construction of Biological Regulatory Networks," PLOS ONE, Public Library of Science, vol. 8(9), pages 1-9, September.
    9. Takanori Hasegawa & Rui Yamaguchi & Masao Nagasaki & Satoru Miyano & Seiya Imoto, 2014. "Inference of Gene Regulatory Networks Incorporating Multi-Source Biological Knowledge via a State Space Model with L1 Regularization," PLOS ONE, Public Library of Science, vol. 9(8), pages 1-19, August.
    10. Kannan Venkateshan & Tegner Jesper, 2016. "Adaptive input data transformation for improved network reconstruction with information theoretic algorithms," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 15(6), pages 507-520, December.
    11. Fei Liu & Shao-Wu Zhang & Wei-Feng Guo & Ze-Gang Wei & Luonan Chen, 2016. "Inference of Gene Regulatory Network Based on Local Bayesian Networks," PLOS Computational Biology, Public Library of Science, vol. 12(8), pages 1-17, August.
    12. Hirose, Kei & Fujisawa, Hironori & Sese, Jun, 2017. "Robust sparse Gaussian graphical modeling," Journal of Multivariate Analysis, Elsevier, vol. 161(C), pages 172-190.
    13. Benafsh Husain & F Alex Feltus, 2019. "EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-15, August.
    14. Zhen Yang & Yen‐Yi Ho, 2022. "Modeling dynamic correlation in zero‐inflated bivariate count data with applications to single‐cell RNA sequencing data," Biometrics, The International Biometric Society, vol. 78(2), pages 766-776, June.
    15. Mingyi Wang & Jerome Verdier & Vagner A Benedito & Yuhong Tang & Jeremy D Murray & Yinbing Ge & Jörg D Becker & Helena Carvalho & Christian Rogers & Michael Udvardi & Ji He, 2013. "LegumeGRN: A Gene Regulatory Network Prediction Server for Functional and Comparative Studies," PLOS ONE, Public Library of Science, vol. 8(7), pages 1-7, July.
    16. Scott Christley & Qing Nie & Xiaohui Xie, 2009. "Incorporating Existing Network Information into Gene Network Inference," PLOS ONE, Public Library of Science, vol. 4(8), pages 1-13, August.
    17. Maghsoodi, Masoume, 2016. "A New Method to Build Gene Regulation Network Based on Fuzzy Hierarchical Clustering Methods," MPRA Paper 79743, University Library of Munich, Germany.
    18. Ambroise Jérôme & Robert Annie & Macq Benoit & Gala Jean-Luc, 2012. "Transcriptional Network Inference from Functional Similarity and Expression Data: A Global Supervised Approach," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(1), pages 1-24, January.
    19. Shiori Sagawa & Morgan N Price & Adam M Deutschbauer & Adam P Arkin, 2017. "Validating regulatory predictions from diverse bacteria with mutant fitness data," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-14, May.
    20. Guibo Ye & Mengfan Tang & Jian-Feng Cai & Qing Nie & Xiaohui Xie, 2013. "Low-Rank Regularization for Learning Gene Expression Programs," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-9, December.
    21. Yichao Han & Wanji Li & Alden Filko & Jingyao Li & Fuzhong Zhang, 2023. "Genome-wide promoter responses to CRISPR perturbations of regulators reveal regulatory networks in Escherichia coli," Nature Communications, Nature, vol. 14(1), pages 1-13, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0092709. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.