IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0105942.html
   My bibliography  Save this article

Inference of Gene Regulatory Networks Incorporating Multi-Source Biological Knowledge via a State Space Model with L1 Regularization

Author

Listed:
  • Takanori Hasegawa
  • Rui Yamaguchi
  • Masao Nagasaki
  • Satoru Miyano
  • Seiya Imoto

Abstract

Comprehensive understanding of gene regulatory networks (GRNs) is a major challenge in the field of systems biology. Currently, there are two main approaches in GRN analysis using time-course observation data, namely an ordinary differential equation (ODE)-based approach and a statistical model-based approach. The ODE-based approach can generate complex dynamics of GRNs according to biologically validated nonlinear models. However, it cannot be applied to ten or more genes to simultaneously estimate system dynamics and regulatory relationships due to the computational difficulties. The statistical model-based approach uses highly abstract models to simply describe biological systems and to infer relationships among several hundreds of genes from the data. However, the high abstraction generates false regulations that are not permitted biologically. Thus, when dealing with several tens of genes of which the relationships are partially known, a method that can infer regulatory relationships based on a model with low abstraction and that can emulate the dynamics of ODE-based models while incorporating prior knowledge is urgently required. To accomplish this, we propose a method for inference of GRNs using a state space representation of a vector auto-regressive (VAR) model with L1 regularization. This method can estimate the dynamic behavior of genes based on linear time-series modeling constructed from an ODE-based model and can infer the regulatory structure among several tens of genes maximizing prediction ability for the observational data. Furthermore, the method is capable of incorporating various types of existing biological knowledge, e.g., drug kinetics and literature-recorded pathways. The effectiveness of the proposed method is shown through a comparison of simulation studies with several previous methods. For an application example, we evaluated mRNA expression profiles over time upon corticosteroid stimulation in rats, thus incorporating corticosteroid kinetics/dynamics, literature-recorded pathways and transcription factor (TF) information.

Suggested Citation

  • Takanori Hasegawa & Rui Yamaguchi & Masao Nagasaki & Satoru Miyano & Seiya Imoto, 2014. "Inference of Gene Regulatory Networks Incorporating Multi-Source Biological Knowledge via a State Space Model with L1 Regularization," PLOS ONE, Public Library of Science, vol. 9(8), pages 1-19, August.
  • Handle: RePEc:plo:pone00:0105942
    DOI: 10.1371/journal.pone.0105942
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0105942
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0105942&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0105942?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. R. H. Shumway & D. S. Stoffer, 1982. "An Approach To Time Series Smoothing And Forecasting Using The Em Algorithm," Journal of Time Series Analysis, Wiley Blackwell, vol. 3(4), pages 253-264, July.
    2. Jeremiah J Faith & Boris Hayete & Joshua T Thaden & Ilaria Mogno & Jamey Wierzbowski & Guillaume Cottarel & Simon Kasif & James J Collins & Timothy S Gardner, 2007. "Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles," PLOS Biology, Public Library of Science, vol. 5(1), pages 1-13, January.
    3. Michael B. Elowitz & Stanislas Leibler, 2000. "A synthetic oscillatory network of transcriptional regulators," Nature, Nature, vol. 403(6767), pages 335-338, January.
    4. J. O. Ramsay & G. Hooker & D. Campbell & J. Cao, 2007. "Parameter estimation for differential equations: a generalized smoothing approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(5), pages 741-796, November.
    5. Xiaodian Sun & Li Jin & Momiao Xiong, 2008. "Extended Kalman Filter for Estimation of Parameters in Nonlinear State-Space Models of Biochemical Networks," PLOS ONE, Public Library of Science, vol. 3(11), pages 1-13, November.
    6. Gabriele Lillacci & Mustafa Khammash, 2010. "Parameter Estimation and Model Selection in Computational Biology," PLOS Computational Biology, Public Library of Science, vol. 6(3), pages 1-17, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gabriele Lillacci & Mustafa Khammash, 2010. "Parameter Estimation and Model Selection in Computational Biology," PLOS Computational Biology, Public Library of Science, vol. 6(3), pages 1-17, March.
    2. Afnizanfaizal Abdullah & Safaai Deris & Mohd Saberi Mohamad & Sohail Anwar, 2013. "An Improved Swarm Optimization for Parameter Estimation and Biological Model Selection," PLOS ONE, Public Library of Science, vol. 8(4), pages 1-16, April.
    3. Yong-Jun Shin & Ali H Sayed & Xiling Shen, 2012. "Adaptive Models for Gene Networks," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-6, February.
    4. González Javier & Vujačić Ivan & Wit Ernst, 2013. "Inferring latent gene regulatory network kinetics," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 12(1), pages 109-127, March.
    5. Mazzocchi, Mario, 2006. "Time patterns in UK demand for alcohol and tobacco: an application of the EM algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 50(9), pages 2191-2205, May.
    6. Jin Wang & Bo Huang & Xuefeng Xia & Zhirong Sun, 2006. "Funneled Landscape Leads to Robustness of Cell Networks: Yeast Cell Cycle," PLOS Computational Biology, Public Library of Science, vol. 2(11), pages 1-10, November.
    7. Qianwen Tan & Subhashis Ghosal, 2021. "Bayesian Analysis of Mixed-effect Regression Models Driven by Ordinary Differential Equations," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 3-29, May.
    8. Jaeger, Jonathan & Lambert, Philippe, 2012. "Bayesian penalized smoothing approaches in models specified using affine differential equations with unknown error distributions," LIDAM Discussion Papers ISBA 2012017, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    9. Matteo Barigozzi & Matteo Luciani, 2019. "Quasi Maximum Likelihood Estimation and Inference of Large Approximate Dynamic Factor Models via the EM algorithm," Papers 1910.03821, arXiv.org, revised Sep 2024.
    10. Zirogiannis, Nikolaos & Tripodis, Yorghos, 2013. "A Generalized Dynamic Factor Model for Panel Data: Estimation with a Two-Cycle Conditional Expectation-Maximization Algorithm," Working Paper Series 142752, University of Massachusetts, Amherst, Department of Resource Economics.
    11. Tobias Hartl & Roland Jucknewitz, 2022. "Approximate state space modelling of unobserved fractional components," Econometric Reviews, Taylor & Francis Journals, vol. 41(1), pages 75-98, January.
    12. Cao, Jiguo & Ramsay, James O., 2009. "Generalized profiling estimation for global and adaptive penalized spline smoothing," Computational Statistics & Data Analysis, Elsevier, vol. 53(7), pages 2550-2562, May.
    13. Hooker, Giles & Ramsay, James O. & Xiao, Luo, 2016. "CollocInfer: Collocation Inference in Differential Equation Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 75(i02).
    14. Joseph Ndong & Ted Soubdhan, 2022. "Extracting Statistical Properties of Solar and Photovoltaic Power Production for the Scope of Building a Sophisticated Forecasting Framework," Forecasting, MDPI, vol. 5(1), pages 1-21, December.
    15. David de Antonio Liedo, 2014. "Nowcasting Belgium," Working Paper Research 256, National Bank of Belgium.
    16. Proietti, Tommaso, 2008. "Estimation of Common Factors under Cross-Sectional and Temporal Aggregation Constraints: Nowcasting Monthly GDP and its Main Components," MPRA Paper 6860, University Library of Munich, Germany.
    17. Matteo Barigozzi & Marc Hallin, 2023. "Dynamic Factor Models: a Genealogy," Papers 2310.17278, arXiv.org, revised Jan 2024.
    18. Avraham E Mayo & Yaakov Setty & Seagull Shavit & Alon Zaslaver & Uri Alon, 2006. "Plasticity of the cis-Regulatory Input Function of a Gene," PLOS Biology, Public Library of Science, vol. 4(4), pages 1-1, March.
    19. Ankit Gupta & Mustafa Khammash, 2022. "Frequency spectra and the color of cellular noise," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    20. Alexander Tsyplakov, 2011. "An introduction to state space modeling (in Russian)," Quantile, Quantile, issue 9, pages 1-24, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0105942. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.