IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1005234.html
   My bibliography  Save this article

Sparse Regression Based Structure Learning of Stochastic Reaction Networks from Single Cell Snapshot Time Series

Author

Listed:
  • Anna Klimovskaia
  • Stefan Ganscha
  • Manfred Claassen

Abstract

Stochastic chemical reaction networks constitute a model class to quantitatively describe dynamics and cell-to-cell variability in biological systems. The topology of these networks typically is only partially characterized due to experimental limitations. Current approaches for refining network topology are based on the explicit enumeration of alternative topologies and are therefore restricted to small problem instances with almost complete knowledge. We propose the reactionet lasso, a computational procedure that derives a stepwise sparse regression approach on the basis of the Chemical Master Equation, enabling large-scale structure learning for reaction networks by implicitly accounting for billions of topology variants. We have assessed the structure learning capabilities of the reactionet lasso on synthetic data for the complete TRAIL induced apoptosis signaling cascade comprising 70 reactions. We find that the reactionet lasso is able to efficiently recover the structure of these reaction systems, ab initio, with high sensitivity and specificity. With only 6000 possible reactions and over 102000 network topologies. In conjunction with information rich single cell technologies such as single cell RNA sequencing or mass cytometry, the reactionet lasso will enable large-scale structure learning, particularly in areas with partial network structure knowledge, such as cancer biology, and thereby enable the detection of pathological alterations of reaction networks. We provide software to allow for wide applicability of the reactionet lasso.Author Summary: Virtually all biological processes are driven by biochemical reactions. However, their quantitative description in terms of stochastic chemical reaction networks is often precluded by the computational difficulty of structure learning, i.e. the identification of biologically active reaction networks among the combinatorially many possible topologies. This work describes the reactionet lasso, a structure learning approach that takes advantage of novel, information-rich single cell data and a tractable problem formulation to achieve structure learning for problem instances hundreds of orders of magnitude larger than previously reported. This approach opens the prospect of obtaining quantitative and predictive reaction models in many areas of biology and medicine, and in particular areas such as cancer biology, which are characterized by significant system alterations and many unknown reactions.

Suggested Citation

  • Anna Klimovskaia & Stefan Ganscha & Manfred Claassen, 2016. "Sparse Regression Based Structure Learning of Stochastic Reaction Networks from Single Cell Snapshot Time Series," PLOS Computational Biology, Public Library of Science, vol. 12(12), pages 1-20, December.
  • Handle: RePEc:plo:pcbi00:1005234
    DOI: 10.1371/journal.pcbi.1005234
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005234
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1005234&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1005234?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Robert Tibshirani, 2011. "Regression shrinkage and selection via the lasso: a retrospective," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(3), pages 273-282, June.
    2. J. O. Ramsay & G. Hooker & D. Campbell & J. Cao, 2007. "Parameter estimation for differential equations: a generalized smoothing approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(5), pages 741-796, November.
    3. Andreas Raue & Marcel Schilling & Julie Bachmann & Andrew Matteson & Max Schelke & Daniel Kaschek & Sabine Hug & Clemens Kreutz & Brian D Harms & Fabian J Theis & Ursula Klingmüller & Jens Timmer, 2013. "Lessons Learned from Quantitative Dynamical Modeling in Systems Biology," PLOS ONE, Public Library of Science, vol. 8(9), pages 1-17, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wei, Baolei, 2022. "Sparse dynamical system identification with simultaneous structural parameters and initial condition estimation," Chaos, Solitons & Fractals, Elsevier, vol. 165(P2).
    2. Lucian Belascu & Alexandra Horobet & Georgiana Vrinceanu & Consuela Popescu, 2021. "Performance Dissimilarities in European Union Manufacturing: The Effect of Ownership and Technological Intensity," Sustainability, MDPI, vol. 13(18), pages 1-19, September.
    3. Qianwen Tan & Subhashis Ghosal, 2021. "Bayesian Analysis of Mixed-effect Regression Models Driven by Ordinary Differential Equations," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 3-29, May.
    4. Jaeger, Jonathan & Lambert, Philippe, 2012. "Bayesian penalized smoothing approaches in models specified using affine differential equations with unknown error distributions," LIDAM Discussion Papers ISBA 2012017, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    5. Cao, Jiguo & Ramsay, James O., 2009. "Generalized profiling estimation for global and adaptive penalized spline smoothing," Computational Statistics & Data Analysis, Elsevier, vol. 53(7), pages 2550-2562, May.
    6. Alberti, Federica & Mantilla, César, 2020. "Provision of noxious facilities using a market-like mechanism: A simple implementation in the lab," Working papers 35, Red Investigadores de Economía.
    7. Hooker, Giles & Ramsay, James O. & Xiao, Luo, 2016. "CollocInfer: Collocation Inference in Differential Equation Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 75(i02).
    8. Camila Epprecht & Dominique Guegan & Álvaro Veiga & Joel Correa da Rosa, 2017. "Variable selection and forecasting via automated methods for linear models: LASSO/adaLASSO and Autometrics," Post-Print halshs-00917797, HAL.
    9. Sandro Radovanovic & Boris Delibasic & Milija Suknovic & Dajana Matovic, 2019. "Where will the next ski injury occur? A system for visual and predictive analytics of ski injuries," Operational Research, Springer, vol. 19(4), pages 973-992, December.
    10. Peter Martey Addo & Dominique Guegan & Bertrand Hassani, 2018. "Credit Risk Analysis Using Machine and Deep Learning Models," Risks, MDPI, vol. 6(2), pages 1-20, April.
    11. Zhang, Guike & Gao, Zengan & Dong, June & Mei, Dexiang, 2023. "Machine learning approaches for constructing the national anti-money laundering index," Finance Research Letters, Elsevier, vol. 52(C).
    12. Gianluca Frasso & Jonathan Jaeger & Philippe Lambert, 2016. "Parameter estimation and inference in dynamic systems described by linear partial differential equations," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 100(3), pages 259-287, July.
    13. Lee Anthony & Caron Francois & Doucet Arnaud & Holmes Chris, 2012. "Bayesian Sparsity-Path-Analysis of Genetic Association Signal using Generalized t Priors," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(2), pages 1-31, January.
    14. Dexin Chen & Meiting Fu & Liangjie Chi & Liyan Lin & Jiaxin Cheng & Weisong Xue & Chenyan Long & Wei Jiang & Xiaoyu Dong & Jian Sui & Dajia Lin & Jianping Lu & Shuangmu Zhuo & Side Liu & Guoxin Li & G, 2022. "Prognostic and predictive value of a pathomics signature in gastric cancer," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    15. Sokbae Lee & Myung Hwan Seo & Youngki Shin, 2016. "The lasso for high dimensional regression with a possible change point," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(1), pages 193-210, January.
    16. Hautsch, Nikolaus & Okhrin, Ostap & Ristig, Alexander, 2014. "Efficient iterative maximum likelihood estimation of high-parameterized time series models," SFB 649 Discussion Papers 2014-010, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
    17. Valdemar Melicher & Tom Haber & Wim Vanroose, 2017. "Fast derivatives of likelihood functionals for ODE based models using adjoint-state method," Computational Statistics, Springer, vol. 32(4), pages 1621-1643, December.
    18. Jin, Shaobo & Moustaki, Irini & Yang-Wallentin, Fan, 2018. "Approximated penalized maximum likelihood for exploratory factor analysis: an orthogonal case," LSE Research Online Documents on Economics 88118, London School of Economics and Political Science, LSE Library.
    19. repec:hum:wpaper:sfb649dp2014-010 is not listed on IDEAS
    20. Jin, Ding & Thube, Sneha Dattatraya & Hedtrich, Johannes & Henning, Christian & Delzeit, Ruth, 2019. "A Baseline Calibration Procedure for CGE models: An Application for DART," Conference papers 333057, Purdue University, Center for Global Trade Analysis, Global Trade Analysis Project.
    21. Hettihewa, Samanthala & Saha, Shrabani & Zhang, Hanxiong, 2018. "Does an aging population influence stock markets? Evidence from New Zealand," Economic Modelling, Elsevier, vol. 75(C), pages 142-158.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1005234. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.