IDEAS home Printed from https://ideas.repec.org/a/taf/jnlasa/v107y2012i500p1259-1271.html
   My bibliography  Save this article

A Bayesian Model of NMR Spectra for the Deconvolution and Quantification of Metabolites in Complex Biological Mixtures

Author

Listed:
  • William Astle
  • Maria De Iorio
  • Sylvia Richardson
  • David Stephens
  • Timothy Ebbels

Abstract

Nuclear magnetic resonance (NMR) spectra are widely used in metabolomics to obtain profiles of metabolites dissolved in biofluids such as cell supernatants. Methods for estimating metabolite concentrations from these spectra are presently confined to manual peak fitting and to binning procedures for integrating resonance peaks. Extensive information on the patterns of spectral resonance generated by human metabolites is now available in online databases. By incorporating this information into a Bayesian model, we can deconvolve resonance peaks from a spectrum and obtain explicit concentration estimates for the corresponding metabolites. Spectral resonances that cannot be deconvolved in this way may also be of scientific interest; so, we model them jointly using wavelets. We describe a Markov chain Monte Carlo algorithm that allows us to sample from the joint posterior distribution of the model parameters, using specifically designed block updates to improve mixing. The strong prior on resonance patterns allows the algorithm to identify peaks corresponding to particular metabolites automatically, eliminating the need for manual peak assignment. We assess our method for peak alignment and concentration estimation. Except in cases when the target resonance signal is very weak, alignment is unbiased and precise. We compare the Bayesian concentration estimates with those obtained from a conventional numerical integration method and find that our point estimates have six-fold lower mean squared error. Finally, we apply our method to a spectral dataset taken from an investigation of the metabolic response of yeast to recombinant protein expression. We estimate the concentrations of 26 metabolites and compare with manual quantification by five expert spectroscopists. We discuss the reason for discrepancies and the robustness of our method's concentration estimates. This article has supplementary materials online.

Suggested Citation

  • William Astle & Maria De Iorio & Sylvia Richardson & David Stephens & Timothy Ebbels, 2012. "A Bayesian Model of NMR Spectra for the Deconvolution and Quantification of Metabolites in Complex Biological Mixtures," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1259-1271, December.
  • Handle: RePEc:taf:jnlasa:v:107:y:2012:i:500:p:1259-1271
    DOI: 10.1080/01621459.2012.695661
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/01621459.2012.695661
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/01621459.2012.695661?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ajay Jasra & David A. Stephens & Christopher C. Holmes, 2007. "Population-Based Reversible Jump Markov Chain Monte Carlo," Biometrika, Biometrika Trust, vol. 94(4), pages 787-807.
    2. Elaine Holmes & Ruey Leng Loo & Jeremiah Stamler & Magda Bictash & Ivan K. S. Yap & Queenie Chan & Tim Ebbels & Maria De Iorio & Ian J. Brown & Kirill A. Veselkov & Martha L. Daviglus & Hugo Kesteloot, 2008. "Human metabolic phenotype diversity and its association with diet and blood pressure," Nature, Nature, vol. 453(7193), pages 396-400, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Spezia, Luigi, 2020. "Bayesian variable selection in non-homogeneous hidden Markov models through an evolutionary Monte Carlo method," Computational Statistics & Data Analysis, Elsevier, vol. 143(C).
    2. Edoardo Saccenti & Leonardo Tenori & Paul Verbruggen & Marieke E Timmerman & Jildau Bouwman & Jan van der Greef & Claudio Luchinat & Age K Smilde, 2014. "Of Monkeys and Men: A Metabolomic Analysis of Static and Dynamic Urinary Metabolic Phenotypes in Two Species," PLOS ONE, Public Library of Science, vol. 9(9), pages 1-13, September.
    3. Eslami, Seyed Pouyan & Ghasemaghaei, Maryam, 2018. "Effects of online review positiveness and review score inconsistency on sales: A comparison by product involvement," Journal of Retailing and Consumer Services, Elsevier, vol. 45(C), pages 74-80.
    4. Shinji Fukuda & Yumiko Nakanishi & Eisuke Chikayama & Hiroshi Ohno & Tsuneo Hino & Jun Kikuchi, 2009. "Evaluation and Characterization of Bacterial Metabolic Dynamics with a Novel Profiling Technique, Real-Time Metabolotyping," PLOS ONE, Public Library of Science, vol. 4(3), pages 1-10, March.
    5. McGrory, C.A. & Pettitt, A.N. & Faddy, M.J., 2009. "A fully Bayesian approach to inference for Coxian phase-type distributions with covariate dependent mean," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4311-4321, October.
    6. Athanassios Petralias & Pródromos Prodromídis, 2015. "Price discovery under crisis: uncovering the determinant factors of prices using efficient Bayesian model selection methods," Empirical Economics, Springer, vol. 49(3), pages 859-879, November.
    7. Singer Burton, 2011. "Genome-Phenome Linkages in Human Population Surveys, with Special Emphasis on the Health and Retirement Survey," Forum for Health Economics & Policy, De Gruyter, vol. 14(3), pages 1-24, April.
    8. Pandolfi, Silvia & Bartolucci, Francesco & Friel, Nial, 2014. "A generalized multiple-try version of the Reversible Jump algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 298-314.
    9. Enrico Petretto & Leonardo Bottolo & Sarah R Langley & Matthias Heinig & Chris McDermott-Roe & Rizwan Sarwar & Michal Pravenec & Norbert Hübner & Timothy J Aitman & Stuart A Cook & Sylvia Richardson, 2010. "New Insights into the Genetic Control of Gene Expression using a Bayesian Multi-tissue Approach," PLOS Computational Biology, Public Library of Science, vol. 6(4), pages 1-13, April.
    10. Marco Di Zio & Ugo Guarnera, 2008. "A multiple imputation method for non-Gaussian data," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(1), pages 75-90.
    11. Chiara Roberta Girelli & Laura Del Coco & Francesco Paolo Fanizzi, 2017. "Tunisian Extra Virgin Olive Oil Traceability in the EEC Market: Tunisian/Italian (Coratina) EVOOs Blend as a Case Study," Sustainability, MDPI, vol. 9(8), pages 1-11, August.
    12. Komárek, Arnost, 2009. "A new R package for Bayesian estimation of multivariate normal mixtures allowing for selection of the number of components and interval-censored data," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 3932-3947, October.
    13. Li Ma, 2015. "Scalable Bayesian Model Averaging Through Local Information Propagation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 795-809, June.
    14. Jasra, Ajay & Doucet, Arnaud & Stephens, David A. & Holmes, Christopher C., 2008. "Interacting sequential Monte Carlo samplers for trans-dimensional simulation," Computational Statistics & Data Analysis, Elsevier, vol. 52(4), pages 1765-1791, January.
    15. Fabrizio Leisen & Roberto Casarin & David Luengo & Luca Martino, 2013. "Adaptive Sticky Generalized Metropolis," Working Papers 2013:19, Department of Economics, University of Venice "Ca' Foscari".
    16. Rigat, F. & Mira, A., 2012. "Parallel hierarchical sampling: A general-purpose interacting Markov chains Monte Carlo algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1450-1467.
    17. Lefebvre, Geneviève & Steele, Russell & Vandal, Alain C., 2010. "A path sampling identity for computing the Kullback-Leibler and J divergences," Computational Statistics & Data Analysis, Elsevier, vol. 54(7), pages 1719-1731, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:jnlasa:v:107:y:2012:i:500:p:1259-1271. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/UASA20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.