IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v143y2020ics0167947319302014.html
   My bibliography  Save this article

Bayesian modeling and computation for analyte quantification in complex mixtures using Raman spectroscopy

Author

Listed:
  • Han, Ningren
  • Ram, Rajeev J.

Abstract

A two-stage algorithm based on Bayesian modeling and computation for quantifying analyte concentration in complex mixtures with Raman spectroscopy is proposed. A hierarchical Bayesian model is constructed for spectral signal analysis, and reversible-jump Markov chain Monte Carlo (RJMCMC) computation is carried out for model selection and spectral variable estimation. Processing is performed in two stages. In the first stage, the peak representation for a target analyte spectrum is learned. In the second, the peak variables learned from the first stage are used to estimate the concentration of the target analyte in a mixture. Numerical experiments validated the performance over a wide range of simulation conditions and established the algorithm accuracy over conventional multivariate regression algorithms for analyte quantification (when constrained to a small training sample size). In addition, the algorithm was applied to analyze experimental spontaneous Raman spectroscopy data collected for glucose concentration estimation in a biopharmaceutical process monitoring application. The results show that this algorithm can be a promising complementary tool alongside conventional multivariate regression algorithms in Raman spectroscopy-based mixture quantification studies, especially when collection of a large training dataset is challenging or resource-intensive.

Suggested Citation

  • Han, Ningren & Ram, Rajeev J., 2020. "Bayesian modeling and computation for analyte quantification in complex mixtures using Raman spectroscopy," Computational Statistics & Data Analysis, Elsevier, vol. 143(C).
  • Handle: RePEc:eee:csdana:v:143:y:2020:i:c:s0167947319302014
    DOI: 10.1016/j.csda.2019.106846
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947319302014
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2019.106846?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sylvia. Richardson & Peter J. Green, 1997. "On Bayesian Analysis of Mixtures with an Unknown Number of Components (with discussion)," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(4), pages 731-792.
    2. McGrory, C.A. & Pettitt, A.N. & Titterington, D.M. & Alston, C.L. & Kelly, M., 2016. "Transdimensional sequential Monte Carlo using variational Bayes — SMCVB," Computational Statistics & Data Analysis, Elsevier, vol. 93(C), pages 246-254.
    3. Mingjun Zhong & Mark Girolami & Karen Faulds & Duncan Graham, 2011. "Bayesian methods to detect dye‐labelled DNA oligonucleotides in multiplexed Raman spectra," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 60(2), pages 187-206, March.
    4. David I. Hastie & Peter J. Green, 2012. "Model choice using reversible jump Markov chain Monte Carlo," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 66(3), pages 309-338, August.
    5. Jeffrey W. Miller & Matthew T. Harrison, 2018. "Mixture Models With a Prior on the Number of Components," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(521), pages 340-356, January.
    6. Liang, Feng & Paulo, Rui & Molina, German & Clyde, Merlise A. & Berger, Jim O., 2008. "Mixtures of g Priors for Bayesian Variable Selection," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 410-423, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Im, Yunju & Tan, Aixin, 2021. "Bayesian subgroup analysis in regression using mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 162(C).
    2. Sylvia Frühwirth-Schnatter & Gertraud Malsiner-Walli, 2019. "From here to infinity: sparse finite versus Dirichlet process mixtures in model-based clustering," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(1), pages 33-64, March.
    3. Caio Waisman, 2024. "Bayesian estimation of finite mixtures of Tobit models," Papers 2411.09771, arXiv.org.
    4. Creal, Drew & Kim, Jaeho, 2024. "Bayesian estimation of cluster covariance matrices of unknown form," Journal of Econometrics, Elsevier, vol. 241(1).
    5. Louise Alamichel & Daria Bystrova & Julyan Arbel & Guillaume Kon Kam King, 2024. "Bayesian mixture models (in)consistency for the number of clusters," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 51(4), pages 1619-1660, December.
    6. You, Na & Dai, Hongsheng & Wang, Xueqin & Yu, Qingyun, 2024. "Sequential estimation for mixture of regression models for heterogeneous population," Computational Statistics & Data Analysis, Elsevier, vol. 194(C).
    7. Bettina Grün & Gertraud Malsiner-Walli & Sylvia Frühwirth-Schnatter, 2022. "How many data clusters are in the Galaxy data set?," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 16(2), pages 325-349, June.
    8. Grazian, Clara & Villa, Cristiano & Liseo, Brunero, 2020. "On a loss-based prior for the number of components in mixture models," Statistics & Probability Letters, Elsevier, vol. 158(C).
    9. Domenico Giannone & Michele Lenza & Lucrezia Reichlin, 2011. "Market Freedom and the Global Recession," IMF Economic Review, Palgrave Macmillan;International Monetary Fund, vol. 59(1), pages 111-135, April.
    10. Riccardo (Jack) Lucchetti & Luca Pedini, 2020. "ParMA: Parallelised Bayesian Model Averaging for Generalised Linear Models," Working Papers 2020:28, Department of Economics, University of Venice "Ca' Foscari".
    11. Shuang Zhang & Xingdong Feng, 2022. "Distributed identification of heterogeneous treatment effects," Computational Statistics, Springer, vol. 37(1), pages 57-89, March.
    12. Jiao Jieying & Hu Guanyu & Yan Jun, 2021. "A Bayesian marked spatial point processes model for basketball shot chart," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 17(2), pages 77-90, June.
    13. Li, Feng & Kang, Yanfei, 2018. "Improving forecasting performance using covariate-dependent copula models," International Journal of Forecasting, Elsevier, vol. 34(3), pages 456-476.
    14. Ons Jedidi & Jean Sébastien Pentecote, 2015. "Robust Signals for Banking Crises," Economics Bulletin, AccessEcon, vol. 35(3), pages 1617-1629.
    15. Anna Sokolova, 2023. "Marginal Propensity to Consume and Unemployment: a Meta-analysis," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 51, pages 813-846, December.
    16. Sik-Yum Lee, 2006. "Bayesian Analysis of Nonlinear Structural Equation Models with Nonignorable Missing Data," Psychometrika, Springer;The Psychometric Society, vol. 71(3), pages 541-564, September.
    17. Fisher, Mark & Jensen, Mark J., 2022. "Bayesian nonparametric learning of how skill is distributed across the mutual fund industry," Journal of Econometrics, Elsevier, vol. 230(1), pages 131-153.
    18. Hasan, Iftekhar & Horvath, Roman & Mares, Jan, 2020. "Finance and wealth inequality," Journal of International Money and Finance, Elsevier, vol. 108(C).
    19. Mariam Camarero & Sergi Moliner & Cecilio Tamarit, 2021. "Is there a euro effect in the drivers of US FDI? New evidence using Bayesian model averaging techniques," Review of World Economics (Weltwirtschaftliches Archiv), Springer;Institut für Weltwirtschaft (Kiel Institute for the World Economy), vol. 157(4), pages 881-926, November.
    20. Ley, Eduardo & Steel, Mark F.J., 2012. "Mixtures of g-priors for Bayesian model averaging with economic applications," Journal of Econometrics, Elsevier, vol. 171(2), pages 251-266.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:143:y:2020:i:c:s0167947319302014. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.