IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v75y2019i1p193-201.html
   My bibliography  Save this article

Bayesian additive adaptive basis tensor product models for modeling high dimensional surfaces: an application to high‐throughput toxicity testing

Author

Listed:
  • Matthew W. Wheeler

Abstract

Many modern datasets are sampled with error from complex high‐dimensional surfaces. Methods such as tensor product splines or Gaussian processes are effective and well suited for characterizing a surface in two or three dimensions, but they may suffer from difficulties when representing higher dimensional surfaces. Motivated by high throughput toxicity testing where observed dose‐response curves are cross sections of a surface defined by a chemical's structural properties, a model is developed to characterize this surface to predict untested chemicals’ dose‐responses. This manuscript proposes a novel approach that models the multidimensional surface as a sum of learned basis functions formed as the tensor product of lower dimensional functions, which are themselves representable by a basis expansion learned from the data. The model is described and a Gibbs sampling algorithm is proposed. The approach is investigated in a simulation study and through data taken from the US EPA's ToxCast high throughput toxicity testing platform.

Suggested Citation

  • Matthew W. Wheeler, 2019. "Bayesian additive adaptive basis tensor product models for modeling high dimensional surfaces: an application to high‐throughput toxicity testing," Biometrics, The International Biometric Society, vol. 75(1), pages 193-201, March.
  • Handle: RePEc:bla:biomet:v:75:y:2019:i:1:p:193-201
    DOI: 10.1111/biom.12942
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.12942
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.12942?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Nicholas G. Polson & James G. Scott & Jesse Windle, 2013. "Bayesian Inference for Logistic Models Using Pólya--Gamma Latent Variables," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1339-1349, December.
    2. Gramacy, Robert B & Lee, Herbert K. H, 2008. "Bayesian Treed Gaussian Process Models With an Application to Computer Modeling," Journal of the American Statistical Association, American Statistical Association, vol. 103(483), pages 1119-1130.
    3. Aurore Delaigle & Peter Hall, 2013. "Classification Using Censored Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1269-1283, December.
    4. A. Bhattacharya & D. B. Dunson, 2011. "Sparse Bayesian infinite factor models," Biometrika, Biometrika Trust, vol. 98(2), pages 291-306.
    5. Silvia Montagna & Surya T. Tokdar & Brian Neelon & David B. Dunson, 2012. "Bayesian Latent Factor Regression for Functional and Longitudinal Data," Biometrics, The International Biometric Society, vol. 68(4), pages 1064-1073, December.
    6. Gramacy, Robert B., 2007. "tgp: An R Package for Bayesian Nonstationary, Semiparametric Nonlinear Regression and Design by Treed Gaussian Process Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 19(i09).
    7. Anjishnu Banerjee & David B. Dunson & Surya T. Tokdar, 2013. "Efficient Gaussian process regression for large datasets," Biometrika, Biometrika Trust, vol. 100(1), pages 75-89.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Durante, Daniele, 2017. "A note on the multiplicative gamma process," Statistics & Probability Letters, Elsevier, vol. 122(C), pages 198-204.
    2. Erickson, Collin B. & Ankenman, Bruce E. & Sanchez, Susan M., 2018. "Comparison of Gaussian process modeling software," European Journal of Operational Research, Elsevier, vol. 266(1), pages 179-192.
    3. Monterrubio-Gómez, Karla & Roininen, Lassi & Wade, Sara & Damoulas, Theodoros & Girolami, Mark, 2020. "Posterior inference for sparse hierarchical non-stationary models," Computational Statistics & Data Analysis, Elsevier, vol. 148(C).
    4. Kelly R. Moran & Matthew W. Wheeler, 2022. "Fast increased fidelity samplers for approximate Bayesian Gaussian process regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(4), pages 1198-1228, September.
    5. Daewon Yang & Taeryon Choi & Eric Lavigne & Yeonseung Chung, 2022. "Non‐parametric Bayesian covariate‐dependent multivariate functional clustering: An application to time‐series data for multiple air pollutants," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1521-1542, November.
    6. Horiguchi, Akira & Pratola, Matthew T. & Santner, Thomas J., 2021. "Assessing variable activity for Bayesian regression trees," Reliability Engineering and System Safety, Elsevier, vol. 207(C).
    7. MacDonald, Blake & Ranjan, Pritam & Chipman, Hugh, 2015. "GPfit: An R Package for Fitting a Gaussian Process Model to Deterministic Simulator Outputs," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 64(i12).
    8. Daniel W. Gladish & Daniel E. Pagendam & Luk J. M. Peeters & Petra M. Kuhnert & Jai Vaze, 2018. "Emulation Engines: Choice and Quantification of Uncertainty for Complex Hydrological Models," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 23(1), pages 39-62, March.
    9. Davis, Casey B. & Hans, Christopher M. & Santner, Thomas J., 2021. "Prediction of non-stationary response functions using a Bayesian composite Gaussian process," Computational Statistics & Data Analysis, Elsevier, vol. 154(C).
    10. Veronika Ročková & Edward I. George, 2016. "Fast Bayesian Factor Analysis via Automatic Rotations to Sparsity," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1608-1622, October.
    11. Daniel R. Kowal & Antonio Canale, 2021. "Semiparametric Functional Factor Models with Bayesian Rank Selection," Papers 2108.02151, arXiv.org, revised May 2022.
    12. Patric Dolmeta & Raffaele Argiento & Silvia Montagna, 2023. "Bayesian GARCH modeling of functional sports data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 32(2), pages 401-423, June.
    13. Pantelis Samartsidis & Shaun R. Seaman & Silvia Montagna & André Charlett & Matthew Hickman & Daniela De Angelis, 2020. "A Bayesian multivariate factor analysis model for evaluating an intervention by using observational time series data on multiple outcomes," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(4), pages 1437-1459, October.
    14. Daniele Durante & David B. Dunson & Joshua T. Vogelstein, 2017. "Nonparametric Bayes Modeling of Populations of Networks," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(520), pages 1516-1530, October.
    15. Waley W. J. Liang & Herbert K. H. Lee, 2019. "Bayesian nonstationary Gaussian process models via treed process convolutions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(3), pages 797-818, September.
    16. Durante, Daniele & Dunson, David B., 2014. "Bayesian dynamic financial networks with time-varying predictors," Statistics & Probability Letters, Elsevier, vol. 93(C), pages 19-26.
    17. Samuel W. Malone & Robert B. Gramacy & Enrique Ter Horst, 2016. "Timing Foreign Exchange Markets," Econometrics, MDPI, vol. 4(1), pages 1-23, March.
    18. Daniele Durante & Sally Paganin & Bruno Scarpa & David B. Dunson, 2017. "Bayesian modelling of networks in complex business intelligence problems," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(3), pages 555-580, April.
    19. Buddhavarapu, Prasad & Bansal, Prateek & Prozzi, Jorge A., 2021. "A new spatial count data model with time-varying parameters," Transportation Research Part B: Methodological, Elsevier, vol. 150(C), pages 566-586.
    20. Niko Hauzenberger & Florian Huber, 2020. "Model instability in predictive exchange rate regressions," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(2), pages 168-186, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:75:y:2019:i:1:p:193-201. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.