IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v71y2009i4p755-782.html
   My bibliography  Save this article

Hybrid Dirichlet mixture models for functional data

Author

Listed:
  • Sonia Petrone
  • Michele Guindani
  • Alan E. Gelfand

Abstract

Summary. In functional data analysis, curves or surfaces are observed, up to measurement error, at a finite set of locations, for, say, a sample of n individuals. Often, the curves are homogeneous, except perhaps for individual‐specific regions that provide heterogeneous behaviour (e.g. ‘damaged’ areas of irregular shape on an otherwise smooth surface). Motivated by applications with functional data of this nature, we propose a Bayesian mixture model, with the aim of dimension reduction, by representing the sample of n curves through a smaller set of canonical curves. We propose a novel prior on the space of probability measures for a random curve which extends the popular Dirichlet priors by allowing local clustering: non‐homogeneous portions of a curve can be allocated to different clusters and the n individual curves can be represented as recombinations (hybrids) of a few canonical curves. More precisely, the prior proposed envisions a conceptual hidden factor with k‐levels that acts locally on each curve. We discuss several models incorporating this prior and illustrate its performance with simulated and real data sets. We examine theoretical properties of the proposed finite hybrid Dirichlet mixtures, specifically, their behaviour as the number of the mixture components goes to ∞ and their connection with Dirichlet process mixtures.

Suggested Citation

  • Sonia Petrone & Michele Guindani & Alan E. Gelfand, 2009. "Hybrid Dirichlet mixture models for functional data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(4), pages 755-782, September.
  • Handle: RePEc:bla:jorssb:v:71:y:2009:i:4:p:755-782
    DOI: 10.1111/j.1467-9868.2009.00708.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9868.2009.00708.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1467-9868.2009.00708.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Bigelow, Jamie L. & Dunson, David B., 2009. "Bayesian Semiparametric Joint Models for Functional Predictors," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 26-36.
    2. Teh, Yee Whye & Jordan, Michael I. & Beal, Matthew J. & Blei, David M., 2006. "Hierarchical Dirichlet Processes," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1566-1581, December.
    3. Gelfand, Alan E. & Kottas, Athanasios & MacEachern, Steven N., 2005. "Bayesian Nonparametric Spatial Modeling With Dirichlet Process Mixing," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 1021-1035, September.
    4. repec:dau:papers:123456789/1906 is not listed on IDEAS
    5. Dunson, David B. & Xue, Ya & Carin, Lawrence, 2008. "The Matrix Stick-Breaking Process: Flexible Bayes Meta-Analysis," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 317-327, March.
    6. Jeremy Oakley, 2002. "Bayesian inference for the uncertainty distribution of computer model outputs," Biometrika, Biometrika Trust, vol. 89(4), pages 769-784, December.
    7. Griffin, J.E. & Steel, M.F.J., 2006. "Order-Based Dependent Dirichlet Processes," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 179-194, March.
    8. Shubhankar Ray & Bani Mallick, 2006. "Functional clustering by Bayesian wavelet methods," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(2), pages 305-332, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Igor Prünster & Matteo Ruggiero, 2011. "A Bayesian nonparametric approach to modeling market share dynamics," Carlo Alberto Notebooks 217, Collegio Carlo Alberto.
    2. Stefano Favaro & Antonio Lijoi & Igor Prünster, 2012. "On the stick–breaking representation of normalized inverse Gaussian priors," DEM Working Papers Series 008, University of Pavia, Department of Economics and Management.
    3. Silvia Montagna & Surya T. Tokdar & Brian Neelon & David B. Dunson, 2012. "Bayesian Latent Factor Regression for Functional and Longitudinal Data," Biometrics, The International Biometric Society, vol. 68(4), pages 1064-1073, December.
    4. Peter Müeller & Fernando A. Quintana & Garritt Page, 2018. "Nonparametric Bayesian inference in applications," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(2), pages 175-206, June.
    5. Roy, Arkaprava & Ghosal, Subhashis, 2022. "Optimal Bayesian smoothing of functional observations over a large graph," Journal of Multivariate Analysis, Elsevier, vol. 189(C).
    6. Michele Guindani & Peter Müller & Song Zhang, 2009. "A Bayesian discovery procedure," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(5), pages 905-925, November.
    7. Christoph Hellmayr & Alan E. Gelfand, 2021. "A Partition Dirichlet Process Model for Functional Data Analysis," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 30-65, May.
    8. Lian, Heng & Choi, Taeryon & Meng, Jie & Jo, Seongil, 2016. "Posterior convergence for Bayesian functional linear regression," Journal of Multivariate Analysis, Elsevier, vol. 150(C), pages 27-41.
    9. Zahra Barzegar & Firoozeh Rivaz, 2020. "A scalable Bayesian nonparametric model for large spatio-temporal data," Computational Statistics, Springer, vol. 35(1), pages 153-173, March.
    10. Philip A. White & Alan E. Gelfand, 2021. "Multivariate functional data modeling with time-varying clustering," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(3), pages 586-602, September.
    11. Stefano Favaro & Igor Prünster & Stephen G. Walker, 2012. "On a Generalized Chu–Vandermonde Identity," Methodology and Computing in Applied Probability, Springer, vol. 14(2), pages 253-262, June.
    12. Bruno Scarpa & David B. Dunson, 2014. "Enriched Stick-Breaking Processes for Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 647-660, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bruno Scarpa & David B. Dunson, 2014. "Enriched Stick-Breaking Processes for Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 647-660, June.
    2. Bruno Scarpa & David B. Dunson, 2009. "Bayesian Hierarchical Functional Data Analysis Via Contaminated Informative Priors," Biometrics, The International Biometric Society, vol. 65(3), pages 772-780, September.
    3. Bassetti, Federico & Casarin, Roberto & Leisen, Fabrizio, 2014. "Beta-product dependent Pitman–Yor processes for Bayesian inference," Journal of Econometrics, Elsevier, vol. 180(1), pages 49-72.
    4. González, Jorge & Barrientos, Andrés F. & Quintana, Fernando A., 2015. "Bayesian nonparametric estimation of test equating functions with covariates," Computational Statistics & Data Analysis, Elsevier, vol. 89(C), pages 222-244.
    5. Bassetti, Federico & Casarin, Roberto & Leisen, Fabrizio, 2011. "Beta-product Poisson-Dirichlet Processes," DES - Working Papers. Statistics and Econometrics. WS 12160, Universidad Carlos III de Madrid. Departamento de Estadística.
    6. Jin, Xin & Maheu, John M., 2016. "Bayesian semiparametric modeling of realized covariance matrices," Journal of Econometrics, Elsevier, vol. 192(1), pages 19-39.
    7. Stefano Favaro & Antonio Lijoi & Igor Prünster, 2012. "On the stick–breaking representation of normalized inverse Gaussian priors," DEM Working Papers Series 008, University of Pavia, Department of Economics and Management.
    8. Pati, Debdeep & Dunson, David B. & Tokdar, Surya T., 2013. "Posterior consistency in conditional distribution estimation," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 456-472.
    9. Michael L. Pennell & David B. Dunson, 2008. "Nonparametric Bayes Testing of Changes in a Response Distribution with an Ordinal Predictor," Biometrics, The International Biometric Society, vol. 64(2), pages 413-423, June.
    10. Robert M. Dorazio & Bhramar Mukherjee & Li Zhang & Malay Ghosh & Howard L. Jelks & Frank Jordan, 2008. "Modeling Unobserved Sources of Heterogeneity in Animal Abundance Using a Dirichlet Process Prior," Biometrics, The International Biometric Society, vol. 64(2), pages 635-644, June.
    11. Shamsi Zamenjani, Azam, 2021. "Do financial variables help predict the conditional distribution of the market portfolio?," Journal of Empirical Finance, Elsevier, vol. 62(C), pages 327-345.
    12. Sudhir Voleti & Praveen K. Kopalle & Pulak Ghosh, 2015. "An Interproduct Competition Model Incorporating Branding Hierarchy and Product Similarities Using Store-Level Data," Management Science, INFORMS, vol. 61(11), pages 2720-2738, November.
    13. Brian J. Reich & Dipankar Bandyopadhyay & Howard D. Bondell, 2013. "A Nonparametric Spatial Model for Periodontal Data With Nonrandom Missingness," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(503), pages 820-831, September.
    14. Kiranmoy Das & Bhuvanesh Pareek & Sarah Brown & Pulak Ghosh, 2017. "A Semiparametric Bayesian Approach to a New Dynamic Zero-Inflated Model," Working Papers 2017001, The University of Sheffield, Department of Economics.
    15. Michele Guindani & Wesley O. Johnson, 2018. "More nonparametric Bayesian inference in applications," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 27(2), pages 239-251, June.
    16. J. E. Griffin & M. Kolossiatis & M. F. J. Steel, 2013. "Comparing distributions by using dependent normalized random-measure mixtures," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(3), pages 499-529, June.
    17. Leisen, Fabrizio & Lijoi, Antonio, 2011. "Vectors of two-parameter Poisson-Dirichlet processes," Journal of Multivariate Analysis, Elsevier, vol. 102(3), pages 482-495, March.
    18. Gutiérrez, Luis & Mena, Ramsés H. & Ruggiero, Matteo, 2016. "A time dependent Bayesian nonparametric model for air quality analysis," Computational Statistics & Data Analysis, Elsevier, vol. 95(C), pages 161-175.
    19. Cai, Bo & Meyer, Renate, 2011. "Bayesian semiparametric modeling of survival data based on mixtures of B-spline distributions," Computational Statistics & Data Analysis, Elsevier, vol. 55(3), pages 1260-1272, March.
    20. Antonio Lijoi & Bernardo Nipoti, 2013. "A class of hazard rate mixtures for combining survival data from different experiments," DEM Working Papers Series 059, University of Pavia, Department of Economics and Management.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:71:y:2009:i:4:p:755-782. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.