IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v111y2012icp1-19.html
   My bibliography  Save this article

Nonparametric Bayes classification and hypothesis testing on manifolds

Author

Listed:
  • Bhattacharya, Abhishek
  • Dunson, David

Abstract

Our first focus is prediction of a categorical response variable using features that lie on a general manifold. For example, the manifold may correspond to the surface of a hypersphere. We propose a general kernel mixture model for the joint distribution of the response and predictors, with the kernel expressed in product form and dependence induced through the unknown mixing measure. We provide simple sufficient conditions for large support and weak and strong posterior consistency in estimating both the joint distribution of the response and predictors and the conditional distribution of the response. Focusing on a Dirichlet process prior for the mixing measure, these conditions hold using von Mises–Fisher kernels when the manifold is the unit hypersphere. In this case, Bayesian methods are developed for efficient posterior computation using slice sampling. Next we develop Bayesian nonparametric methods for testing whether there is a difference in distributions between groups of observations on the manifold having unknown densities. We prove consistency of the Bayes factor and develop efficient computational methods for its calculation. The proposed classification and testing methods are evaluated using simulation examples and applied to spherical data applications.

Suggested Citation

  • Bhattacharya, Abhishek & Dunson, David, 2012. "Nonparametric Bayes classification and hypothesis testing on manifolds," Journal of Multivariate Analysis, Elsevier, vol. 111(C), pages 1-19.
  • Handle: RePEc:eee:jmvana:v:111:y:2012:i:c:p:1-19
    DOI: 10.1016/j.jmva.2012.02.020
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X12000814
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2012.02.020?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wu, Yuefeng & Ghosal, Subhashis, 2010. "The L1-consistency of Dirichlet mixtures in multivariate Bayesian density estimation," Journal of Multivariate Analysis, Elsevier, vol. 101(10), pages 2411-2419, November.
    2. Basu S. & Chib S., 2003. "Marginal Likelihood and Bayes Factors for Dirichlet Process Mixture Models," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 224-235, January.
    3. Abhishek Bhattacharya & David B. Dunson, 2010. "Nonparametric Bayesian density estimation on manifolds with applications to planar shapes," Biometrika, Biometrika Trust, vol. 97(4), pages 851-865.
    4. Bigelow, Jamie L. & Dunson, David B., 2009. "Bayesian Semiparametric Joint Models for Functional Predictors," Journal of the American Statistical Association, American Statistical Association, vol. 104(485), pages 26-36.
    5. Michael L. Pennell & David B. Dunson, 2008. "Nonparametric Bayes Testing of Changes in a Response Distribution with an Ordinal Predictor," Biometrics, The International Biometric Society, vol. 64(2), pages 413-423, June.
    6. David B. Dunson & Shyamal D. Peddada, 2008. "Bayesian nonparametric inference on stochastic ordering," Biometrika, Biometrika Trust, vol. 95(4), pages 859-874.
    7. Rolando De la Cruz‐Mesía & Fernando A. Quintana & Peter Müller, 2007. "Semiparametric Bayesian classification with longitudinal markers," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 56(2), pages 119-137, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kelter, Riko, 2022. "Power analysis and type I and type II error rates of Bayesian nonparametric two-sample tests for location-shifts based on the Bayes factor under Cauchy priors," Computational Statistics & Data Analysis, Elsevier, vol. 165(C).
    2. Arthur Pewsey & Eduardo García-Portugués, 2021. "Recent advances in directional statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(1), pages 1-58, March.
    3. Emil Cornea & Hongtu Zhu & Peter Kim & Joseph G. Ibrahim, 2017. "Regression models on Riemannian symmetric spaces," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(2), pages 463-482, March.
    4. Luz Adriana Pereira & Daniel Taylor‐Rodríguez & Luis Gutiérrez, 2020. "A Bayesian nonparametric testing procedure for paired samples," Biometrics, The International Biometric Society, vol. 76(4), pages 1133-1146, December.
    5. You, Kisung & Suh, Changhee, 2022. "Parameter estimation and model-based clustering with spherical normal distribution on the unit hypersphere," Computational Statistics & Data Analysis, Elsevier, vol. 171(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kelter, Riko, 2022. "Power analysis and type I and type II error rates of Bayesian nonparametric two-sample tests for location-shifts based on the Bayes factor under Cauchy priors," Computational Statistics & Data Analysis, Elsevier, vol. 165(C).
    2. Rabi Bhattacharya & Rachel Oliver, 2019. "Nonparametric Analysis of Non-Euclidean Data on Shapes and Images," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(1), pages 1-36, February.
    3. Silvia Montagna & Surya T. Tokdar & Brian Neelon & David B. Dunson, 2012. "Bayesian Latent Factor Regression for Functional and Longitudinal Data," Biometrics, The International Biometric Society, vol. 68(4), pages 1064-1073, December.
    4. Beom Seuk Hwang & Zhen Chen, 2015. "An Integrated Bayesian Nonparametric Approach for Stochastic and Variability Orders in ROC Curve Estimation: An Application to Endometriosis Diagnosis," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 923-934, September.
    5. Georges Bresson & Cheng Hsiao & Alain Pirotte, 2011. "Assessing the contribution of R&D to total factor productivity—a Bayesian approach to account for heterogeneity and heteroskedasticity," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 95(4), pages 435-452, December.
    6. Andr'es Ram'irez-Hassan & Alejandro L'opez-Vera, 2021. "Semi-parametric estimation of the EASI model: Welfare implications of taxes identifying clusters due to unobserved preference heterogeneity," Papers 2109.07646, arXiv.org.
    7. Emil Cornea & Hongtu Zhu & Peter Kim & Joseph G. Ibrahim, 2017. "Regression models on Riemannian symmetric spaces," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(2), pages 463-482, March.
    8. Georges Bresson & Cheng Hsiao, 2011. "A functional connectivity approach for modeling cross-sectional dependence with an application to the estimation of hedonic housing prices in Paris," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 95(4), pages 501-529, December.
    9. Pati, Debdeep & Dunson, David B. & Tokdar, Surya T., 2013. "Posterior consistency in conditional distribution estimation," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 456-472.
    10. Kirkby, J. Lars & Leitao, Álvaro & Nguyen, Duy, 2021. "Nonparametric density estimation and bandwidth selection with B-spline bases: A novel Galerkin method," Computational Statistics & Data Analysis, Elsevier, vol. 159(C).
    11. Jensen, Mark J. & Maheu, John M., 2014. "Estimating a semiparametric asymmetric stochastic volatility model with a Dirichlet process mixture," Journal of Econometrics, Elsevier, vol. 178(P3), pages 523-538.
    12. Han, Shengtong & Zhang, Hongmei & Karmaus, Wilfried & Roberts, Graham & Arshad, Hasan, 2017. "Adjusting background noise in cluster analyses of longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 109(C), pages 93-104.
    13. Angela Schörgendorfer & Adam J. Branscum & Timothy E. Hanson, 2013. "A Bayesian Goodness of Fit Test and Semiparametric Generalization of Logistic Regression with Measurement Data," Biometrics, The International Biometric Society, vol. 69(2), pages 508-519, June.
    14. Laura Liu, 2017. "Density Forecasts in Panel Models: A semiparametric Bayesian Perspective," PIER Working Paper Archive 17-006, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 28 Apr 2017.
    15. Im, Yunju & Tan, Aixin, 2021. "Bayesian subgroup analysis in regression using mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 162(C).
    16. Guohua Feng & Chuan Wang & Xibin Zhang, 2019. "Estimation of inefficiency in stochastic frontier models: a Bayesian kernel approach," Journal of Productivity Analysis, Springer, vol. 51(1), pages 1-19, February.
    17. Liverani, Silvia & Hastie, David I. & Azizi, Lamiae & Papathomas, Michail & Richardson, Sylvia, 2015. "PReMiuM: An R Package for Profile Regression Mixture Models Using Dirichlet Processes," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 64(i07).
    18. Bruno Scarpa & David B. Dunson, 2014. "Enriched Stick-Breaking Processes for Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 647-660, June.
    19. Laura Liu, 2018. "Density Forecasts in Panel Data Models : A Semiparametric Bayesian Perspective," Finance and Economics Discussion Series 2018-036, Board of Governors of the Federal Reserve System (U.S.).
    20. J. Griffin, 2011. "Bayesian clustering of distributions in stochastic frontier analysis," Journal of Productivity Analysis, Springer, vol. 36(3), pages 275-283, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:111:y:2012:i:c:p:1-19. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.