IDEAS home Printed from https://ideas.repec.org/a/bla/scjsta/v42y2015i4p1065-1077.html
   My bibliography  Save this article

Cluster-Specific Variable Selection for Product Partition Models

Author

Listed:
  • Fernando A. Quintana
  • Peter Müller
  • Ana Luisa Papoila

Abstract

type="main" xml:id="sjos12151-abs-0001"> We propose a random partition model that implements prediction with many candidate covariates and interactions. The model is based on a modified product partition model that includes a regression on covariates by favouring homogeneous clusters in terms of these covariates. Additionally, the model allows for a cluster-specific choice of the covariates that are included in this evaluation of homogeneity. The variable selection is implemented by introducing a set of cluster-specific latent indicators that include or exclude covariates. The proposed model is motivated by an application to predicting mortality in an intensive care unit in Lisboa, Portugal.

Suggested Citation

  • Fernando A. Quintana & Peter Müller & Ana Luisa Papoila, 2015. "Cluster-Specific Variable Selection for Product Partition Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(4), pages 1065-1077, December.
  • Handle: RePEc:bla:scjsta:v:42:y:2015:i:4:p:1065-1077
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1111/sjos.12151
    Download Restriction: Access to full text is restricted to subscribers.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Chung, Yeonseung & Dunson, David B., 2009. "Nonparametric Bayes Conditional Distribution Modeling With Variable Selection," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1646-1660.
    2. Peter D. Hoff, 2005. "Subset Clustering of Binary Sequences, with an Application to Genomic Abnormality Data," Biometrics, The International Biometric Society, vol. 61(4), pages 1027-1036, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qi Li & Juan Lin & Jeffrey S. Racine, 2013. "Optimal Bandwidth Selection for Nonparametric Conditional Distribution and Quantile Functions," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 31(1), pages 57-65, January.
    2. Igari, Ryosuke & Hoshino, Takahiro, 2018. "A Bayesian data combination approach for repeated durations under unobserved missing indicators: Application to interpurchase-timing in marketing," Computational Statistics & Data Analysis, Elsevier, vol. 126(C), pages 150-166.
    3. Pati, Debdeep & Dunson, David B. & Tokdar, Surya T., 2013. "Posterior consistency in conditional distribution estimation," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 456-472.
    4. Ryo Kato & Takahiro Hoshino, 2020. "Semiparametric Bayesian multiple imputation for regression models with missing mixed continuous–discrete covariates," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(3), pages 803-825, June.
    5. Liverani, Silvia & Hastie, David I. & Azizi, Lamiae & Papathomas, Michail & Richardson, Sylvia, 2015. "PReMiuM: An R Package for Profile Regression Mixture Models Using Dirichlet Processes," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 64(i07).
    6. Villani, Mattias & Kohn, Robert & Nott, David J., 2012. "Generalized smooth finite mixtures," Journal of Econometrics, Elsevier, vol. 171(2), pages 121-133.
    7. Lee, Kuo-Jung & Chen, Ray-Bing & Wu, Ying Nian, 2016. "Bayesian variable selection for finite mixture model of linear regressions," Computational Statistics & Data Analysis, Elsevier, vol. 95(C), pages 1-16.
    8. Cozzini, Alberto & Jasra, Ajay & Montana, Giovanni & Persing, Adam, 2014. "A Bayesian mixture of lasso regressions with t-errors," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 84-97.
    9. Eric Coker & Robert Gunier & Asa Bradman & Kim Harley & Katherine Kogut & John Molitor & Brenda Eskenazi, 2017. "Association between Pesticide Profiles Used on Agricultural Fields near Maternal Residences during Pregnancy and IQ at Age 7 Years," IJERPH, MDPI, vol. 14(5), pages 1-20, May.
    10. Lauren Hoskovec & Matthew D. Koslovsky & Kirsten Koehler & Nicholas Good & Jennifer L. Peel & John Volckens & Ander Wilson, 2023. "Infinite hidden Markov models for multiple multivariate time series with missing data," Biometrics, The International Biometric Society, vol. 79(3), pages 2592-2604, September.
    11. Lauren Hoskovec & Wande Benka-Coker & Rachel Severson & Sheryl Magzamen & Ander Wilson, 2021. "Model choice for estimating the association between exposure to chemical mixtures and health outcomes: A simulation study," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-21, March.
    12. Barrientos, Andrés F. & Canale, Antonio, 2021. "A Bayesian goodness-of-fit test for regression," Computational Statistics & Data Analysis, Elsevier, vol. 155(C).
    13. Nadja Klein & Michael Stanley Smith, 2021. "Bayesian variable selection for non‐Gaussian responses: a marginally calibrated copula approach," Biometrics, The International Biometric Society, vol. 77(3), pages 809-823, September.
    14. Debdeep Pati & David Dunson, 2014. "Bayesian nonparametric regression with varying residual density," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 66(1), pages 1-31, February.
    15. Dipankar Bandyopadhyay & Antonio Canale, 2016. "Non-parametric spatial models for clustered ordered periodontal data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 65(4), pages 619-640, August.
    16. Chung, Hwan & Chang, Hsiu-Ching, 2012. "Bayesian approaches to the model selection problem in the analysis of latent stage-sequential process," Computational Statistics & Data Analysis, Elsevier, vol. 56(12), pages 4097-4110.
    17. Takahiro Hoshino & Ryosuke Igari, 2017. "Quasi-Bayesian Inference for Latent Variable Models with External Information: Application to generalized linear mixed models for biased data," Keio-IES Discussion Paper Series 2017-014, Institute for Economics Studies, Keio University.
    18. Huang, Yifan & Meng, Shengwang, 2020. "A Bayesian nonparametric model and its application in insurance loss prediction," Insurance: Mathematics and Economics, Elsevier, vol. 93(C), pages 84-94.
    19. Yang, Mingan, 2012. "Bayesian variable selection for logistic mixed model with nonparametric random effects," Computational Statistics & Data Analysis, Elsevier, vol. 56(9), pages 2663-2674.
    20. Norets, Andriy, 2015. "Bayesian regression with nonparametric heteroskedasticity," Journal of Econometrics, Elsevier, vol. 185(2), pages 409-419.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:scjsta:v:42:y:2015:i:4:p:1065-1077. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0303-6898 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.