IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v55y2011i1p261-279.html
   My bibliography  Save this article

A Bayesian approach to model-based clustering for binary panel probit models

Author

Listed:
  • Aßmann, Christian
  • Boysen-Hogrefe, Jens

Abstract

Considering latent heterogeneity is of special importance in nonlinear models in order to gauge correctly the effect of explanatory variables on the dependent variable. A stratified model-based clustering approach is adapted for modeling latent heterogeneity in binary panel probit models. Within a Bayesian framework an estimation algorithm dealing with the inherent label switching problem is provided. Determination of the number of clusters is based on the marginal likelihood and a cross-validation approach. A simulation study is conducted to assess the ability of both approaches to determine on the correct number of clusters indicating high accuracy for the marginal likelihood criterion, with the cross-validation approach performing similarly well in most circumstances. Different concepts of marginal effects incorporating latent heterogeneity at different degrees arise within the considered model setup and are directly at hand within Bayesian estimation via MCMC methodology. An empirical illustration of the methodology developed indicates that consideration of latent heterogeneity via latent clusters provides the preferred model specification over a pooled and a random coefficient specification.

Suggested Citation

  • Aßmann, Christian & Boysen-Hogrefe, Jens, 2011. "A Bayesian approach to model-based clustering for binary panel probit models," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 261-279, January.
  • Handle: RePEc:eee:csdana:v:55:y:2011:i:1:p:261-279
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(10)00161-1
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Juárez, Miguel A. & Steel, Mark F. J., 2010. "Model-Based Clustering of Non-Gaussian Panel Data Based on Skew-t Distributions," Journal of Business & Economic Statistics, American Statistical Association, vol. 28(1), pages 52-66.
    2. Ishwaran H. & James L.F. & Sun J., 2001. "Bayesian Model Selection in Finite Mixtures by Marginal Density Decompositions," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1316-1332, December.
    3. Congdon, Peter, 2006. "Bayesian model choice based on Monte Carlo estimates of posterior model probabilities," Computational Statistics & Data Analysis, Elsevier, vol. 50(2), pages 346-357, January.
    4. Fruhwirth-Schnatter, Sylvia & Fruhwirth, Rudolf, 2007. "Auxiliary mixture sampling with applications to logistic models," Computational Statistics & Data Analysis, Elsevier, vol. 51(7), pages 3509-3528, April.
    5. Bertschek, Irene, 1995. "Product and Process Innovation as a Response to Increasing Import and Foreign Direct Investment," Journal of Industrial Economics, Wiley Blackwell, vol. 43(4), pages 341-357, December.
    6. William Greene, 2004. "Convenient estimators for the panel probit model: Further results," Empirical Economics, Springer, vol. 29(1), pages 21-47, January.
    7. Chakraborty, Sounak, 2009. "Bayesian binary kernel probit model for microarray based cancer classification and gene selection," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4198-4209, October.
    8. Geweke, John & Keane, Michael, 2007. "Smoothly mixing regressions," Journal of Econometrics, Elsevier, vol. 138(1), pages 252-290, May.
    9. William Greene, 2004. "The behaviour of the maximum likelihood estimator of limited dependent variable models in the presence of fixed effects," Econometrics Journal, Royal Economic Society, vol. 7(1), pages 98-119, June.
    10. Cameron,A. Colin & Trivedi,Pravin K., 2005. "Microeconometrics," Cambridge Books, Cambridge University Press, number 9780521848053, September.
    11. Matthew Stephens, 2000. "Dealing with label switching in mixture models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(4), pages 795-809.
    12. Surajit Ray & Bruce G. Lindsay, 2008. "Model selection in high dimensions: a quadratic‐risk‐based approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(1), pages 95-118, February.
    13. Chib S. & Jeliazkov I., 2001. "Marginal Likelihood From the Metropolis-Hastings Output," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 270-281, March.
    14. Fraley C. & Raftery A.E., 2002. "Model-Based Clustering, Discriminant Analysis, and Density Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 611-631, June.
    15. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521747387, September.
    16. Lancaster, Tony, 2000. "The incidental parameter problem since 1948," Journal of Econometrics, Elsevier, vol. 95(2), pages 391-413, April.
    17. Chen, Jiahua & Khalili, Abbas, 2008. "Order Selection in Finite Mixture Models With a Nonsmooth Penalty," Journal of the American Statistical Association, American Statistical Association, vol. 103(484), pages 1674-1683.
    18. Geweke, John, 2007. "Interpretation and inference in mixture models: Simple MCMC works," Computational Statistics & Data Analysis, Elsevier, vol. 51(7), pages 3529-3550, April.
    19. Sylvia Frühwirth-Schnatter & Christoph Pamminger, 2009. "Bayesian Clustering of Categorical Time Series Using Finite Mixtures of Markov Chain Models," NRN working papers 2009-07, The Austrian Center for Labor Economics and the Analysis of the Welfare State, Johannes Kepler University Linz, Austria.
    20. Heard, Nicholas A. & Holmes, Christopher C. & Stephens, David A., 2006. "A Quantitative Study of Gene Regulation Involved in the Immune Response of Anopheline Mosquitoes: An Application of Bayesian Hierarchical Clustering of Curves," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 18-29, March.
    21. Mark S. Handcock & Adrian E. Raftery & Jeremy M. Tantrum, 2007. "Model‐based clustering for social networks," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 170(2), pages 301-354, March.
    22. Juarez, Miguel A. & Steel, Mark F. J., 2006. "Model-based Clustering of non-Gaussian Panel Data," MPRA Paper 880, University Library of Munich, Germany.
    23. Greene, William H. & Hensher, David A., 2003. "A latent class model for discrete choice analysis: contrasts with mixed logit," Transportation Research Part B: Methodological, Elsevier, vol. 37(8), pages 681-698, September.
    24. David Revelt & Kenneth Train, 1998. "Mixed Logit With Repeated Choices: Households' Choices Of Appliance Efficiency Level," The Review of Economics and Statistics, MIT Press, vol. 80(4), pages 647-657, November.
    25. Jara, Alejandro & Jose Garcia-Zattera, Maria & Lesaffre, Emmanuel, 2007. "A Dirichlet process mixture model for the analysis of correlated binary responses," Computational Statistics & Data Analysis, Elsevier, vol. 51(11), pages 5402-5415, July.
    26. Dunson, David B. & Herring, Amy H. & Siega-Riz, Anna Maria, 2008. "Bayesian Inference on Changes in Response Densities Over Predictor Clusters," Journal of the American Statistical Association, American Statistical Association, vol. 103(484), pages 1508-1517.
    27. Sylvia Fruhwirth-Schnatter, 2004. "Estimating marginal likelihoods for mixture and Markov switching models using bridge sampling techniques," Econometrics Journal, Royal Economic Society, vol. 7(1), pages 143-167, June.
    28. Fruhwirth-Schnatter, Sylvia & Kaufmann, Sylvia, 2008. "Model-Based Clustering of Multiple Time Series," Journal of Business & Economic Statistics, American Statistical Association, vol. 26, pages 78-89, January.
    29. Bertschek, Irene & Lechner, Michael, 1998. "Convenient estimators for the panel probit model," Journal of Econometrics, Elsevier, vol. 87(2), pages 329-371, September.
    30. Wagner, Helga & Tüchler, Regina, 2010. "Bayesian estimation of random effects models for multivariate responses of mixed data," Computational Statistics & Data Analysis, Elsevier, vol. 54(5), pages 1206-1218, May.
    31. Aßmann, Christian, 2007. "Determinants and Costs of Current Account Reversals under Heterogeneity and Serial Correlation," Economics Working Papers 2007-17, Christian-Albrechts-University of Kiel, Department of Economics.
    32. Yao, Weixin & Lindsay, Bruce G., 2009. "Bayesian Mixture Labeling by Highest Posterior Density," Journal of the American Statistical Association, American Statistical Association, vol. 104(486), pages 758-767.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Carmelo J. León & Jorge E. Araña, 2012. "The Dynamics of Preference Elicitation after an Environmental Disaster: Stability and Emotional Load," Land Economics, University of Wisconsin Press, vol. 88(2), pages 362-381.
    2. León, Carmelo J. & Araña, Jorge E. & Hanemann, W. Michael & Riera, Pere, 2014. "Heterogeneity and emotions in the valuation of non-use damages caused by oil spills," Ecological Economics, Elsevier, vol. 97(C), pages 129-139.
    3. Michael Bergrab & Christian Aßmann, 2024. "Automated Bayesian variable selection methods for binary regression models with missing covariate data," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 18(2), pages 203-244, June.
    4. Sylvia Frühwirth-Schnatter, 2011. "Panel data analysis: a survey on model-based clustering of time series," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 5(4), pages 251-280, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Aßmann, Christian & Boysen-Hogrefe, Jens, 2009. "A bayesian approach to model-based clustering for panel probit models," Economics Working Papers 2009-03, Christian-Albrechts-University of Kiel, Department of Economics.
    2. Sylvia Frühwirth-Schnatter, 2011. "Panel data analysis: a survey on model-based clustering of time series," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 5(4), pages 251-280, December.
    3. Sylvia Frühwirth‐Schnatter & Christoph Pamminger & Andrea Weber & Rudolf Winter‐Ebmer, 2012. "Labor market entry and earnings dynamics: Bayesian inference using mixtures‐of‐experts Markov chain clustering," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 27(7), pages 1116-1137, November.
    4. Juarez, Miguel A. & Steel, Mark F. J., 2006. "Model-based Clustering of non-Gaussian Panel Data," MPRA Paper 880, University Library of Munich, Germany.
    5. Martin Burda & Roman Liesenfeld & Jean-Francois Richard, 2008. "Bayesian Analysis of a Probit Panel Data Model with Unobserved Individual Heterogeneity and Autocorrelated Errors," Working Papers tecipa-321, University of Toronto, Department of Economics.
    6. William Greene, 2007. "Discrete Choice Modeling," Working Papers 07-6, New York University, Leonard N. Stern School of Business, Department of Economics.
    7. Kerem Tuzcuoglu, 2019. "Composite Likelihood Estimation of an Autoregressive Panel Probit Model with Random Effects," Staff Working Papers 19-16, Bank of Canada.
    8. Paleti, Rajesh, 2018. "Generalized multinomial probit Model: Accommodating constrained random parameters," Transportation Research Part B: Methodological, Elsevier, vol. 118(C), pages 248-262.
    9. Domanski, Adam, 2009. "Estimating Mixed Logit Recreation Demand Models With Large Choice Sets," 2009 Annual Meeting, July 26-28, 2009, Milwaukee, Wisconsin 49413, Agricultural and Applied Economics Association.
    10. Stefania Troiano & Daniel Vecchiato & Francesco Marangon & Tiziano Tempesta & Federico Nassivera, 2019. "Households’ Preferences for a New ‘Climate-Friendly’ Heating System: Does Contribution to Reducing Greenhouse Gases Matter?," Energies, MDPI, vol. 12(13), pages 1-19, July.
    11. Olsthoorn, Mark & Schleich, Joachim & Guetlein, Marie-Charlotte & Durand, Antoine & Faure, Corinne, 2023. "Beyond energy efficiency: Do consumers care about life-cycle properties of household appliances?," Energy Policy, Elsevier, vol. 174(C).
    12. Falco, Paolo & Maloney, William F. & Rijkers, Bob & Sarrias, Mauricio, 2015. "Heterogeneity in subjective wellbeing: An application to occupational allocation in Africa," Journal of Economic Behavior & Organization, Elsevier, vol. 111(C), pages 137-153.
    13. Bauwens, Luc & Dufays, Arnaud & Rombouts, Jeroen V.K., 2014. "Marginal likelihood for Markov-switching and change-point GARCH models," Journal of Econometrics, Elsevier, vol. 178(P3), pages 508-522.
    14. Rufo, M.J. & Martín, J. & Pérez, C.J., 2010. "New approaches to compute Bayes factor in finite mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3324-3335, December.
    15. Abildtrup, Jens & Garcia, Serge & Olsen, Søren Bøye & Stenger, Anne, 2013. "Spatial preference heterogeneity in forest recreation," Ecological Economics, Elsevier, vol. 92(C), pages 67-77.
    16. Joan L. Walker & Moshe Ben-Akiva, 2011. "Advances in Discrete Choice: Mixture Models," Chapters, in: André de Palma & Robin Lindsey & Emile Quinet & Roger Vickerman (ed.), A Handbook of Transport Economics, chapter 8, Edward Elgar Publishing.
    17. Talevi, Marta & Pattanayak, Subhrendu K. & Das, Ipsita & Lewis, Jessica J. & Singha, Ashok K., 2022. "Speaking from experience: Preferences for cooking with biogas in rural India," Energy Economics, Elsevier, vol. 107(C).
    18. Tinessa, Fiore & Marzano, Vittorio & Papola, Andrea, 2020. "Mixing distributions of tastes with a Combination of Nested Logit (CoNL) kernel: Formulation and performance analysis," Transportation Research Part B: Methodological, Elsevier, vol. 141(C), pages 1-23.
    19. Sergio Colombo & Nick Hanley & Jordan Louviere, 2009. "Modeling preference heterogeneity in stated choice data: an analysis for public goods generated by agriculture," Agricultural Economics, International Association of Agricultural Economists, vol. 40(3), pages 307-322, May.
    20. Daniel A. Brent & Lata Gangadharan & Anke D. Leroux & Paul A. Raschky, 2022. "Reducing bias in preference elicitation for environmental public goods," Australian Journal of Agricultural and Resource Economics, Australian Agricultural and Resource Economics Society, vol. 66(2), pages 280-308, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:55:y:2011:i:1:p:261-279. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.