IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v135y2015icp43-58.html
   My bibliography  Save this article

A Bayesian method for analyzing combinations of continuous, ordinal, and nominal categorical data with missing values

Author

Listed:
  • Zhang, Xiao
  • Boscardin, W. John
  • Belin, Thomas R.
  • Wan, Xiaohai
  • He, Yulei
  • Zhang, Kui

Abstract

From a Bayesian perspective, we propose a general method for analyzing a combination of continuous, ordinal (including binary), and categorical/nominal multivariate measures with missing values. We assume multivariate normal linear regression models for multivariate continuous measures, multivariate probit models for correlated ordinal measures, and multivariate multinomial probit models for multivariate categorical/nominal measures. Then we assume a multivariate normal linear model on the continuous vector comprised of continuous variables and those underlying normal variables for ordinal variables from multivariate probit models and for categorical variables from multinomial probit models. We develop a Markov chain Monte Carlo (MCMC) algorithm to estimate unknown parameters including regression parameters, cut-points for ordinal data from the multivariate probit models, and the covariance matrix encompassing both continuous variables and the underlying normal latent variables. Combining the continuous variables and the normal latent variables allows us to model combinations of continuous, ordinal, and categorical multivariate data simultaneously. The framework incorporates flexible priors for the covariance matrix, provides a foundation for inference about the underlying covariance structure, and imputes missing data where needed. The method is illustrated through simulated examples and two real data applications.

Suggested Citation

  • Zhang, Xiao & Boscardin, W. John & Belin, Thomas R. & Wan, Xiaohai & He, Yulei & Zhang, Kui, 2015. "A Bayesian method for analyzing combinations of continuous, ordinal, and nominal categorical data with missing values," Journal of Multivariate Analysis, Elsevier, vol. 135(C), pages 43-58.
  • Handle: RePEc:eee:jmvana:v:135:y:2015:i:c:p:43-58
    DOI: 10.1016/j.jmva.2014.11.007
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X14002620
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2014.11.007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Diana L. Miglioretti, 2003. "Latent Transition Regression for Mixed Outcomes," Biometrics, The International Biometric Society, vol. 59(3), pages 710-720, September.
    2. Golob, Thomas F. & Regan, A C, 2002. "Trucking Industry Adoption of Information Technology: A Structural Multivariate Probit Model," University of California Transportation Center, Working Papers qt9w1988t7, University of California Transportation Center.
    3. J.‐Q. Shi & S.‐Y. Lee, 2000. "Latent variable models with mixed continuous and polytomous data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(1), pages 77-87.
    4. Golob, Thomas F. & Reagan, Amelia C., 2002. "Trucking Industry Adoption of Information Technology: A structural Multivariate Discrete Choice Model," University of California Transportation Center, Working Papers qt7kv5f17n, University of California Transportation Center.
    5. Geweke, John & Keane, Michael P & Runkle, David, 1994. "Alternative Computational Approaches to Inference in the Multinomial Probit Model," The Review of Economics and Statistics, MIT Press, vol. 76(4), pages 609-632, November.
    6. Geweke, John & Keane, Michael & Runkle, David, 1994. "Recursively Simulating Multinomial Multiperiod Probit Probabilities," MPRA Paper 55140, University Library of Munich, Germany.
    7. McCulloch, Robert E. & Polson, Nicholas G. & Rossi, Peter E., 2000. "A Bayesian analysis of the multinomial probit model with fully identified parameters," Journal of Econometrics, Elsevier, vol. 99(1), pages 173-193, November.
    8. McFadden, Daniel, 1989. "A Method of Simulated Moments for Estimation of Discrete Response Models without Numerical Integration," Econometrica, Econometric Society, vol. 57(5), pages 995-1026, September.
    9. Philip Heidelberger & Peter D. Welch, 1983. "Simulation Run Length Control in the Presence of an Initial Transient," Operations Research, INFORMS, vol. 31(6), pages 1109-1144, December.
    10. Geweke, John F. & Keane, Michael P. & Runkle, David E., 1997. "Statistical inference in the multinomial multiperiod probit model," Journal of Econometrics, Elsevier, vol. 80(1), pages 125-165, September.
    11. William Greene, 2004. "Convenient estimators for the panel probit model: Further results," Empirical Economics, Springer, vol. 29(1), pages 21-47, January.
    12. Martin Spieß, 2006. "Estimation of a Two-Equation Panel Model with Mixed Continuous and Ordered Categorical Outcomes and Missing Data," Discussion Papers 010, Europa-Universität Flensburg, International Institute of Management.
    13. Meredith M. Regan & Paul J. Catalano, 2000. "Regression Models and Risk Estimation for Mixed Discrete and Continuous Outcomes in Developmental Toxicology," Risk Analysis, John Wiley & Sons, vol. 20(3), pages 363-376, June.
    14. Martin Spiess, 2006. "Estimation of a two‐equation panel model with mixed continuous and ordered categorical outcomes and missing data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 55(4), pages 525-538, August.
    15. Irini Moustaki & Martin Knott, 2000. "Generalized latent trait models," Psychometrika, Springer;The Psychometric Society, vol. 65(3), pages 391-411, September.
    16. Mary Dupuis Sammel & Louise M. Ryan & Julie M. Legler, 1997. "Latent Variable Models for Mixed Discrete and Continuous Outcomes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(3), pages 667-678.
    17. G. O. Roberts & S. K. Sahu, 1997. "Updating Schemes, Correlation Structure, Blocking and Parameterization for the Gibbs Sampler," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 59(2), pages 291-317.
    18. Zhang, Xiao & Boscardin, W. John & Belin, Thomas R., 2008. "Bayesian analysis of multivariate nominal measures using multivariate multinomial probit models," Computational Statistics & Data Analysis, Elsevier, vol. 52(7), pages 3697-3708, March.
    19. Ziegler, Andreas, 2002. "Simulated Classical Tests in the Multiperiod Multinomial Probit Model," ZEW Discussion Papers 02-38, ZEW - Leibniz Centre for European Economic Research.
    20. D. B. Dunson, 2000. "Bayesian latent variable models for clustered mixed outcomes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 62(2), pages 355-366.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Leila Amiri & Mojtaba Khazaei & Mojtaba Ganjali, 2018. "A mixture latent variable model for modeling mixed data in heterogeneous populations and its applications," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(1), pages 95-115, January.
    2. Kang, Xiaoning & Kang, Lulu & Chen, Wei & Deng, Xinwei, 2022. "A generative approach to modeling data with quantitative and qualitative responses," Journal of Multivariate Analysis, Elsevier, vol. 190(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhang, Xiao & Boscardin, W. John & Belin, Thomas R., 2008. "Bayesian analysis of multivariate nominal measures using multivariate multinomial probit models," Computational Statistics & Data Analysis, Elsevier, vol. 52(7), pages 3697-3708, March.
    2. Ricardo A. Daziano & Martin Achtnicht, 2014. "Forecasting Adoption of Ultra-Low-Emission Vehicles Using Bayes Estimates of a Multinomial Probit Model and the GHK Simulator," Transportation Science, INFORMS, vol. 48(4), pages 671-683, November.
    3. Jason D. Lemp & Kara M. Kockelman & Paul Damien, 2012. "A Bivariate Multinomial Probit Model for Trip Scheduling: Bayesian Analysis of the Work Tour," Transportation Science, INFORMS, vol. 46(3), pages 405-424, August.
    4. Kerem Tuzcuoglu, 2019. "Composite Likelihood Estimation of an Autoregressive Panel Probit Model with Random Effects," Staff Working Papers 19-16, Bank of Canada.
    5. John Geweke & Joel Horowitz & M. Hashem Pesaran, 2006. "Econometrics: A Bird’s Eye View," CESifo Working Paper Series 1870, CESifo.
    6. Gael M. Martin & David T. Frazier & Ruben Loaiza-Maya & Florian Huber & Gary Koop & John Maheu & Didier Nibbering & Anastasios Panagiotelis, 2023. "Bayesian Forecasting in the 21st Century: A Modern Review," Monash Econometrics and Business Statistics Working Papers 1/23, Monash University, Department of Econometrics and Business Statistics.
    7. Zhang, Q. & Ip, E.H., 2014. "Variable assessment in latent class models," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 146-156.
    8. Leila Amiri & Mojtaba Khazaei & Mojtaba Ganjali, 2018. "A mixture latent variable model for modeling mixed data in heterogeneous populations and its applications," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 102(1), pages 95-115, January.
    9. Harris, Katherine M. & Keane, Michael P., 1998. "A model of health plan choice:: Inferring preferences and perceptions from a combination of revealed preference and attitudinal data," Journal of Econometrics, Elsevier, vol. 89(1-2), pages 131-157, November.
    10. Lee, Lung-Fei, 1997. "Simulated maximum likelihood estimation of dynamic discrete choice statistical models some Monte Carlo results," Journal of Econometrics, Elsevier, vol. 82(1), pages 1-35.
    11. Prowse, Victoria L., 2005. "State Dependence in a Multi-State Model of Employment Dynamics," IZA Discussion Papers 1623, Institute of Labor Economics (IZA).
    12. Patrick Ding & Guido Imbens & Zhaonan Qu & Yinyu Ye, 2024. "Computationally Efficient Estimation of Large Probit Models," Papers 2407.09371, arXiv.org, revised Sep 2024.
    13. Belderbos, Rene & Carree, Martin & Diederen, Bert & Lokshin, Boris & Veugelers, Reinhilde, 2004. "Heterogeneity in R&D cooperation strategies," International Journal of Industrial Organization, Elsevier, vol. 22(8-9), pages 1237-1263, November.
    14. Park, Sang Soo & Lee, Chung-Ki, 2011. "베이지안 추정법을 이용한 주택선택의 다항프로빗 모형 분석 [Analysis of housing choice using multinomial probit model – Bayesian estimation]," MPRA Paper 37150, University Library of Munich, Germany.
    15. Martin, Gael M. & Frazier, David T. & Maneesoonthorn, Worapree & Loaiza-Maya, Rubén & Huber, Florian & Koop, Gary & Maheu, John & Nibbering, Didier & Panagiotelis, Anastasios, 2024. "Bayesian forecasting in economics and finance: A modern review," International Journal of Forecasting, Elsevier, vol. 40(2), pages 811-839.
    16. Daziano, Ricardo A. & Achtnicht, Martin, 2012. "Forecasting adoption of ultra-low-emission vehicles using the GHK simulator and Bayes estimates of a multinomial probit model," ZEW Discussion Papers 12-017, ZEW - Leibniz Centre for European Economic Research.
    17. Andreas Ziegler, 2007. "Simulated classical tests in multinomial probit models," Statistical Papers, Springer, vol. 48(4), pages 655-681, October.
    18. William Greene, 2001. "Fixed and Random Effects in Nonlinear Models," Working Papers 01-01, New York University, Leonard N. Stern School of Business, Department of Economics.
    19. Holloway, Garth J. & Barrett, Christopher B. & Ehui, Simeon K., 2002. "Bayes' Estimates Of The Double Hurdle Model In The Presence Of Fixed Costs," Working Papers 14741, Cornell University, Department of Applied Economics and Management.
    20. Ziegler Andreas, 2010. "Z-Tests in Multinomial Probit Models under Simulated Maximum Likelihood Estimation: Some Small Sample Properties," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 230(5), pages 630-652, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:135:y:2015:i:c:p:43-58. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.