Printed from https://ideas.repec.org/p/msh/ebswps/2023-19.html

A High-dimensional Multinomial Logit Model

Author

Listed:
  • Didier Nibbering

Abstract

The number of parameters in a standard multinomial logit model increases linearly with the number of choice alternatives and number of explanatory variables. Since many modern applications involve large choice sets with categorical explanatory variables, which enter the model as large sets of binary dummies, the number of parameters in a multinomial logit model is often large. This paper proposes a new method for data-driven two-way parameter clustering over outcome categories and explanatory dummy categories in a multinomial logit model. A Bayesian Dirichlet process mixture model encourages parameters to cluster over the categories, which reduces the number of unique model parameters and provides interpretable clusters of categories. In an empirical application, we estimate the holiday preferences of 11 household types over 49 holiday destinations, and identify a small number of household segments with different preferences across clusters of holiday destinations.
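The parameter-count argument in the abstract can be illustrated with a minimal sketch. It is not the paper's estimator: the cluster labels below are fixed by hand rather than learned through a Dirichlet process mixture, the cluster counts (3 household segments, 5 destination clusters) are hypothetical, and the function names `mnl_probabilities` and `expand_clustered` are invented for illustration. Identification constraints such as a base category are also ignored, so the full count is simply 11 x 49.

```python
import numpy as np


def mnl_probabilities(X, beta):
    """Choice probabilities of a standard multinomial logit model.

    X    : (n, K) matrix of explanatory variables (here: dummy categories)
    beta : (K, J) coefficient matrix, one column per choice alternative
    """
    utilities = X @ beta
    # Subtract the row maximum before exponentiating for numerical stability.
    utilities = utilities - utilities.max(axis=1, keepdims=True)
    exp_u = np.exp(utilities)
    return exp_u / exp_u.sum(axis=1, keepdims=True)


def expand_clustered(theta, row_labels, col_labels):
    """Build the full (K, J) coefficient matrix from cluster-level values.

    theta      : (R, C) one value per (dummy cluster, outcome cluster) pair
    row_labels : (K,) cluster index in 0..R-1 for each explanatory dummy
    col_labels : (J,) cluster index in 0..C-1 for each outcome category
    """
    return theta[np.ix_(row_labels, col_labels)]


# Dimensions from the abstract: 11 household types, 49 holiday destinations.
K, J = 11, 49
# Hypothetical cluster counts, chosen only for illustration.
R, C = 3, 5

rng = np.random.default_rng(0)
row_labels = rng.integers(0, R, size=K)  # assumed, not estimated
col_labels = rng.integers(0, C, size=J)  # assumed, not estimated
theta = rng.normal(size=(R, C))

beta = expand_clustered(theta, row_labels, col_labels)
probs = mnl_probabilities(np.eye(K), beta)  # one row per household type

print("unrestricted parameters:", K * J)   # 539
print("cluster-level parameters:", R * C)  # 15
```

Under these assumptions, two household types with the same row label share identical choice probabilities, and two destinations with the same column label are equally likely for every household type, which is what makes the clusters interpretable.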

Suggested Citation

  • Didier Nibbering, 2023. "A High-dimensional Multinomial Logit Model," Monash Econometrics and Business Statistics Working Papers 19/23, Monash University, Department of Econometrics and Business Statistics.
  • Handle: RePEc:msh:ebswps:2023-19

    Download full text from publisher

    File URL: https://www.monash.edu/business/ebs/research/publications/ebs/2023/wp19-2023.pdf
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Didier Nibbering, 2024. "A high‐dimensional multinomial logit model," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(3), pages 481-497, April.
    2. Didier Nibbering, 2019. "A High-dimensional Multinomial Choice Model," Monash Econometrics and Business Statistics Working Papers 19/19, Monash University, Department of Econometrics and Business Statistics.
    3. David Hensher & John Rose & Zheng Li, 2012. "Does the choice model method and/or the data matter?," Transportation, Springer, vol. 39(2), pages 351-385, March.
    4. Line Bjørnskov Pedersen & Julie Riise & Arne Risa Hole & Dorte Gyrd-Hansen, 2014. "GPs' shifting agencies in choice of treatment," Applied Economics, Taylor & Francis Journals, vol. 46(7), pages 750-761, March.
    5. Chen, Tiantian & Fu, Xiaowen & Hensher, David A. & Li, Zhi-Chun & Sze, N.N., 2022. "Air travel choice, online meeting and passenger heterogeneity – An international study on travellers’ preference during a pandemic," Transportation Research Part A: Policy and Practice, Elsevier, vol. 165(C), pages 439-453.
    6. Holte, Jon Helgheim & Kjaer, Trine & Abelsen, Birgit & Olsen, Jan Abel, 2015. "The impact of pecuniary and non-pecuniary incentives for attracting young doctors to rural general practice," Social Science & Medicine, Elsevier, vol. 128(C), pages 1-9.
    7. Kaambwa, Billingsley & Lancsar, Emily & McCaffrey, Nicola & Chen, Gang & Gill, Liz & Cameron, Ian D. & Crotty, Maria & Ratcliffe, Julie, 2015. "Investigating consumers' and informal carers' views and preferences for consumer directed care: A discrete choice experiment," Social Science & Medicine, Elsevier, vol. 140(C), pages 81-94.
    8. Balogh, Péter & Békési, Dániel & Gorton, Matthew & Popp, József & Lengyel, Péter, 2016. "Consumer willingness to pay for traditional food products," Food Policy, Elsevier, vol. 61(C), pages 176-184.
    9. I. G. Ukpong & K. G. Balcombe & I. M. Fraser & F. J. Areal, 2019. "Preferences for Mitigation of the Negative Impacts of the Oil and Gas Industry in the Niger Delta Region of Nigeria," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 74(2), pages 811-843, October.
    10. Yuanyuan Gu & Arne Risa Hole & Stephanie Knox, 2013. "Fitting the generalized multinomial logit model in Stata," Stata Journal, StataCorp LP, vol. 13(2), pages 382-397, June.
    11. Shr, Yau-Huo & Ready, Richard C. & Orland, Brian & Echols, Stuart, 2017. "Do Visual Representations Influence Survey Responses? Evidence from a Choice Experiment on Landscape Attributes of Green Infrastructure," 2017 Annual Meeting, July 30-August 1, Chicago, Illinois 258397, Agricultural and Applied Economics Association.
    12. Rocha, Luiz Eduardo Vasconcelos & Santos, Gilnei Costa & Bastos, Patricia de Melo Abrita, 2006. "Evolução Da Distribuição Da Renda E Da Pobreza Das Famílias Ocupadas E Residentes No Meio Rural Do Estado De Minas Gerais, De 1981 A 2003," 44th Congress, July 23-27, 2006, Fortaleza, Ceará, Brazil 148649, Sociedade Brasileira de Economia, Administracao e Sociologia Rural (SOBER).
    13. John C. Whitehead & Daniel K. Lew, 2020. "Estimating recreation benefits through joint estimation of revealed and stated preference discrete choice data," Empirical Economics, Springer, vol. 58(4), pages 2009-2029, April.
    14. Yuanyuan Gu & Richard Norman & Rosalie Viney, 2014. "Estimating Health State Utility Values From Discrete Choice Experiments—A Qaly Space Model Approach," Health Economics, John Wiley & Sons, Ltd., vol. 23(9), pages 1098-1114, September.
    15. Leite, Sheila Cristina Ferreira & De Figueiredo, Margarida Garcia, 2006. "Fluxos De Algodão Em Pluma Para Exportação No Estado Da Bahia: Uma Aplicação De Programação Linear," 44th Congress, July 23-27, 2006, Fortaleza, Ceará, Brazil 149116, Sociedade Brasileira de Economia, Administracao e Sociologia Rural (SOBER).
    16. Arne Risa Hole & Hong Il Yoo, 2017. "The use of heuristic optimization algorithms to facilitate maximum simulated likelihood estimation of random parameter logit models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 66(5), pages 997-1013, November.
    17. Völker, Marc & Lienhoop, Nele, 2016. "Exploring group dynamics in deliberative choice experiments," Ecological Economics, Elsevier, vol. 123(C), pages 57-67.
    18. Owusu, Rebecca & Dadzie, Samuel Kwesi Ndzebah, 2021. "Heterogeneity in consumer preferences for organic and genetically modified food products in Ghana," African Journal of Agricultural and Resource Economics, African Association of Agricultural Economists, vol. 16(2), June.
    19. Santos, Jair Carvalho Dos, 2006. "Estimativa De Custo De Coleta E Rentabilidade Para Sistema Extrativo De Latex De Seringueira Na Amazonia," 44th Congress, July 23-27, 2006, Fortaleza, Ceará, Brazil 145689, Sociedade Brasileira de Economia, Administracao e Sociologia Rural (SOBER).
    20. Ajayi, V. & Reiner, D., 2020. "Consumer Willingness to Pay for Reducing the Environmental Footprint of Green Plastics," Cambridge Working Papers in Economics 20110, Faculty of Economics, University of Cambridge.

    More about this item

    Keywords

    large choice sets; Dirichlet process prior; multinomial logit model; high-dimensional models

    JEL classification:

    • C11 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Bayesian Analysis: General
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C25 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Discrete Regression and Qualitative Choice Models; Discrete Regressors; Proportions; Probabilities
    • C35 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Discrete Regression and Qualitative Choice Models; Discrete Regressors; Proportions
    • C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation

