IDEAS home Printed from https://ideas.repec.org/a/wly/japmet/v39y2024i3p481-497.html
   My bibliography  Save this article

A high‐dimensional multinomial logit model

Author

Listed:
  • Didier Nibbering

Abstract

The number of parameters in a standard multinomial logit model increases linearly with the number of choice alternatives and number of explanatory variables. Because many modern applications involve large choice sets with categorical explanatory variables, which enter the model as large sets of binary dummies, the number of parameters in a multinomial logit model is often large. This paper proposes a new method for data‐driven two‐way parameter clustering over outcome categories and explanatory dummy categories in a multinomial logit model. A Bayesian Dirichlet process mixture model encourages parameters to cluster over the categories, which reduces the number of unique model parameters and provides interpretable clusters of categories. In an empirical application, we estimate the holiday preferences of 11 household types over 49 holiday destinations and identify a small number of household segments with different preferences across clusters of holiday destinations.

Suggested Citation

  • Didier Nibbering, 2024. "A high‐dimensional multinomial logit model," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(3), pages 481-497, April.
  • Handle: RePEc:wly:japmet:v:39:y:2024:i:3:p:481-497
    DOI: 10.1002/jae.3034
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/jae.3034
    Download Restriction: no

    File URL: https://libkey.io/10.1002/jae.3034?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Luc Bauwens & Jean-François Carpantier & Arnaud Dufays, 2017. "Autoregressive Moving Average Infinite Hidden Markov-Switching Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 35(2), pages 162-182, April.
    2. Hausman, Jerry & McFadden, Daniel, 1984. "Specification Tests for the Multinomial Logit Model," Econometrica, Econometric Society, vol. 52(5), pages 1219-1240, September.
    3. Cramer, J. S. & Ridder, G., 1991. "Pooling states in the multinomial logit model," Journal of Econometrics, Elsevier, vol. 47(2-3), pages 267-272, February.
    4. Elaine Zanutto & Eric Bradlow, 2006. "Data pruning in consumer choice models," Quantitative Marketing and Economics (QME), Springer, vol. 4(3), pages 267-287, September.
    5. Carson, Richard T. & Louviere, Jordan J., 2014. "Statistical properties of consideration sets," Journal of choice modelling, Elsevier, vol. 13(C), pages 37-48.
    6. Khai Xiang Chiong & Matthew Shum, 2019. "Random Projection Estimation of Discrete-Choice Models with Large Choice Sets," Management Science, INFORMS, vol. 65(1), pages 256-271, January.
    7. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555, November.
    8. John Geweke & Gautam Gowrisankaran & Robert J. Town, 2003. "Bayesian Inference for Hospital Quality in a Selection Model," Econometrica, Econometric Society, vol. 71(4), pages 1215-1238, July.
    9. Raffaella Giacomini & Halbert White, 2006. "Tests of Conditional Predictive Ability," Econometrica, Econometric Society, vol. 74(6), pages 1545-1578, November.
    10. Vincent, Martin & Hansen, Niels Richard, 2014. "Sparse group lasso and high dimensional multinomial classification," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 771-786.
    11. Richard F. MacLehose & David B. Dunson, 2010. "Bayesian Semiparametric Multiple Shrinkage," Biometrics, The International Biometric Society, vol. 66(2), pages 455-462, June.
    12. Nicholas G. Polson & James G. Scott & Jesse Windle, 2013. "Bayesian Inference for Logistic Models Using Pólya--Gamma Latent Variables," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1339-1349, December.
    13. Geweke, John, 2007. "Interpretation and inference in mixture models: Simple MCMC works," Computational Statistics & Data Analysis, Elsevier, vol. 51(7), pages 3529-3550, April.
    14. Denzil G. Fiebig & Michael P. Keane & Jordan Louviere & Nada Wasi, 2010. "The Generalized Multinomial Logit Model: Accounting for Scale and Coefficient Heterogeneity," Marketing Science, INFORMS, vol. 29(3), pages 393-421, 05-06.
    15. Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.
    16. Howard D. Bondell & Brian J. Reich, 2009. "Simultaneous Factor Selection and Collapsing Levels in ANOVA," Biometrics, The International Biometric Society, vol. 65(1), pages 169-177, March.
    17. William Greene & David Hensher, 2010. "Does scale heterogeneity across individuals matter? An empirical assessment of alternative logit models," Transportation, Springer, vol. 37(3), pages 413-428, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Didier Nibbering, 2023. "A High-dimensional Multinomial Logit Model," Monash Econometrics and Business Statistics Working Papers 19/23, Monash University, Department of Econometrics and Business Statistics.
    2. Nibbering, Didier & Hastie, Trevor J., 2022. "Multiclass-penalized logistic regression," Computational Statistics & Data Analysis, Elsevier, vol. 169(C).
    3. Haile, Kaleab K. & Tirivayi, Nyasha & Tesfaye, Wondimagegn, 2019. "Farmers’ willingness to accept payments for ecosystem services on agricultural land: The case of climate-smart agroforestry in Ethiopia," Ecosystem Services, Elsevier, vol. 39(C).
    4. Erik Stam & Roy Thurik & Peter van der Zwan, 2010. "Entrepreneurial exit in real and imagined markets," Industrial and Corporate Change, Oxford University Press and the Associazione ICC, vol. 19(4), pages 1109-1139, August.
    5. David Hensher & John Rose & Zheng Li, 2012. "Does the choice model method and/or the data matter?," Transportation, Springer, vol. 39(2), pages 351-385, March.
    6. Haoying Wang & Guohui Wu, 2022. "Modeling discrete choices with large fine-scale spatial data: opportunities and challenges," Journal of Geographical Systems, Springer, vol. 24(3), pages 325-351, July.
    7. Line Bjørnskov Pedersen & Julie Riise & Arne Risa Hole & Dorte Gyrd-Hansen, 2014. "GPs' shifting agencies in choice of treatment," Applied Economics, Taylor & Francis Journals, vol. 46(7), pages 750-761, March.
    8. Reinhard A. Weisser, 2020. "How Personality Shapes Study Location Choices," Research in Higher Education, Springer;Association for Institutional Research, vol. 61(1), pages 88-116, February.
    9. Chen, Tiantian & Fu, Xiaowen & Hensher, David A. & Li, Zhi-Chun & Sze, N.N., 2022. "Air travel choice, online meeting and passenger heterogeneity – An international study on travellers’ preference during a pandemic," Transportation Research Part A: Policy and Practice, Elsevier, vol. 165(C), pages 439-453.
    10. Holte, Jon Helgheim & Kjaer, Trine & Abelsen, Birgit & Olsen, Jan Abel, 2015. "The impact of pecuniary and non-pecuniary incentives for attracting young doctors to rural general practice," Social Science & Medicine, Elsevier, vol. 128(C), pages 1-9.
    11. Kaambwa, Billingsley & Lancsar, Emily & McCaffrey, Nicola & Chen, Gang & Gill, Liz & Cameron, Ian D. & Crotty, Maria & Ratcliffe, Julie, 2015. "Investigating consumers' and informal carers' views and preferences for consumer directed care: A discrete choice experiment," Social Science & Medicine, Elsevier, vol. 140(C), pages 81-94.
    12. Balogh, Péter & Békési, Dániel & Gorton, Matthew & Popp, József & Lengyel, Péter, 2016. "Consumer willingness to pay for traditional food products," Food Policy, Elsevier, vol. 61(C), pages 176-184.
    13. I. G. Ukpong & K. G. Balcombe & I. M. Fraser & F. J. Areal, 2019. "Preferences for Mitigation of the Negative Impacts of the Oil and Gas Industry in the Niger Delta Region of Nigeria," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 74(2), pages 811-843, October.
    14. Yuanyuan Gu & Arne Risa Hole & Stephanie Knox, 2013. "Fitting the generalized multinomial logit model in Stata," Stata Journal, StataCorp LLC, vol. 13(2), pages 382-397, June.
    15. Shr, Yau-Huo & Ready, Richard C. & Orland, Brian & Echols, Stuart, 2017. "Do Visual Representations Influence Survey Responses? Evidence from a Choice Experiment on Landscape Attributes of Green Infrastructure," 2017 Annual Meeting, July 30-August 1, Chicago, Illinois 258397, Agricultural and Applied Economics Association.
    16. Rocha, Luiz Eduardo Vasconcelos & Santos, Gilnei Costa & Bastos, Patricia de Melo Abrita, 2006. "Evolução Da Distribuição Da Renda E Da Pobreza Das Famílias Ocupadas E Residentes No Meio Rural Do Estado De Minas Gerais, De 1981 A 2003," 44th Congress, July 23-27, 2006, Fortaleza, Ceará, Brazil 148649, Sociedade Brasileira de Economia, Administracao e Sociologia Rural (SOBER).
    17. John C. Whitehead & Daniel K. Lew, 2020. "Estimating recreation benefits through joint estimation of revealed and stated preference discrete choice data," Empirical Economics, Springer, vol. 58(4), pages 2009-2029, April.
    18. Haghani, Milad & Bliemer, Michiel C.J. & Hensher, David A., 2021. "The landscape of econometric discrete choice modelling research," Journal of choice modelling, Elsevier, vol. 40(C).
    19. Yuanyuan Gu & Richard Norman & Rosalie Viney, 2014. "Estimating Health State Utility Values From Discrete Choice Experiments—A Qaly Space Model Approach," Health Economics, John Wiley & Sons, Ltd., vol. 23(9), pages 1098-1114, September.
    20. Leite, Sheila Cristina Ferreira & De Figueiredo, Margarida Garcia, 2006. "Fluxos De Algodão Em Pluma Para Exportação No Estado Da Bahia: Uma Aplicação De Programação Linear," 44th Congress, July 23-27, 2006, Fortaleza, Ceará, Brazil 149116, Sociedade Brasileira de Economia, Administracao e Sociologia Rural (SOBER).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:japmet:v:39:y:2024:i:3:p:481-497. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.interscience.wiley.com/jpages/0883-7252/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.