IDEAS home Printed from https://ideas.repec.org/a/eee/eejocm/v39y2021ics1755534521000178.html
   My bibliography  Save this article

mixl: An open-source R package for estimating complex choice models on large datasets

Author

Listed:
  • Molloy, Joseph
  • Becker, Felix
  • Schmid, Basil
  • Axhausen, Kay W.

Abstract

This paper introduces mixl, a new R package for the estimation of advanced choice models. The estimation of such models typically relies on simulation methods with a large number of random draws to obtain stable results. mixl uses inherent properties of the log-likelihood problem structure to greatly reduce both the memory usage and runtime of the estimation procedure for specific types of mixed multinomial logit models. Functions for prediction and posterior analysis are included. Parallel computing is also supported, with near linear speedups observed on up to 24 cores. mixl is directly accessible from R, available on CRAN. We show that mixl is fast, easy to use, and scales to very large datasets. This paper presents the architecture and performance of the package, details its use, and presents some results using real world data and models.

Suggested Citation

  • Molloy, Joseph & Becker, Felix & Schmid, Basil & Axhausen, Kay W., 2021. "mixl: An open-source R package for estimating complex choice models on large datasets," Journal of choice modelling, Elsevier, vol. 39(C).
  • Handle: RePEc:eee:eejocm:v:39:y:2021:i:c:s1755534521000178
    DOI: 10.1016/j.jocm.2021.100284
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1755534521000178
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jocm.2021.100284?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555, September.
    2. Sarrias, Mauricio & Daziano, Ricardo, 2017. "Multinomial Logit Models with Continuous and Discrete Individual Heterogeneity in R: The gmnl Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 79(i02).
    3. Basil Schmid & Milos Balac & Kay W. Axhausen, 2019. "Post-Car World: data collection methods and response behavior in a multi-stage travel survey," Transportation, Springer, vol. 46(2), pages 425-492, April.
    4. Schmid, Basil & Jokubauskaite, Simona & Aschauer, Florian & Peer, Stefanie & Hössinger, Reinhard & Gerike, Regine & Jara-Diaz, Sergio R. & Axhausen, Kay W., 2019. "A pooled RP/SP mode, route and destination choice model to investigate mode and user-type effects in the value of travel time savings," Transportation Research Part A: Policy and Practice, Elsevier, vol. 124(C), pages 262-294.
    5. Ben-Akiva, Moshe & McFadden, Daniel & Train, Kenneth & Börsch-Supan, Axel, 2002. "Hybrid Choice Models: Progress and Challenges," Sonderforschungsbereich 504 Publications 02-29, Sonderforschungsbereich 504, Universität Mannheim;Sonderforschungsbereich 504, University of Mannheim.
    6. McFadden, Daniel, 1980. "Econometric Models for Probabilistic Choice among Products," The Journal of Business, University of Chicago Press, vol. 53(3), pages 13-29, July.
    7. Zeileis, Achim, 2006. "Object-oriented Computation of Sandwich Estimators," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 16(i09).
    8. van Cranenburgh, Sander & Bliemer, Michiel C.J., 2019. "Information theoretic-based sampling of observations," Journal of choice modelling, Elsevier, vol. 31(C), pages 181-197.
    9. Hess, Stephane & Train, Kenneth E. & Polak, John W., 2006. "On the use of a Modified Latin Hypercube Sampling (MLHS) method in the estimation of a Mixed Logit Model for vehicle choice," Transportation Research Part B: Methodological, Elsevier, vol. 40(2), pages 147-163, February.
    10. Schmid, Basil & Axhausen, Kay W., 2019. "In-store or online shopping of search and experience goods: A hybrid choice approach," Journal of choice modelling, Elsevier, vol. 31(C), pages 156-180.
    11. Hess, Stephane & Palma, David, 2019. "Apollo: A flexible, powerful and customisable freeware package for choice model estimation and application," Journal of choice modelling, Elsevier, vol. 32(C), pages 1-1.
    12. Czajkowski, Mikołaj & Budziński, Wiktor, 2019. "Simulation error in maximum likelihood estimation of discrete choice models," Journal of choice modelling, Elsevier, vol. 31(C), pages 73-85.
    13. Arne Henningsen & Ott Toomet, 2011. "maxLik: A package for maximum likelihood estimation in R," Computational Statistics, Springer, vol. 26(3), pages 443-458, September.
    14. Daniel McFadden & Kenneth Train, 2000. "Mixed MNL models for discrete response," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 15(5), pages 447-470.
    15. Ben-Akiva, Moshe & McFadden, Daniel & Train, Kenneth, 2019. "Foundations of Stated Preference Elicitation: Consumer Behavior and Choice-based Conjoint Analysis," Foundations and Trends(R) in Econometrics, now publishers, vol. 10(1-2), pages 1-144, January.
    16. Eddelbuettel, Dirk & Francois, Romain, 2011. "Rcpp: Seamless R and C++ Integration," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 40(i08).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Meister, Adrian & Felder, Matteo & Schmid, Basil & Axhausen, Kay W., 2023. "Route choice modeling for cyclists on urban networks," Transportation Research Part A: Policy and Practice, Elsevier, vol. 173(C).
    2. Milos Balac & Sebastian Hörl & Basil Schmid, 2024. "Discrete choice modeling with anonymized data," Transportation, Springer, vol. 51(2), pages 351-370, April.
    3. Dias, Charitha & Abdullah, Muhammad & Lovreglio, Ruggiero & Sachchithanantham, Sumana & Rekatheeban, Markkandu & Sathyaprasad, I.M.S., 2022. "Exploring home-to-school trip mode choices in Kandy, Sri Lanka," Journal of Transport Geography, Elsevier, vol. 99(C).
    4. Aizaki, Hideo & Fogarty, James, 2023. "R packages and tutorial for case 1 best–worst scaling," Journal of choice modelling, Elsevier, vol. 46(C).
    5. Schmid, Basil & Molloy, Joseph & Peer, Stefanie & Jokubauskaite, Simona & Aschauer, Florian & Hössinger, Reinhard & Gerike, Regine & Jara-Diaz, Sergio R. & Axhausen, Kay W., 2021. "The value of travel time savings and the value of leisure in Zurich: Estimation, decomposition and policy implications," Transportation Research Part A: Policy and Practice, Elsevier, vol. 150(C), pages 186-215.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tinessa, Fiore & Marzano, Vittorio & Papola, Andrea, 2020. "Mixing distributions of tastes with a Combination of Nested Logit (CoNL) kernel: Formulation and performance analysis," Transportation Research Part B: Methodological, Elsevier, vol. 141(C), pages 1-23.
    2. Schmid, Basil & Molloy, Joseph & Peer, Stefanie & Jokubauskaite, Simona & Aschauer, Florian & Hössinger, Reinhard & Gerike, Regine & Jara-Diaz, Sergio R. & Axhausen, Kay W., 2021. "The value of travel time savings and the value of leisure in Zurich: Estimation, decomposition and policy implications," Transportation Research Part A: Policy and Practice, Elsevier, vol. 150(C), pages 186-215.
    3. Johanna Lena Dahlhausen & Cam Rungie & Jutta Roosen, 2018. "Value of labeling credence attributes—common structures and individual preferences," Agricultural Economics, International Association of Agricultural Economists, vol. 49(6), pages 741-751, November.
    4. Stefania Troiano & Daniel Vecchiato & Francesco Marangon & Tiziano Tempesta & Federico Nassivera, 2019. "Households’ Preferences for a New ‘Climate-Friendly’ Heating System: Does Contribution to Reducing Greenhouse Gases Matter?," Energies, MDPI, vol. 12(13), pages 1-19, July.
    5. Arora, Nikita & Crastes dit Sourd, Romain & Hanson, Kara & Woldesenbet, Dorka & Seifu, Abiy & Quaife, Matthew, 2022. "Linking health worker motivation with their stated job preferences: A hybrid choice analysis in Ethiopia," Social Science & Medicine, Elsevier, vol. 307(C).
    6. Isler, Cassiano Augusto & Blumenfeld, Marcelo & Caldeira, Gabriel Pereira & Roberts, Clive, 2024. "Long-Distance railway mode choice in Brazil: Evidence from a discrete choice experiment," Research in Transportation Economics, Elsevier, vol. 104(C).
    7. Malte Welling & Ewa Zawojska & Julian Sagebiel, 2022. "Information, Consequentiality and Credibility in Stated Preference Surveys: A Choice Experiment on Climate Adaptation," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 82(1), pages 257-283, May.
    8. Daina, Nicolò & Sivakumar, Aruna & Polak, John W., 2017. "Modelling electric vehicles use: a survey on the methods," Renewable and Sustainable Energy Reviews, Elsevier, vol. 68(P1), pages 447-460.
    9. Krueger, Rico & Bierlaire, Michel & Daziano, Ricardo A. & Rashidi, Taha H. & Bansal, Prateek, 2021. "Evaluating the predictive abilities of mixed logit models with unobserved inter- and intra-individual heterogeneity," Journal of choice modelling, Elsevier, vol. 41(C).
    10. Haghani, Milad & Sarvi, Majid & Shahhoseini, Zahra, 2015. "Accommodating taste heterogeneity and desired substitution pattern in exit choices of pedestrian crowd evacuees using a mixed nested logit model," Journal of choice modelling, Elsevier, vol. 16(C), pages 58-68.
    11. Zsanett Blaga & Peter Czine & Barbara Takacs & Anna Szilagyi & Reka Szekeres & Zita Wachal & Csaba Hegedus & Gyula Buchholcz & Balazs Varga & Daniel Priksz & Mariann Bombicz & Adrienn Monika Szabo & R, 2023. "Examination of Preferences for COVID-19 Vaccines in Hungary Based on Their Properties—Examining the Impact of Pandemic Awareness with a Hybrid Choice Approach," IJERPH, MDPI, vol. 20(2), pages 1-16, January.
    12. John Buckell & David A Hensher & Stephane Hess, 2021. "Kicking the habit is hard: A hybrid choice model investigation into the role of addiction in smoking behavior," Health Economics, John Wiley & Sons, Ltd., vol. 30(1), pages 3-19, January.
    13. Rossolov, Oleksandr & Susilo, Yusak O., 2024. "Are consumers ready to pay extra for crowd-shipping e-groceries and why? A hybrid choice analysis for developing economies," Transportation Research Part A: Policy and Practice, Elsevier, vol. 187(C).
    14. Hess, Stephane & Palma, David, 2019. "Apollo: A flexible, powerful and customisable freeware package for choice model estimation and application," Journal of choice modelling, Elsevier, vol. 32(C), pages 1-1.
    15. Hancock, Thomas O. & Broekaert, Jan & Hess, Stephane & Choudhury, Charisma F., 2020. "Quantum probability: A new method for modelling travel behaviour," Transportation Research Part B: Methodological, Elsevier, vol. 139(C), pages 165-198.
    16. Youssef M Aboutaleb & Mazen Danaf & Yifei Xie & Moshe Ben-Akiva, 2020. "Sparse Covariance Estimation in Logit Mixture Models," Papers 2001.05034, arXiv.org.
    17. Bansal, Prateek & Krueger, Rico & Bierlaire, Michel & Daziano, Ricardo A. & Rashidi, Taha H., 2020. "Bayesian estimation of mixed multinomial logit models: Advances and simulation-based evaluations," Transportation Research Part B: Methodological, Elsevier, vol. 131(C), pages 124-142.
    18. Rotaris, Lucia & Giansoldati, Marco & Scorrano, Mariangela, 2021. "The slow uptake of electric cars in Italy and Slovenia. Evidence from a stated-preference survey and the role of knowledge and environmental awareness," Transportation Research Part A: Policy and Practice, Elsevier, vol. 144(C), pages 1-18.
    19. Dugstad, Anders & Brouwer, Roy & Grimsrud, Kristine & Kipperberg, Gorm & Lindhjem, Henrik & Navrud, Ståle, 2024. "Nature is ours! – Psychological ownership and preferences for wind energy," Energy Economics, Elsevier, vol. 129(C).
    20. Boyce, Christopher & Czajkowski, Mikołaj & Hanley, Nick, 2019. "Personality and economic choices," Journal of Environmental Economics and Management, Elsevier, vol. 94(C), pages 82-100.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:eejocm:v:39:y:2021:i:c:s1755534521000178. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/journal-of-choice-modelling .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.