IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1810.10465.html
   My bibliography  Save this paper

Deep Neural Networks for Choice Analysis: A Statistical Learning Theory Perspective

Author

Listed:
  • Shenhao Wang
  • Qingyi Wang
  • Nate Bailey
  • Jinhua Zhao

Abstract

While researchers increasingly use deep neural networks (DNN) to analyze individual choices, overfitting and interpretability issues remain as obstacles in theory and practice. By using statistical learning theory, this study presents a framework to examine the tradeoff between estimation and approximation errors, and between prediction and interpretation losses. It operationalizes the DNN interpretability in the choice analysis by formulating the metrics of interpretation loss as the difference between true and estimated choice probability functions. This study also uses the statistical learning theory to upper bound the estimation error of both prediction and interpretation losses in DNN, shedding light on why DNN does not have the overfitting issue. Three scenarios are then simulated to compare DNN to binary logit model (BNL). We found that DNN outperforms BNL in terms of both prediction and interpretation for most of the scenarios, and larger sample size unleashes the predictive power of DNN but not BNL. DNN is also used to analyze the choice of trip purposes and travel modes based on the National Household Travel Survey 2017 (NHTS2017) dataset. These experiments indicate that DNN can be used for choice analysis beyond the current practice of demand forecasting because it has the inherent utility interpretation, the flexibility of accommodating various information formats, and the power of automatically learning utility specification. DNN is both more predictive and interpretable than BNL unless the modelers have complete knowledge about the choice task, and the sample size is small. Overall, statistical learning theory can be a foundation for future studies in the non-asymptotic data regime or using high-dimensional statistical models in choice analysis, and the experiments show the feasibility and effectiveness of DNN for its wide applications to policy and behavioral analysis.

Suggested Citation

  • Shenhao Wang & Qingyi Wang & Nate Bailey & Jinhua Zhao, 2018. "Deep Neural Networks for Choice Analysis: A Statistical Learning Theory Perspective," Papers 1810.10465, arXiv.org, revised Sep 2019.
  • Handle: RePEc:arx:papers:1810.10465
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1810.10465
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Kenneth Train, 1980. "A Structured Logit Model of Auto Ownership and Mode Choice," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 47(2), pages 357-370.
    2. Edward L. Glaeser & Scott Duke Kominers & Michael Luca & Nikhil Naik, 2018. "Big Data And Big Cities: The Promises And Limitations Of Improved Measures Of Urban Life," Economic Inquiry, Western Economic Association International, vol. 56(1), pages 114-137, January.
    3. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555.
    4. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    5. Chorus, Caspar G. & Arentze, Theo A. & Timmermans, Harry J.P., 2008. "A Random Regret-Minimization model of travel choice," Transportation Research Part B: Methodological, Elsevier, vol. 42(1), pages 1-18, January.
    6. Bhat, Chandra R. & Pulugurta, Vamsi, 1998. "A comparison of two alternative behavioral choice mechanisms for household auto ownership decisions," Transportation Research Part B: Methodological, Elsevier, vol. 32(1), pages 61-75, January.
    7. Beggs, S. & Cardell, S. & Hausman, J., 1981. "Assessing the potential demand for electric cars," Journal of Econometrics, Elsevier, vol. 17(1), pages 1-19, September.
    8. Small, K. & Winston, C., 1998. ""The Demand for Transportation: Models and Applications"," Papers 98-99-6, California Irvine - School of Social Sciences.
    9. Hausman, J. A. & Abrevaya, Jason & Scott-Morton, F. M., 1998. "Misclassification of the dependent variable in a discrete-response setting," Journal of Econometrics, Elsevier, vol. 87(2), pages 239-269, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Shenhao Wang & Qingyi Wang & Jinhua Zhao, 2018. "Deep Neural Networks for Choice Analysis: Extracting Complete Economic Information for Interpretation," Papers 1812.04528, arXiv.org, revised Apr 2021.
    2. Shenhao Wang & Qingyi Wang & Jinhua Zhao, 2019. "Multitask Learning Deep Neural Networks to Combine Revealed and Stated Preference Data," Papers 1901.00227, arXiv.org, revised Aug 2019.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Shenhao & Wang, Qingyi & Zhao, Jinhua, 2020. "Multitask learning deep neural networks to combine revealed and stated preference data," Journal of choice modelling, Elsevier, vol. 37(C).
    2. Wang, Shenhao & Mo, Baichuan & Zhao, Jinhua, 2021. "Theory-based residual neural networks: A synergy of discrete choice models and deep neural networks," Transportation Research Part B: Methodological, Elsevier, vol. 146(C), pages 333-358.
    3. Wang, Shenhao & Wang, Qingyi & Bailey, Nate & Zhao, Jinhua, 2021. "Deep neural networks for choice analysis: A statistical learning theory perspective," Transportation Research Part B: Methodological, Elsevier, vol. 148(C), pages 60-81.
    4. Shenhao Wang & Qingyi Wang & Jinhua Zhao, 2019. "Multitask Learning Deep Neural Networks to Combine Revealed and Stated Preference Data," Papers 1901.00227, arXiv.org, revised Aug 2019.
    5. Shenhao Wang & Baichuan Mo & Jinhua Zhao, 2020. "Theory-based residual neural networks: A synergy of discrete choice models and deep neural networks," Papers 2010.11644, arXiv.org.
    6. Salon, Deborah, 2009. "Neighborhoods, cars, and commuting in New York City: A discrete choice approach," Transportation Research Part A: Policy and Practice, Elsevier, vol. 43(2), pages 180-196, February.
    7. John K. Dagsvik & Zhiyang Jia, 2016. "Labor Supply as a Choice Among Latent Jobs: Unobserved Heterogeneity and Identification," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 31(3), pages 487-506, April.
    8. Kim, Junghun & Seung, Hyunchan & Lee, Jongsu & Ahn, Joongha, 2020. "Asymmetric preference and loss aversion for electric vehicles: The reference-dependent choice model capturing different preference directions," Energy Economics, Elsevier, vol. 86(C).
    9. Kaplan, Sigal & Shiftan, Yoram & Bekhor, Shlomo, 2012. "Development and estimation of a semi-compensatory model with a flexible error structure," Transportation Research Part B: Methodological, Elsevier, vol. 46(2), pages 291-304.
    10. Galdo, Virgilio & Li, Yue & Rama, Martin, 2021. "Identifying urban areas by combining human judgment and machine learning: An application to India," Journal of Urban Economics, Elsevier, vol. 125(C).
    11. Franco Chingcuanco & Eric Miller, 2014. "A meta-model of vehicle ownership choice parameters," Transportation, Springer, vol. 41(5), pages 923-945, September.
    12. Liu, Gang, 2007. "A behavioral model of work-trip mode choice in Shanghai," China Economic Review, Elsevier, vol. 18(4), pages 456-476.
    13. John K. Dagsvik & Zhiyang Jia, 2014. "Labor supply as a discrete choice among latent jobs: Unobserved heterogeneity and identification," Discussion Papers 786, Statistics Norway, Research Department.
    14. Anders F. Jensen & Elisabetta Cherchi & Stefan L. Mabit & Juan de Dios Ortúzar, 2017. "Predicting the Potential Market for Electric Vehicles," Transportation Science, INFORMS, vol. 51(2), pages 427-440, May.
    15. Godager, Geir, 2012. "Birds of a feather flock together: A study of doctor–patient matching," Journal of Health Economics, Elsevier, vol. 31(1), pages 296-305.
    16. Siikamaki, Juha & Layton, David F., 2007. "Discrete choice survey experiments: A comparison using flexible methods," Journal of Environmental Economics and Management, Elsevier, vol. 53(1), pages 122-139, January.
    17. Ben Aoki-Sherwood & Catherine Bregou & David Liben-Nowell & Kiran Tomlinson & Thomas Zeng, 2024. "Bounding Consideration Probabilities in Consider-Then-Choose Ranking Models," Papers 2401.11016, arXiv.org.
    18. Kockelman, Kara M. & Podgorski, Kaethe & Bina, Michelle & Gadda, Shashank, 2009. "Public Perceptions of Pricing Existing Roads and Other Transportation Policies: The Texas Perspective," Journal of the Transportation Research Forum, Transportation Research Forum, vol. 48(3).
    19. de Lucio, Juan, 2021. "Estimación adelantada del crecimiento regional mediante redes neuronales LSTM," INVESTIGACIONES REGIONALES - Journal of REGIONAL RESEARCH, Asociación Española de Ciencia Regional, issue 49, pages 45-64.
    20. Hong il Yoo, 2012. "The perceived unreliability of rank-ordered data: an econometric origin and implications," Discussion Papers 2012-46, School of Economics, The University of New South Wales.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1810.10465. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.