IDEAS home Printed from https://ideas.repec.org/a/inm/ormnsc/v66y2020i8p3635-3656.html
   My bibliography  Save this article

A Conditional Gradient Approach for Nonparametric Estimation of Mixing Distributions

Author

Listed:
  • Srikanth Jagabathula

    (Stern School of Business, New York University, New York, New York 10012; Harvard Business School, Harvard University, Boston, Massachusetts 02163)

  • Lakshminarayanan Subramanian

    (Courant Institute of Mathematical Sciences, New York University, New York, New York 10012)

  • Ashwin Venkataraman

    (Courant Institute of Mathematical Sciences, New York University, New York, New York 10012; Harvard Institute for Quantitative Social Science, Harvard University, Cambridge, Massachusetts 02138)

Abstract

Mixture models are versatile tools that are used extensively in many fields, including operations, marketing, and econometrics. The main challenge in estimating mixture models is that the mixing distribution is often unknown, and imposing a priori parametric assumptions can lead to model misspecification issues. In this paper, we propose a new methodology for nonparametric estimation of the mixing distribution of a mixture of logit models. We formulate the likelihood-based estimation problem as a constrained convex program and apply the conditional gradient (also known as Frank–Wolfe) algorithm to solve this convex program. We show that our method iteratively generates the support of the mixing distribution and the mixing proportions. Theoretically, we establish the sublinear convergence rate of our estimator and characterize the structure of the recovered mixing distribution. Empirically, we test our approach on real-world datasets. We show that it outperforms the standard expectation-maximization (EM) benchmark on speed (16 times faster), in-sample fit (up to 24% reduction in the log-likelihood loss), and predictive (average 28% reduction in standard error metrics) and decision accuracies (extracts around 23% more revenue). On synthetic data, we show that our estimator is robust to different ground-truth mixing distributions and can also account for endogeneity.

Suggested Citation

  • Srikanth Jagabathula & Lakshminarayanan Subramanian & Ashwin Venkataraman, 2020. "A Conditional Gradient Approach for Nonparametric Estimation of Mixing Distributions," Management Science, INFORMS, vol. 66(8), pages 3635-3656, August.
  • Handle: RePEc:inm:ormnsc:v:66:y:2020:i:8:p:3635-3656
    DOI: 10.1287/mnsc.2019.3373
    as

    Download full text from publisher

    File URL: https://doi.org/10.1287/mnsc.2019.3373
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mnsc.2019.3373?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Feng, Long & Dicker, Lee H., 2018. "Approximate nonparametric maximum likelihood for mixture models: A convex optimization approach to fitting arbitrary multivariate mixing distributions," Computational Statistics & Data Analysis, Elsevier, vol. 122(C), pages 80-91.
    2. Daniel McFadden & Kenneth Train, 2000. "Mixed MNL models for discrete response," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 15(5), pages 447-470.
    3. Hauser, John R., 2014. "Consideration-set heuristics," Journal of Business Research, Elsevier, vol. 67(8), pages 1688-1699.
    4. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555.
    5. Marguerite Frank & Philip Wolfe, 1956. "An algorithm for quadratic programming," Naval Research Logistics Quarterly, John Wiley & Sons, vol. 3(1‐2), pages 95-110, March.
    6. Gilles Laurent & Eric Lapersonne & Jean-Jacques Le Goff, 1995. "Consideration sets of size one: An empirical investigation of automobile purchases," Post-Print hal-00458463, HAL.
    7. Jeremy T. Fox & Kyoo il Kim & Stephen P. Ryan & Patrick Bajari, 2011. "A simple estimator for the distribution of random coefficients," Quantitative Economics, Econometric Society, vol. 2(3), pages 381-418, November.
    8. Jonathan James, 2017. "MM Algorithm for General Mixed Multinomial Logit Models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(4), pages 841-857, June.
    9. Berry, Steven & Levinsohn, James & Pakes, Ariel, 1995. "Automobile Prices in Market Equilibrium," Econometrica, Econometric Society, vol. 63(4), pages 841-890, July.
    10. Chandra R. Bhat, 1997. "An Endogenous Segmentation Mode Choice Model with an Application to Intercity Travel," Transportation Science, INFORMS, vol. 31(1), pages 34-48, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Qi Feng & J. George Shanthikumar & Mengying Xue, 2022. "Consumer Choice Models and Estimation: A Review and Extension," Production and Operations Management, Production and Operations Management Society, vol. 31(2), pages 847-867, February.
    2. Xiyuan Ren & Joseph Y. J. Chow & Prateek Bansal, 2023. "Estimating a k-modal nonparametric mixed logit model with market-level data," Papers 2309.13159, arXiv.org, revised Aug 2024.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bansal, Prateek & Daziano, Ricardo A & Guerra, Erick, 2018. "Minorization-Maximization (MM) algorithms for semiparametric logit models: Bottlenecks, extensions, and comparisons," Transportation Research Part B: Methodological, Elsevier, vol. 115(C), pages 17-40.
    2. Bansal, Prateek & Daziano, Ricardo A. & Achtnicht, Martin, 2018. "Comparison of parametric and semiparametric representations of unobserved preference heterogeneity in logit models," Journal of choice modelling, Elsevier, vol. 27(C), pages 97-113.
    3. Xiyuan Ren & Joseph Y. J. Chow & Prateek Bansal, 2023. "Estimating a k-modal nonparametric mixed logit model with market-level data," Papers 2309.13159, arXiv.org, revised Aug 2024.
    4. Martin O'Connell & Pierre Dubois & Rachel Griffith, 2022. "The Use of Scanner Data for Economics Research," Annual Review of Economics, Annual Reviews, vol. 14(1), pages 723-745, August.
    5. Heiss, Florian & Hetzenecker, Stephan & Osterhaus, Maximilian, 2022. "Nonparametric estimation of the random coefficients model: An elastic net approach," Journal of Econometrics, Elsevier, vol. 229(2), pages 299-321.
    6. Pereira, Pedro & Ribeiro, Tiago, 2011. "The impact on broadband access to the Internet of the dual ownership of telephone and cable networks," International Journal of Industrial Organization, Elsevier, vol. 29(2), pages 283-293, March.
    7. Allais, Olivier & Etilé, Fabrice & Lecocq, Sébastien, 2015. "Mandatory labels, taxes and market forces: An empirical evaluation of fat policies," Journal of Health Economics, Elsevier, vol. 43(C), pages 27-44.
    8. Bernard Fortin & Nicolas Jacquemet & Bruce Shearer, 2008. "Policy Analysis in Health-Services Market: Accounting for Quality and Quantity," Annals of Economics and Statistics, GENES, issue 91-92, pages 293-319.
    9. Rachel Griffith & Lars Nesheim & Martin O'Connell, 2018. "Income effects and the welfare consequences of tax in differentiated product oligopoly," Quantitative Economics, Econometric Society, vol. 9(1), pages 305-341, March.
    10. Dieckhoener, Caroline & Hecking, Harald, 2012. "Greenhouse Gas Abatement Cost Curves of the Residential Heating Market – a Microeconomic Approach," EWI Working Papers 2012-16, Energiewirtschaftliches Institut an der Universitaet zu Koeln (EWI).
    11. Peter Davis & Pasquale Schiraldi, 2014. "The flexible coefficient multinomial logit (FC-MNL) model of demand for differentiated products," RAND Journal of Economics, RAND Corporation, vol. 45(1), pages 32-63, March.
    12. Kidokoro, Yukihiro, 2016. "A micro foundation for discrete choice models with multiple categories of goods," Journal of choice modelling, Elsevier, vol. 19(C), pages 54-72.
    13. Kameng Nip & Zhenbo Wang & Zizhuo Wang, 2021. "Assortment Optimization under a Single Transition Choice Model," Production and Operations Management, Production and Operations Management Society, vol. 30(7), pages 2122-2142, July.
    14. Patrick Bigler & Doina Maria Radulescu, 2022. "Environmental, Redistributive and Revenue Effects of Policies Promoting Fuel Efficient and Electric Vehicles," CESifo Working Paper Series 9645, CESifo.
    15. Martin, Elliott William, 2009. "New Vehicle Choice, Fuel Economy and Vehicle Incentives: An Analysis of Hybrid Tax Credits and the Gasoline Tax," University of California Transportation Center, Working Papers qt5gd206wv, University of California Transportation Center.
    16. Hovhannisyan, Vardges & Stiegert, Kyle W., 2011. "Vertical Channel Analysis of the U.S. Milk Market," 2011 Annual Meeting, July 24-26, 2011, Pittsburgh, Pennsylvania 103631, Agricultural and Applied Economics Association.
    17. Sofia Berto Villas-Boas, 2007. "Vertical Relationships between Manufacturers and Retailers: Inference with Limited Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 74(2), pages 625-652.
    18. Hoyos, David, 2010. "The state of the art of environmental valuation with discrete choice experiments," Ecological Economics, Elsevier, vol. 69(8), pages 1595-1603, June.
    19. Greene, William H. & Hensher, David A., 2003. "A latent class model for discrete choice analysis: contrasts with mixed logit," Transportation Research Part B: Methodological, Elsevier, vol. 37(8), pages 681-698, September.
    20. Schmitz, Hendrik & Stroka, Magdalena A., 2014. "Do Elderly Choose Nursing Homes by Quality, Price or Location?," Ruhr Economic Papers 495, RWI - Leibniz-Institut für Wirtschaftsforschung, Ruhr-University Bochum, TU Dortmund University, University of Duisburg-Essen.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:66:y:2020:i:8:p:3635-3656. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.