IDEAS home Printed from https://ideas.repec.org/a/wly/ajagec/v104y2022i3p976-998.html
   My bibliography  Save this article

Nonparametric segmentation methods: Applications of unsupervised machine learning and revealed preference

Author

Listed:
  • Joey Blumberg
  • Gary Thompson

Abstract

Many recent efforts by econometricians have focused on supervised machine learning techniques to aid in empirical studies using experimental data. By contrast, this article explores the merits of unsupervised machine learning algorithms for informing ex ante policy design using observational data. We examine the extent to which groups of consumers with differing responses to economic incentives can be identified in a context of fruit and vegetable demand. Two classes of nonparametric algorithms—revealed preference and unsupervised machine learning—are compared for segmenting households in the National Consumer Panel. Nonlinear almost‐ideal demand models are estimated for all segments to determine which methods group households into segments with different expenditure and price elasticities. In‐sample comparisons and out‐of‐sample prediction results indicate methods using price‐quantity data alone—without demographic, geographic, or other variables—perform better at segmenting households into groups with sizeable differences in price and expenditure responsiveness. These segmentation results suggest considerable heterogeneity in household purchasing behavior of fruits and vegetables.

Suggested Citation

  • Joey Blumberg & Gary Thompson, 2022. "Nonparametric segmentation methods: Applications of unsupervised machine learning and revealed preference," American Journal of Agricultural Economics, John Wiley & Sons, vol. 104(3), pages 976-998, May.
  • Handle: RePEc:wly:ajagec:v:104:y:2022:i:3:p:976-998
    DOI: 10.1111/ajae.12257
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/ajae.12257
    Download Restriction: no

    File URL: https://libkey.io/10.1111/ajae.12257?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Harvey Goldstein & Michael J. R. Healy, 1995. "The Graphical Presentation of a Collection of Means," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 158(1), pages 175-177, January.
    2. McKelvey, Christopher, 2011. "Price, unit value, and quality demanded," Journal of Development Economics, Elsevier, vol. 95(2), pages 157-169, July.
    3. Stefan Wager & Susan Athey, 2018. "Estimation and Inference of Heterogeneous Treatment Effects using Random Forests," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1228-1242, July.
    4. Mark Dean & Daniel Martin, 2016. "Measuring Rationality with the Minimum Cost of Revealed Preference Violations," The Review of Economics and Statistics, MIT Press, vol. 98(3), pages 524-534, July.
    5. Liran Einav & Ephraim Leibtag & Aviv Nevo, 2010. "Recording discrepancies in Nielsen Homescan data: Are they present and do they matter?," Quantitative Marketing and Economics (QME), Springer, vol. 8(2), pages 207-239, June.
    6. Jayson L. Lusk, 2017. "Consumer Research with Big Data: Applications from the Food Demand Survey (FooDS)," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 99(2), pages 303-320.
    7. Patrice Bertail & France Caillavet, 2008. "Fruit and Vegetable Consumption Patterns: A Segmentation Approach," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 90(3), pages 827-842.
    8. Timothy K. M. Beatty & Ian A. Crawford, 2011. "How Demanding Is the Revealed Preference Approach to Demand?," American Economic Review, American Economic Association, vol. 101(6), pages 2782-2795, October.
    9. Chen Zhen & Eric A. Finkelstein & James M. Nonnemaker & Shawn A. Karns & Jessica E. Todd, 2014. "Predicting the Effects of Sugar-Sweetened Beverage Taxes on Food and Beverage Demand in a Large Demand System," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 96(1), pages 1-25.
    10. A. Belloni & V. Chernozhukov & I. Fernández‐Val & C. Hansen, 2017. "Program Evaluation and Causal Inference With High‐Dimensional Data," Econometrica, Econometric Society, vol. 85, pages 233-298, January.
    11. Gross, John, 1995. "Heterogeneity of preferences for local public goods: The case of private expenditure on public education," Journal of Public Economics, Elsevier, vol. 57(1), pages 103-127, May.
    12. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    13. Boonsaeng, Tullaya & Carpio, Carlos E., 2019. "A Comparison of Food Demand Estimation from Homescan and Consumer Expenditure Survey Data," Journal of Agricultural and Resource Economics, Western Agricultural Economics Association, vol. 44(1), January.
    14. John L. Park & Rodney B. Holcomb & Kellie Curry Raper & Oral Capps, 1996. "A Demand Systems Analysis of Food Commodities by U.S. Households Segmented by Income," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 78(2), pages 290-300.
    15. Varian, Hal R, 1982. "The Nonparametric Approach to Demand Analysis," Econometrica, Econometric Society, vol. 50(4), pages 945-973, July.
    16. Ferrier, Peyton M. & Zhen, Chen, 2017. "The Role of Income in Explaining the Shift from Preserved to Fresh Vegetable Purchases," Journal of Agricultural and Resource Economics, Western Agricultural Economics Association, vol. 42(3), September.
    17. Gross, John, 1995. "Testing Data for Consistency with Revealed Preference," The Review of Economics and Statistics, MIT Press, vol. 77(4), pages 701-710, November.
    18. Luo, Xing & Zhu, Xu & Lim, Eng Gee, 2019. "A parametric bootstrap algorithm for cluster number determination of load pattern categorization," Energy, Elsevier, vol. 180(C), pages 50-60.
    19. Cliona Ni Mhurchu & Helen Eyles & Chris Schilling & Qing Yang & William Kaye-Blake & Murat Genç & Tony Blakely, 2013. "Food Prices and Consumer Demand: Differences across Income Levels and Ethnic Groups," PLOS ONE, Public Library of Science, vol. 8(10), pages 1-12, October.
    20. Helen Jensen & Justo Manrique, 1998. "Demand for food commodities by income groups in Indonesia," Applied Economics, Taylor & Francis Journals, vol. 30(4), pages 491-501.
    21. Susan Athey, 2018. "The Impact of Machine Learning on Economics," NBER Chapters, in: The Economics of Artificial Intelligence: An Agenda, pages 507-547, National Bureau of Economic Research, Inc.
    22. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    23. Fraiman, Ricardo & Justel, Ana & Svarc, Marcela, 2008. "Selection of Variables for Cluster Analysis and Classification Rules," Journal of the American Statistical Association, American Statistical Association, vol. 103(483), pages 1294-1303.
    24. Sweitzer, Megan & Brown, Derick & Karns, Shawn & Muth, Mary K. & Siegel, Peter & Zhen, Chen, 2017. "Food-at-Home Expenditures: Comparing Commercial Household Scanner Data From IRI and Government Survey Data," Technical Bulletins 291969, United States Department of Agriculture, Economic Research Service.
    25. Varian, Hal R., 1985. "Non-parametric analysis of optimizing behavior with measurement error," Journal of Econometrics, Elsevier, vol. 30(1-2), pages 445-458.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jim Engle-Warnick & Natalia Mishagina, 2014. "Insensitivity to Prices in a Dictator Game," CIRANO Working Papers 2014s-19, CIRANO.
    2. Yoram Halevy & Dotan Persitz & Lanny Zrill, 2018. "Parametric Recoverability of Preferences," Journal of Political Economy, University of Chicago Press, vol. 126(4), pages 1558-1593.
    3. Javier A. Birchenall, 2024. "Random choice and market demand," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 57(1), pages 165-198, February.
    4. Byron Botha & Rulof Burger & Kevin Kotzé & Neil Rankin & Daan Steenkamp, 2023. "Big data forecasting of South African inflation," Empirical Economics, Springer, vol. 65(1), pages 149-188, July.
    5. Ian Crawford & Bram De Rock, 2014. "Empirical Revealed Preference," Annual Review of Economics, Annual Reviews, vol. 6(1), pages 503-524, August.
    6. Jan Heufer & Per Hjertstrand, 2015. "Homothetic Efficiency and Test Power: A Non-Parametric Approach," Tinbergen Institute Discussion Papers 15-064/I, Tinbergen Institute.
    7. Victor H Aguiar & Nail Kashaev, 2021. "Stochastic Revealed Preferences with Measurement Error [Consistency between Household-level Consumption Data from Registers and Surveys]," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 88(4), pages 2042-2093.
    8. Per Hjertstrand & James Swofford, 2014. "Are the choices of people stochastically rational? A stochastic test of the number of revealed preference violations," Empirical Economics, Springer, vol. 46(4), pages 1495-1519, June.
    9. Smeulders, Bart & Crama, Yves & Spieksma, Frits C.R., 2019. "Revealed preference theory: An algorithmic outlook," European Journal of Operational Research, Elsevier, vol. 272(3), pages 803-815.
    10. Tipoe, Eileen, 2021. "Price inattention: A revealed preference characterisation," European Economic Review, Elsevier, vol. 134(C).
    11. Eileen Tipoe & Abi Adams & Ian Crawford, 2022. "Revealed preference analysis and bounded rationality [Consume now or later? Time inconsistency, collective choice and revealed preference]," Oxford Economic Papers, Oxford University Press, vol. 74(2), pages 313-332.
    12. Cerqua, Augusto & Letta, Marco, 2022. "Local inequalities of the COVID-19 crisis," Regional Science and Urban Economics, Elsevier, vol. 92(C).
    13. Chen Zhen & Mary Muth & Abigail Okrent & Shawn Karns & Derick Brown & Peter Siegel, 2019. "Do differences in reported expenditures between household scanner data and expenditure surveys matter in health policy research?," Health Economics, John Wiley & Sons, Ltd., vol. 28(6), pages 782-800, June.
    14. Johann Pfitzinger, 2021. "An Interpretable Neural Network for Parameter Inference," Papers 2106.05536, arXiv.org.
    15. Davide Viviano & Jelena Bradic, 2019. "Synthetic learner: model-free inference on treatments over time," Papers 1904.01490, arXiv.org, revised Aug 2022.
    16. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    17. Akash Malhotra, 2018. "A hybrid econometric-machine learning approach for relative importance analysis: Prioritizing food policy," Papers 1806.04517, arXiv.org, revised Aug 2020.
    18. Michael C. Knaus, 2021. "A double machine learning approach to estimate the effects of musical practice on student’s skills," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(1), pages 282-300, January.
    19. Croux, Christophe & Jagtiani, Julapa & Korivi, Tarunsai & Vulanovic, Milos, 2020. "Important factors determining Fintech loan default: Evidence from a lendingclub consumer platform," Journal of Economic Behavior & Organization, Elsevier, vol. 173(C), pages 270-296.
    20. Michael C Knaus & Michael Lechner & Anthony Strittmatter, 2021. "Machine learning estimation of heterogeneous causal effects: Empirical Monte Carlo evidence," The Econometrics Journal, Royal Economic Society, vol. 24(1), pages 134-161.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:ajagec:v:104:y:2022:i:3:p:976-998. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://doi.org/10.1111/(ISSN)1467-8276 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.