IDEAS home Printed from https://ideas.repec.org/a/eee/tefoso/v191y2023ics0040162523001166.html
   My bibliography  Save this article

Segmenting with big data analytics and Python: A quantitative exploratory analysis of household savings

Author

Listed:
  • Cuomo, Maria Teresa
  • Tortora, Debora
  • Colosimo, Ivan
  • Ricciardi Celsi, Lorenzo
  • Genovino, Cinzia
  • Festa, Giuseppe
  • La Rocca, Michele

Abstract

According to the national balance sheets of the most advanced economies, despite a recent sharp decline in per capita net wealth, Italian private households present a higher rate among the wealthiest and least indebted in Europe. Recently, the COVID-19 outbreak caused a new leap in households' savings worldwide, particularly in advanced economies and Italy. This study underlines that using advanced analytics tools, household saving behaviour information, and big data analytics may support data-driven decision approaches addressing the management of complex relationships in the financial arena. More specifically, using exploratory and predictive analyses based on big data analytics and machine learning, this study aims to provide extensive customer profiling in the household saving sector in Italy, supporting a data-driven decision-making approach. A profiling of household savings has been defined using the information provided by big data analysis. To proceed in this direction, the hardware and software requirements necessary to perform data processing were considered in the first phase of the study. Data collection was performed according to the so-called extract, transform, load (ETL) process. The contribution of this study lies in the results obtained in terms of data analytics over a dataset that accounts for the purchasing behaviour of almost 20 million postal savers. The clustering algorithm is highly efficient and scales well for large datasets. K-means clustering can be implemented within the MapReduce computational framework. Therefore, the overall procedure proposed here can be easily extended to big data using parallel computing and software implementing MapReduce, such as Hadoop and Spark.

Suggested Citation

  • Cuomo, Maria Teresa & Tortora, Debora & Colosimo, Ivan & Ricciardi Celsi, Lorenzo & Genovino, Cinzia & Festa, Giuseppe & La Rocca, Michele, 2023. "Segmenting with big data analytics and Python: A quantitative exploratory analysis of household savings," Technological Forecasting and Social Change, Elsevier, vol. 191(C).
  • Handle: RePEc:eee:tefoso:v:191:y:2023:i:c:s0040162523001166
    DOI: 10.1016/j.techfore.2023.122431
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0040162523001166
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.techfore.2023.122431?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Nicola Fuchs‐Schündeln & Paolo Masella & Hannah Paule‐Paludkiewicz, 2020. "Cultural Determinants of Household Saving Behavior," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 52(5), pages 1035-1070, August.
    2. Luis Hernández & Carlos Baladrón & Javier M. Aguiar & Belén Carro & Antonio Sánchez-Esguevillas, 2012. "Classification and Clustering of Electricity Demand Patterns in Industrial Parks," Energies, MDPI, vol. 5(12), pages 1-14, December.
    3. Ricardo Bebczuk & Leonardo Gasparini & Noelia Garbero & Julian Amendolaggine, 2015. "Understanding the Determinants of Household Saving: Micro Evidence for Latin America," CEDLAS, Working Papers 0189, CEDLAS, Universidad Nacional de La Plata.
    4. Thaler, Richard H & Shefrin, H M, 1981. "An Economic Theory of Self-Control," Journal of Political Economy, University of Chicago Press, vol. 89(2), pages 392-406, April.
    5. Paolo Acciari & Salvatore Morelli, 2020. "Wealth Transfers and Net Wealth at Death: Evidence from the Italian Inheritance Tax Records 1995–2016," NBER Chapters, in: Measuring Distribution and Mobility of Income and Wealth, pages 175-203, National Bureau of Economic Research, Inc.
    6. Milton Friedman, 1957. "Introduction to "A Theory of the Consumption Function"," NBER Chapters, in: A Theory of the Consumption Function, pages 1-6, National Bureau of Economic Research, Inc.
    7. Elisa Guglielminetti & Concetta Rondinelli, 2021. "Consumption and saving patterns in Italy during Covid-19," Questioni di Economia e Finanza (Occasional Papers) 620, Bank of Italy, Economic Research and International Relations Area.
    8. Miles S. Kimball, 1990. "Precautionary Saving and the Marginal Propensity to Consume," NBER Working Papers 3403, National Bureau of Economic Research, Inc.
    9. OR Attanasio & J Banks, 2001. "The assessment: household saving - issues in theory and policy," Oxford Review of Economic Policy, Oxford University Press and Oxford Review of Economic Policy Limited, vol. 17(1), pages 1-19, Spring.
    10. Milton Friedman, 1957. "A Theory of the Consumption Function," NBER Books, National Bureau of Economic Research, Inc, number frie57-1.
    11. Venieris, Yiannis P & Gupta, Dipak K, 1986. "Income Distribution and Sociopolitical Instability as Determinants of Savings: A Cross-sectional Model," Journal of Political Economy, University of Chicago Press, vol. 94(4), pages 873-883, August.
    12. Oleg V. Buklemishev, 2020. "Coronavirus crisis and its effects on the economy," Population and Economics, ARPHA Platform, vol. 4(2), pages 13-17, April.
    13. Campbell, John Y & Mankiw, N Gregory, 1990. "Permanent Income, Current Income, and Consumption," Journal of Business & Economic Statistics, American Statistical Association, vol. 8(3), pages 265-279, July.
    14. Gulnur MURADOGLU & Fatma TASKIN, 1996. "Differences In Household Savings Behavior: Evidence From Industrial And Developing Countries," The Developing Economies, Institute of Developing Economies, vol. 34(2), pages 138-153, June.
    15. Annamaria Lusardi, 2008. "Household Saving Behavior: The Role of Financial Literacy, Information, and Financial Education Programs," NBER Working Papers 13824, National Bureau of Economic Research, Inc.
    16. Lusardi, Annamaria, 1998. "On the Importance of the Precautionary Saving Motive," American Economic Review, American Economic Association, vol. 88(2), pages 449-453, May.
    17. David Roubaud & Rameshwar Dubey & Cyril Foropon & Angappa Gunasekaran & Stephen J. Childe & Zongwei Luo & Fosso Wamba Samuel, 2018. "Examining the role of big data and predictive analytics on collaborative performance in context to sustainable consumption and production behaviour," Post-Print hal-02051276, HAL.
    18. Chunhui Yuan & Haitao Yang, 2019. "Research on K-Value Selection Method of K-Means Clustering Algorithm," J, MDPI, vol. 2(2), pages 1-10, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Patti Fisher, 2013. "Is There Evidence of Loss Aversion in Saving Behaviors in Spain?," Journal of Family and Economic Issues, Springer, vol. 34(1), pages 41-51, March.
    2. Bande, Roberto & Riveiro, Dolores & Ruiz, Freddy, 2021. "Does Uncertainty Affect Saving Decisions of Colombian Households? Evidence on Precautionary Saving," MPRA Paper 106771, University Library of Munich, Germany.
    3. Patti Fisher & Catherine Montalto, 2011. "Loss Aversion and Saving Behavior: Evidence from the 2007 U.S. Survey of Consumer Finances," Journal of Family and Economic Issues, Springer, vol. 32(1), pages 4-14, March.
    4. Marieka M. Klawitter & C. Leigh Anderson & Mary Kay Gugerty, 2013. "Savings And Personal Discount Rates In A Matched Savings Program For Low-Income Families," Contemporary Economic Policy, Western Economic Association International, vol. 31(3), pages 468-485, July.
    5. Bernd Hayo & Matthias Uhl, 2017. "Taxation and consumption: evidence from a representative survey of the German population," Applied Economics, Taylor & Francis Journals, vol. 49(53), pages 5477-5490, November.
    6. Annamaria Lusardi, 2000. "Explaining Why So Many Households Do Not Save," Working Papers 0001, Harris School of Public Policy Studies, University of Chicago.
    7. Ricardo Bebczuk & Leonardo Gasparini & Noelia Garbero & Julian Amendolaggine, 2015. "Understanding the Determinants of Household Saving: Micro Evidence for Latin America," CEDLAS, Working Papers 0189, CEDLAS, Universidad Nacional de La Plata.
    8. Luc Arrondel & Hector Calvo Pardo, 2008. "Les Français sont-ils prudents ? Patrimoine et risque sur les revenus des ménages," Working Papers halshs-00585994, HAL.
    9. Rodepeter, Ralf & Winter, Joachim, 1999. "Rules of thumb in life-cycle savings models," Sonderforschungsbereich 504 Publications 99-81, Sonderforschungsbereich 504, Universität Mannheim;Sonderforschungsbereich 504, University of Mannheim.
    10. Caliendo, Frank & Aadland, David, 2007. "Short-term planning and the life-cycle consumption puzzle," Journal of Economic Dynamics and Control, Elsevier, vol. 31(4), pages 1392-1415, April.
    11. Rajat Deb, 2016. "Determinants of Savings in Sukanya Samriddhi Account: Evidence from Tripura," IIM Kozhikode Society & Management Review, , vol. 5(2), pages 120-140, July.
    12. Carroll, Christopher D., 2009. "Precautionary saving and the marginal propensity to consume out of permanent income," Journal of Monetary Economics, Elsevier, vol. 56(6), pages 780-790, September.
    13. Chen, Kevin Z. & D. Meilke, Karl & Turvey, Calum, 1999. "Income risk and farm consumption behavior," Agricultural Economics, Blackwell, vol. 20(2), pages 173-183, March.
    14. Levin, Mark (Левин, Марк) & Matrosova, Ksenia (Матросова, Ксения), 2018. "Development and Research of Economic Behavior of Households in Changing Conditions [Разработка И Исследование Экономического Поведения Домохозяйств В Изменяющихся Условиях]," Working Papers 041825, Russian Presidential Academy of National Economy and Public Administration.
    15. repec:ptu:bdpart:r201610 is not listed on IDEAS
    16. Roberto Bande & Dolores Riveiro, 2013. "Private Saving Rates and Macroeconomic Uncertainty: Evidence from Spanish Regional Data," The Economic and Social Review, Economic and Social Studies, vol. 44(3), pages 323-349.
    17. Baugh, Brian & Ben-David, Itzhak & Park, Hoonsuk, 2013. "Disentangling Financial Constraints, Precautionary Savings, and Myopia: Household Behavior Surrounding Federal Tax Returns," Working Paper Series 2013-20, Ohio State University, Charles A. Dice Center for Research in Financial Economics.
    18. Miriam Beblo & Sven Schreiber, 2022. "Leisure and housing consumption after retirement: new evidence on the life-cycle hypothesis," Review of Economics of the Household, Springer, vol. 20(1), pages 305-330, March.
    19. Takala, Kari, 1995. "The consumption function revisited : an error-correction model for Finnish consumption," Research Discussion Papers 20/1995, Bank of Finland.
    20. Mervyn A. King, 1983. "The Economics of Saving," NBER Working Papers 1247, National Bureau of Economic Research, Inc.
    21. Brunila, Anne, 1996. "Fiscal policy and private consumption : Saving decisions : Evidence from Finland," Research Discussion Papers 28/1996, Bank of Finland.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:tefoso:v:191:y:2023:i:c:s0040162523001166. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.sciencedirect.com/science/journal/00401625 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.