IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i23p3794-d1533911.html
   My bibliography  Save this article

Unsupervised Modelling of E-Customers’ Profiles: Multiple Correspondence Analysis with Hierarchical Clustering of Principal Components and Machine Learning Classifiers

Author

Listed:
  • Vijoleta Vrhovac

    (Department of Industrial Engineering and Engineering Management, Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovića 6, 21000 Novi Sad, Serbia)

  • Marko Orošnjak

    (Department of Industrial Engineering and Engineering Management, Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovića 6, 21000 Novi Sad, Serbia)

  • Kristina Ristić

    (Department of Industrial Engineering and Engineering Management, Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovića 6, 21000 Novi Sad, Serbia)

  • Nemanja Sremčev

    (Department of Industrial Engineering and Engineering Management, Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovića 6, 21000 Novi Sad, Serbia)

  • Mitar Jocanović

    (Department of Industrial Engineering and Engineering Management, Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovića 6, 21000 Novi Sad, Serbia)

  • Jelena Spajić

    (Department of Industrial Engineering and Engineering Management, Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovića 6, 21000 Novi Sad, Serbia)

  • Nebojša Brkljač

    (Department of Industrial Engineering and Engineering Management, Faculty of Technical Sciences, University of Novi Sad, Trg Dositeja Obradovića 6, 21000 Novi Sad, Serbia)

Abstract

The rapid growth of e-commerce has transformed customer behaviors, demanding deeper insights into how demographic factors shape online user preferences. This study performed a threefold analysis to understand the impact of these changes. Firstly, this study investigated how demographic factors (e.g., age, gender, education) influence e-customer preferences in Serbia. From a sample of n = 906 respondents, conditional dependencies between demographics and user preferences were tested. From a hypothetical framework of 24 tested hypotheses, this study successfully rejected 8/24 (with p < 0.05), suggesting a high association between demographics with purchase frequency and reasons for quitting the purchase. However, although the reported test statistics suggested an association, understanding how interactions between categories shape e-customer profiles was still required. Therefore, the second part of this study considers an MCA-HCPC (Multiple Correspondence Analysis with Hierarchical Clustering on Principal Components) to identify user profiles. The analysis revealed three main clusters: (1) young, female, unemployed e-customers driven mainly by customer reviews; (2) retirees and older adults with infrequent purchases, hesitant to buy without experiencing the product in person; and (3) employed, highly educated, male, middle-aged adults who prioritize fast and accurate delivery over price. In the third stage, the clusters are used as labels for Machine Learning (ML) classification tasks. Particularly, Gradient Boosting Machine (GBM), Decision Tree (DT), k-Nearest Neighbors (kNN), Gaussian Naïve Bayes (GNB), Random Forest (RF), and Support Vector Machine (SVM) were used. The results suggested that GBM, RF, and SVM had high classification performance in identifying user profiles. Lastly, after performing Permutation Feature Importance (PFI), the findings suggested that age, work status, education, and income are the main determinants of shaping e-customer profiles and developing marketing strategies.

Suggested Citation

  • Vijoleta Vrhovac & Marko Orošnjak & Kristina Ristić & Nemanja Sremčev & Mitar Jocanović & Jelena Spajić & Nebojša Brkljač, 2024. "Unsupervised Modelling of E-Customers’ Profiles: Multiple Correspondence Analysis with Hierarchical Clustering of Principal Components and Machine Learning Classifiers," Mathematics, MDPI, vol. 12(23), pages 1-25, November.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:23:p:3794-:d:1533911
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/23/3794/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/23/3794/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Maja Martinović & Roko Barać & Hrvoje Maljak, 2024. "Exploring Croatian Consumer Adoption of Subscription-Based E-Commerce for Business Innovation," Administrative Sciences, MDPI, vol. 14(7), pages 1-21, July.
    2. Jianquan Guo & Xinxin Liu & Jungbok Jo, 2017. "Dynamic joint construction and optimal operation strategy of multi-period reverse logistics network: a case study of Shanghai apparel E-commerce enterprises," Journal of Intelligent Manufacturing, Springer, vol. 28(3), pages 819-831, March.
    3. Prateek Kalia, 2017. "Does Demographics Affect Purchase Frequency in Online Retail?," International Journal of Online Marketing (IJOM), IGI Global, vol. 7(2), pages 42-56, April.
    4. Heleen Buldeo Rai & Koen Mommens & Sara Verlinde & Cathy Macharis, 2019. "How Does Consumers’ Omnichannel Shopping Behaviour Translate into Travel and Transport Impacts? Case-Study of a Footwear Retailer in Belgium," Sustainability, MDPI, vol. 11(9), pages 1-19, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kokkinou, Alinda & Quak, Hans & Mitas, Ondrej & Mandemakers, Albert, 2024. "Should I wait or should I go? Encouraging customers to make the more sustainable delivery choice," Research in Transportation Economics, Elsevier, vol. 103(C).
    2. Magdalena Mucowska, 2021. "Trends of Environmentally Sustainable Solutions of Urban Last-Mile Deliveries on the E-Commerce Market—A Literature Review," Sustainability, MDPI, vol. 13(11), pages 1-26, May.
    3. Ashu Kedia & Diana Kusumastuti & Alan Nicholson, 2019. "Establishing Collection and Delivery Points to Encourage the Use of Active Transport: A Case Study in New Zealand Using a Consumer-Centric Approach," Sustainability, MDPI, vol. 11(22), pages 1-23, November.
    4. Qing Zhu & Renxian Zuo & Shan Liu & Fan Zhang, 2020. "Online dynamic group-buying community analysis based on high frequency time series simulation," Electronic Commerce Research, Springer, vol. 20(1), pages 81-118, March.
    5. Navid Zarbakhshnia & Devika Kannan & Reza Kiani Mavi & Hamed Soleimani, 2020. "A novel sustainable multi-objective optimization model for forward and reverse logistics system under demand uncertainty," Annals of Operations Research, Springer, vol. 295(2), pages 843-880, December.
    6. Beckers, Joris & Cardenas, Ivan & Le Pira, Michela & Zhang, Jia, 2023. "Exploring Logistics-as-a-Service to integrate the consumer into urban freight," Research in Transportation Economics, Elsevier, vol. 101(C).
    7. Buldeo Rai, Heleen, 2021. "The net environmental impact of online shopping, beyond the substitution bias," Journal of Transport Geography, Elsevier, vol. 93(C).
    8. Alaa Eddine El Moussaoui & Brahim Benbba & Anicia Jaegler & Taoufiq El Moussaoui & Zineb El Andaloussi & Loqman Chakir, 2023. "Consumer Perceptions of Online Shopping and Willingness to Use Pick-Up Points: A Case Study of Morocco," Sustainability, MDPI, vol. 15(9), pages 1-19, April.
    9. Susanne Feichtinger & Manfred Gronalt, 2021. "The Environmental Impact of Transport Activities for Online and In-Store Shopping: A Systematic Literature Review to Identify Relevant Factors for Quantitative Assessments," Sustainability, MDPI, vol. 13(5), pages 1-23, March.
    10. Mashalah, Heider Al & Hassini, Elkafi & Gunasekaran, Angappa & Bhatt (Mishra), Deepa, 2022. "The impact of digital transformation on supply chains through e-commerce: Literature review and a conceptual framework," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 165(C).
    11. Mommens, Koen & Buldeo Rai, Heleen & van Lier, Tom & Macharis, Cathy, 2021. "Delivery to homes or collection points? A sustainability analysis for urban, urbanised and rural areas in Belgium," Journal of Transport Geography, Elsevier, vol. 94(C).
    12. Marcin Gąsior, 2021. "Environmental Attitudes and Willingness to Purchase Online—Classification Approach," Sustainability, MDPI, vol. 13(15), pages 1-17, August.
    13. Paulo Rita & Ricardo F. Ramos, 2022. "Global Research Trends in Consumer Behavior and Sustainability in E-Commerce: A Bibliometric Analysis of the Knowledge Structure," Sustainability, MDPI, vol. 14(15), pages 1-20, August.
    14. Angelos Pantouvakis & Anastasia Gerou, 2022. "The Theoretical and Practical Evolution of Customer Journey and Its Significance in Services Sustainability," Sustainability, MDPI, vol. 14(15), pages 1-16, August.
    15. Fernando Tobal Berssaneti & Simone Berger & Ana Maria Saut & Rosangela Maria Vanalle & José Carlos Curvelo Santana, 2019. "Value Generation of Remanufactured Products: Multi-Case Study of Third-Party Companies," Sustainability, MDPI, vol. 11(3), pages 1-21, January.
    16. Sotirios Zygiaris, 2022. "The Impact of Innovation Systems on E-commerce Capacity," Journal of the Knowledge Economy, Springer;Portland International Center for Management of Engineering and Technology (PICMET), vol. 13(1), pages 276-289, March.
    17. Giacomo Lozzi & Gabriele Iannaccone & Ila Maltese & Valerio Gatta & Edoardo Marcucci & Riccardo Lozzi, 2022. "On-Demand Logistics: Solutions, Barriers, and Enablers," Sustainability, MDPI, vol. 14(15), pages 1-21, August.
    18. Maja Kiba-Janiak & Katarzyna Cheba & Magdalena Mucowska & Leise Kelli de Oliveira, 2022. "Segmentation of e-customers in terms of sustainable last-mile delivery," Oeconomia Copernicana, Institute of Economic Research, vol. 13(4), pages 1117-1142, December.
    19. Joshi, Aparna & Pani, Agnivesh & Sahu, Prasanta K. & Majumdar, Bandhan Bandhu & Tavasszy, Lóránt, 2024. "Gender and generational differences in omnichannel shopping travel decisions: What drives consumer choices to pick up in-store or ship direct?," Research in Transportation Economics, Elsevier, vol. 103(C).
    20. Beckers, Joris & Cardenas, Ivan & Sanchez-Diaz, Ivan, 2022. "Managing household freight: The impact of online shopping on residential freight trips," Transport Policy, Elsevier, vol. 125(C), pages 299-311.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:23:p:3794-:d:1533911. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.