IDEAS home Printed from https://ideas.repec.org/a/ers/journl/vxxviiy2024ispecialap72-82.html
   My bibliography  Save this article

Use of Autoencoder and One-Hot Encoding for Customer Segmentation

Author

Listed:
  • Tomasz Smutek
  • Jan Sikora
  • Sylwester Bogacki
  • Marek Rutkowski
  • Dariusz Wozniak

Abstract

Purpose: The research article aims to apply Autoencoder and One-Hot Encoding techniques to the segmentation of retail customers, exploring how these methodologies can contribute to a more refined and actionable segmentation process. Design/Methodology/Approach: The study uses a dataset comprising detailed profiles of 2240 retail customers, applying Autoencoders and One-Hot Encoding to categorize customers into distinct segments. It evaluates Autoencoders' embeddings and compares them with the traditional One-Hot Encoding method. The effectiveness of the segmentation is further analyzed using various clustering algorithms, including K-means, DBSCAN, Louvain Community Detection, Greedy Modularity, and Label Propagation. The research assesses clustering quality using indices such as the Caliński-Harabasz, Davies-Bouldin, and modularity metrics. Findings: Application of the Louvain method with a cut-off parameter of 0.75 using AutoEmbedder revealed three evenly distributed customer groups, albeit with slightly lower Caliński-Harabasz and Davies-Bouldin index values than those obtained by the Greedy method using AutoEmbedder with a cut-off parameter of 0.5. However, the Louvain method exhibited higher modularity, indicating more cohesive segmentation. Comparisons between AutoEmbedder and One-Hot Encoding suggested the superiority of AutoEmbedder in forming customer clusters. Practical Implications: The findings present actionable insights for marketing strategists to develop targeted campaigns based on customer expenditure patterns. By identifying customer segments with similar attributes, businesses can allocate marketing resources more effectively and tailor strategies to meet the specific needs of each segment. Originality/Value: The article introduces a novel comparison between Autoencoder embeddings and traditional One-Hot Encoding in the context of customer segmentation, providing evidence of the former's enhanced capability in creating more meaningful and modular customer groups. It also extends the discussion on clustering quality assessment in the segmentation process, adding value to marketing analytics.

Suggested Citation

  • Tomasz Smutek & Jan Sikora & Sylwester Bogacki & Marek Rutkowski & Dariusz Wozniak, 2024. "Use of Autoencoder and One-Hot Encoding for Customer Segmentation," European Research Studies Journal, European Research Studies Journal, vol. 0(Special A), pages 72-82.
  • Handle: RePEc:ers:journl:v:xxvii:y:2024:i:speciala:p:72-82
    as

    Download full text from publisher

    File URL: https://ersj.eu/journal/3388/download
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. repec:ers:journl:v:xxiv:y:2021:i:special2:p:513-522 is not listed on IDEAS
    2. repec:ers:journl:v:xxiv:y:2021:i:special2:p:335-345 is not listed on IDEAS
    3. Katarzyna Czainska & Aleksandra Sus & Eleftherios I. Thalassinos, 2021. "Sustainable Survival: Resource Management Strategy in Micro and Small Enterprises in the Rubber Products Market in Poland during the COVID-19 Pandemic," Resources, MDPI, vol. 10(8), pages 1-21, August.
    4. Pawel Rymarczyk & Piotr Golabek & Sylwia Skrzypek - Ahmed & Magdalena Rzemieniak, 2021. "Profiling and Segmenting Clients with the Use of Machine Learning Algorithms," European Research Studies Journal, European Research Studies Journal, vol. 0(Special 1), pages 513-522.
    5. Pawel Rymarczyk & Piotr Bednarczuk & Ryszard Nowak & Tomasz Cieplak, 2021. "Methods of Analyzing Consumer Behavior Based on Multi-Source Data," European Research Studies Journal, European Research Studies Journal, vol. 0(Special 1), pages 335-345.
    6. A.G. Polyakova & M.P. Loginov & A.I. Serebrennikova & E.I. Thalassinos, 2019. "Design of a Socio-economic Processes Monitoring System Based on Network Analysis and Big Data," International Journal of Economics & Business Administration (IJEBA), International Journal of Economics & Business Administration (IJEBA), vol. 0(1), pages 130-139.
    7. Marta Kadłubek & Eleftherios Thalassinos & Joanna Domagała & Sandra Grabowska & Sebastian Saniuk, 2022. "Intelligent Transportation System Applications and Logistics Resources for Logistics Customer Service in Road Freight Transport Enterprises," Energies, MDPI, vol. 15(13), pages 1-27, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pawel Rymarczyk & Cezary Figura & Lukasz Wojciechowski & Kamila Cwik & Piotr Stalinski, 2024. "Evaluating the Effectiveness of Advertising Campaigns in the Fast-Food Industry Using an Analytical Engine," European Research Studies Journal, European Research Studies Journal, vol. 0(Special A), pages 126-136.
    2. Malgorzata Gorzalczynska-Koczkodaj, 2023. "Intelligent Specializations as an Opportunity for Regional Development on the Example of the West Pomeranian Voivodeship," European Research Studies Journal, European Research Studies Journal, vol. 0(4), pages 446-455.
    3. Yu.V. Przhedetskiy & N.V. Przhedetskaya & K.V. Borzenko & V.A. Bondarenko, 2019. "Blockchain Technologies in Healthcare Institutions: Focus on Security and Effective Cooperation with the Government," International Journal of Economics & Business Administration (IJEBA), International Journal of Economics & Business Administration (IJEBA), vol. 0(Special 2), pages 92-99.
    4. Tomasz Rokicki & Piotr Borawski & Aneta Beldycka-Borawska & Andras Szeberenyi & Luiza Ochnio & Bogdan Klepacki, 2024. "Resilience of Supply Chains in the Automotive Industry during the COVID-19 Pandemic on the Example of Polish Enterprises," European Research Studies Journal, European Research Studies Journal, vol. 0(1), pages 238-252.
    5. Garnov & A. & Zvyagin & L. & Sviridova & O., 2019. "System Data Analysis: Innovative Technologies, Methods and Techniques," International Journal of Economics & Business Administration (IJEBA), International Journal of Economics & Business Administration (IJEBA), vol. 0(Special 1), pages 26-39.
    6. Gabrielli, Gianluca & Magri, Carlotta & Medioli, Alice & Marchini, Pier Luigi, 2024. "The power of big data affordances to reshape anti-fraud strategies," Technological Forecasting and Social Change, Elsevier, vol. 205(C).
    7. Ilona Jacyna-Gołda & Nadiia Shmygol & Nataliia Gavkalova & Mariusz Salwin, 2023. "Sustainable Development of Intermodal Freight Transportation—Through the Integration of Logistics Flows in Ukraine and Poland," Sustainability, MDPI, vol. 16(1), pages 1-13, December.
    8. Arzhenovskiy S.V. & Bakhteev A.V. & Sinyavskaya T.G. & Hahonova N.N., 2019. "Audit Risk Assessment Model," International Journal of Economics & Business Administration (IJEBA), International Journal of Economics & Business Administration (IJEBA), vol. 0(Special 1), pages 74-85.
    9. Ivanchenko O.V. & Mirgorodskaya O.N. & Baraulya E.V. & Putilina T.I., 2019. "Marketing Relations and Communication Infrastructure Development in the Banking Sector Based on Big Data Mining," International Journal of Economics & Business Administration (IJEBA), International Journal of Economics & Business Administration (IJEBA), vol. 0(Special 2), pages 176-184.
    10. Semenyuta O.G. & Andreeva A.V. & Sichev R.A. & Filippov Yu.M., 2019. "Digital Technologies in Lending Small and Medium-Size Enterprises in Russia," International Journal of Economics & Business Administration (IJEBA), International Journal of Economics & Business Administration (IJEBA), vol. 0(Special 1), pages 40-52.
    11. Beata Meyer, 2023. "Use of Green and Water Areas in the Process of City Image Creation on the Example of Szczecin," European Research Studies Journal, European Research Studies Journal, vol. 0(4), pages 553-562.
    12. N.E. Goryushkina & T.V. Gaifutdinova & E.V. Logvina & A.G. Redkin & V.V. Kudryavtsev & Y.N. Shol, 2019. "Basic Principles of Tourist Services Market Segmentation," International Journal of Economics & Business Administration (IJEBA), International Journal of Economics & Business Administration (IJEBA), vol. 0(2), pages 139-150.
    13. Lavrova Tatyana (Лаврова Т.Б.) & Polyakova Aleksandra (Полякова А.Г.), 2020. "Development Of The Unified Information System For The Russian Federation Civil Service Human Resources Management [Развитие Единой Информационной Системы Управления Кадровым Составом Государственно," State and Municipal Management Scholar Notes, Russian Presidential Academy of National Economy and Public Administration, vol. 1, pages 33-40.
    14. Julia Nowicka & Zbigniew Ciekanowski & Mariusz Czternastek & Agnieszka Krol & Marzena Kacprzak, 2024. "Navigating Hybrid Threats: Advanced Security Solutions for Modern Organizations," European Research Studies Journal, European Research Studies Journal, vol. 0(2), pages 488-499.
    15. R.A. Ramazanov, 2019. "Development of Electronic Communications in the Financial Market-Based System," International Journal of Economics & Business Administration (IJEBA), International Journal of Economics & Business Administration (IJEBA), vol. 0(Special 1), pages 86-92.
    16. Przemyslaw Ruta & Joanna Kubicka & Yurii Vitkovskyi & Marcin Budzinski & Magdalena Dobrzańska-Rzepecka, 2024. "Preparing Polish Micro-Enterprises for the Loss of Liquidity," European Research Studies Journal, European Research Studies Journal, vol. 0(Special B), pages 3-16.
    17. Antonios Adamopoulos & Eleftherios I. Thalassinos, 2020. "Tourism Development and Economic Growth: A Comparative Study for the G-6 Leaders," European Research Studies Journal, European Research Studies Journal, vol. 0(1), pages 368-380.
    18. L. Poplawski, 2020. "Development Planning versus Participation of Inhabitants in Management," European Research Studies Journal, European Research Studies Journal, vol. 0(1), pages 3-12.
    19. Letife Özdemir & Ercan Özen & Simon Grima & Yannis Thalassinos, 2019. "Causality between Spot and Future Markets of the Borsa Istanbul Index and the Dow Jones Industrial Average," International Journal of Finance, Insurance and Risk Management, International Journal of Finance, Insurance and Risk Management, vol. 9(3-4), pages 115-131.
    20. Andrey Paptsov & Vasiliy Nechaev & Pavel Valerievich Mikhailushkin, 2019. "Towards to a single innovation space in the agrarian sector of the member states of the Eurasian economic union: a case study," Entrepreneurship and Sustainability Issues, VsI Entrepreneurship and Sustainability Center, vol. 7(1), pages 637-648, September.

    More about this item

    Keywords

    Autoencoder; One-Hot Encoding; customer segmentation; machine learning; clustering algorithms.;
    All these keywords.

    JEL classification:

    • C45 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Neural Networks and Related Topics
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • D12 - Microeconomics - - Household Behavior - - - Consumer Economics: Empirical Analysis
    • L11 - Industrial Organization - - Market Structure, Firm Strategy, and Market Performance - - - Production, Pricing, and Market Structure; Size Distribution of Firms
    • L86 - Industrial Organization - - Industry Studies: Services - - - Information and Internet Services; Computer Software
    • M15 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Business Administration - - - IT Management

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ers:journl:v:xxvii:y:2024:i:speciala:p:72-82. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Marios Agiomavritis (email available below). General contact details of provider: https://ersj.eu/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.