IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v10y2022i19p3554-d928980.html
   My bibliography  Save this article

Text Mining of User-Generated Content (UGC) for Business Applications in E-Commerce: A Systematic Review

Author

Listed:
  • Shugang Li

    (School of Management, Shanghai University, Shanghai 200444, China)

  • Fang Liu

    (School of Management, Shanghai University, Shanghai 200444, China)

  • Yuqi Zhang

    (School of Management, Shanghai University, Shanghai 200444, China)

  • Boyi Zhu

    (School of Management, Shanghai University, Shanghai 200444, China)

  • He Zhu

    (School of Management, Shanghai University, Shanghai 200444, China)

  • Zhaoxu Yu

    (Department of Automation, East China University of Science and Technology, Shanghai 200237, China)

Abstract

In the Web2.0 era, user-generated content (UGC) provides a valuable source of data to aid in understanding consumers and driving intelligent business. Text mining techniques, such as semantic analysis and sentiment analysis, help to extract meaningful information embedded in UGC. However, research on text mining of UGC for e-commerce business applications involves interdisciplinary knowledge, and few studies have systematically summarized the research framework and application directions of related research in this field. First, based on e-commerce practice, in this study, we derive a general framework to summarize the mainstream research in this field. Second, widely used text mining techniques are introduced, including semantic and sentiment analysis. Furthermore, we analyze the development status of semantic analysis in terms of text representation and semantic understanding. Then, the definition, development, and technical classification of sentiment analysis techniques are introduced. Third, we discuss mainstream directions of text mining for business applications, ranging from high-quality UGC detection and consumer profiling, to product enhancement and marketing. Finally, research gaps with respect to these efforts are emphasized, and suggestions are provided for future work. We also provide prospective directions for future research.

Suggested Citation

  • Shugang Li & Fang Liu & Yuqi Zhang & Boyi Zhu & He Zhu & Zhaoxu Yu, 2022. "Text Mining of User-Generated Content (UGC) for Business Applications in E-Commerce: A Systematic Review," Mathematics, MDPI, vol. 10(19), pages 1-26, September.
  • Handle: RePEc:gam:jmathe:v:10:y:2022:i:19:p:3554-:d:928980
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/10/19/3554/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/10/19/3554/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Wei Hong & Changyuan Zheng & Linhai Wu & Xujin Pu, 2019. "Analyzing the Relationship between Consumer Satisfaction and Fresh E-Commerce Logistics Service Using Text Mining Techniques," Sustainability, MDPI, vol. 11(13), pages 1-16, June.
    2. Nanne, Annemarie J. & Antheunis, Marjolijn L. & van der Lee, Chris G. & Postma, Eric O. & Wubben, Sander & van Noort, Guda, 2020. "The Use of Computer Vision to Analyze Brand-Related User Generated Image Content," Journal of Interactive Marketing, Elsevier, vol. 50(C), pages 156-167.
    3. Chatterjee, Swagato & Goyal, Divesh & Prakash, Atul & Sharma, Jiwan, 2021. "Exploring healthcare/health-product ecommerce satisfaction: A text mining and machine learning application," Journal of Business Research, Elsevier, vol. 131(C), pages 815-825.
    4. Yue Ma & Guoqing Chen & Qiang Wei, 2017. "Finding users preferences from large-scale online reviews for personalized recommendation," Electronic Commerce Research, Springer, vol. 17(1), pages 3-29, March.
    5. Liu, Weihua & Long, Shangsong & Xie, Dong & Liang, Yanjie & Wang, Jinkun, 2021. "How to govern the big data discriminatory pricing behavior in the platform service supply chain?An examination with a three-party evolutionary game model," International Journal of Production Economics, Elsevier, vol. 231(C).
    6. Hui Zhang & Huguang Rao & Junzheng Feng, 2018. "Product innovation based on online review data mining: a case study of Huawei phones," Electronic Commerce Research, Springer, vol. 18(1), pages 3-22, March.
    7. Huosong Xia & Yitai Yang & Xiaoting Pan & Zuopeng Zhang & Wuyue An, 2020. "Sentiment analysis for online reviews using conditional random fields and support vector machines," Electronic Commerce Research, Springer, vol. 20(2), pages 343-360, June.
    8. Jianping Li & Yinhong Yao & Yuanjie Xu & Jingyu Li & Lu Wei & Xiaoqian Zhu, 2019. "Consumer’s risk perception on the Belt and Road countries: evidence from the cross-border e-commerce," Electronic Commerce Research, Springer, vol. 19(4), pages 823-840, December.
    9. Jian-Wu Bi & Yang Liu & Zhi-Ping Fan & Erik Cambria, 2019. "Modelling customer satisfaction from online reviews using ensemble neural network and effect-based Kano model," International Journal of Production Research, Taylor & Francis Journals, vol. 57(22), pages 7068-7088, November.
    10. Nilanshi Chauhan & Pardeep Singh, 2017. "Identifying the Opinion Orientation of Online Product Reviews at Feature Level: A Pruning Approach," International Journal of Information System Modeling and Design (IJISMD), IGI Global, vol. 8(2), pages 92-111, April.
    11. Shugang Li & Yuqi Zhang & Yueming Li & Zhaoxu Yu, 2021. "The user preference identification for product improvement based on online comment patch," Electronic Commerce Research, Springer, vol. 21(2), pages 423-444, June.
    12. Sunil Saumya & Jyoti Prakash Singh, 2022. "Spam review detection using LSTM autoencoder: an unsupervised approach," Electronic Commerce Research, Springer, vol. 22(1), pages 113-133, March.
    13. Donthu, Naveen & Kumar, Satish & Pandey, Neeraj & Pandey, Nitesh & Mishra, Akanksha, 2021. "Mapping the electronic word-of-mouth (eWOM) research: A systematic review and bibliometric analysis," Journal of Business Research, Elsevier, vol. 135(C), pages 758-773.
    14. Hui Yuan & Wei Xu & Qian Li & Raymond Lau, 2018. "Topic sentiment mining for sales performance prediction in e-commerce," Annals of Operations Research, Springer, vol. 270(1), pages 553-576, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhang, Dianfeng & Shen, Zifan & Li, Yanlai, 2023. "Requirement analysis and service optimization of multiple category fresh products in online retailing using importance-Kano analysis," Journal of Retailing and Consumer Services, Elsevier, vol. 72(C).
    2. Satish Kumar & Weng Marc Lim & Nitesh Pandey & J. Christopher Westland, 2021. "20 years of Electronic Commerce Research," Electronic Commerce Research, Springer, vol. 21(1), pages 1-40, March.
    3. Yanlai Li & Zifan Shen & Cuiming Zhao & Kwai-Sang Chin & Xuwei Lang, 2024. "Understanding Customer Opinion Change on Fresh Food E-Commerce Products and Services—Comparative Analysis before and during COVID-19 Pandemic," Sustainability, MDPI, vol. 16(7), pages 1-22, March.
    4. Wang, Binni & Wang, Pong & Tu, Yiliu, 2021. "Customer satisfaction service match and service quality-based blockchain cloud manufacturing," International Journal of Production Economics, Elsevier, vol. 240(C).
    5. Liang Xiao & Linyong Luo & Tongping Ke, 2024. "The influence of eWOM information structures on consumers’ purchase intentions," Electronic Commerce Research, Springer, vol. 24(3), pages 1713-1735, September.
    6. Pahrudin Pahrudin & Tsung-Hua Hsieh & Li-Wei Liu & Chia-Chun Wang, 2023. "The Role of Information Sources on Tourist Behavior Post-Earthquake Disaster in Indonesia: A Stimulus–Organism–Response (SOR) Approach," Sustainability, MDPI, vol. 15(11), pages 1-23, May.
    7. Nasiri, Mohammad Sadegh & Shokouhyar, Sajjad, 2021. "Actual consumers' response to purchase refurbished smartphones: Exploring perceived value from product reviews in online retailing," Journal of Retailing and Consumer Services, Elsevier, vol. 62(C).
    8. Liu, Changyu & Song, Yadong & Wang, Wei & Shi, Xunpeng, 2023. "The governance of manufacturers’ greenwashing behaviors: A tripartite evolutionary game analysis of electric vehicles," Applied Energy, Elsevier, vol. 333(C).
    9. Azer, Jaylan & Anker, Thomas & Taheri, Babak & Tinsley, Ross, 2023. "Consumer-Driven racial stigmatization: The moderating role of race in online consumer-to-consumer reviews," Journal of Business Research, Elsevier, vol. 157(C).
    10. Villarroel Ordenes, Francisco & Silipo, Rosaria, 2021. "Machine learning for marketing on the KNIME Hub: The development of a live repository for marketing applications," Journal of Business Research, Elsevier, vol. 137(C), pages 393-410.
    11. Manta Eduard Mihai & Davidescu Adriana Ana Maria & Geambasu Maria Cristina & Florescu Margareta Stela, 2023. "Exploring the research area of direct taxation. An empirical analysis based on bibliometric analysis results," Management & Marketing, Sciendo, vol. 18(s1), pages 355-383, December.
    12. Maria Rostasova & Anna Padourova & Tatiana Corejova, 2020. "KANO model as a tool of effective customer satisfaction diagnostics of postal services," Entrepreneurship and Sustainability Issues, VsI Entrepreneurship and Sustainability Center, vol. 8(2), pages 811-828, December.
    13. Yuanyuan Guo & Yanqing Wang & Chaoyou Wang, 2019. "Exploring the Salient Attributes of Short-Term Rental Experience: An Analysis of Online Reviews from Chinese Guests," Sustainability, MDPI, vol. 11(16), pages 1-19, August.
    14. Shahzad, Muhammad Farrukh & Xu, Shuo & An, Xin & Javed, Iqra, 2024. "Assessing the impact of AI-chatbot service quality on user e-brand loyalty through chatbot user trust, experience and electronic word of mouth," Journal of Retailing and Consumer Services, Elsevier, vol. 79(C).
    15. Renjie Zhang & Hsingwei Tai & Kuotai Cheng & Huizhong Dong & Wenhui Liu & Junjie Hou, 2022. "Carbon Emission Efficiency Network: Evolutionary Game and Sensitivity Analysis between Differentiated Efficiency Groups and Local Governments," Sustainability, MDPI, vol. 14(4), pages 1-19, February.
    16. Gaofeng Guan & Dong Liu & Jiayang Zhai, 2022. "Factors Influencing Consumer Satisfaction of Fresh Produce E-Commerce in the Background of COVID-19—A Hybrid Approach Based on LDA-SEM-XGBoost," Sustainability, MDPI, vol. 14(24), pages 1-19, December.
    17. Baranowski Paweł & Korczak Karol & Zając Jarosław, 2020. "Forecasting Cinema Attendance at the Movie Show Level: Evidence from Poland," Business Systems Research, Sciendo, vol. 11(1), pages 73-88, March.
    18. Nan Yang & Nikolaos Korfiatis & Dimitris Zissis & Konstantina Spanaki, 2024. "Incorporating topic membership in review rating prediction from unstructured data: a gradient boosting approach," Annals of Operations Research, Springer, vol. 339(1), pages 631-662, August.
    19. Silvia Garc'ia-M'endez & Francisco de Arriba-P'erez & Ana Barros-Vila & Francisco J. Gonz'alez-Casta~no, 2024. "Targeted aspect-based emotion analysis to detect opportunities and precaution in financial Twitter messages," Papers 2404.08665, arXiv.org.
    20. Xing, Yunfei & Zhang, Justin Zuopeng & Teng, Guangqing & Zhou, Xiaotang, 2024. "Voices in the digital storm: Unraveling online polarization with ChatGPT," Technology in Society, Elsevier, vol. 77(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:10:y:2022:i:19:p:3554-:d:928980. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.