
Fraudulent review detection model focusing on emotional expressions and explicit aspects : investigating the potential of feature engineering

Authors

  • Ajay Kumar (EM - EMLyon Business School)
  • Ram D. Gopal (WBS - Warwick Business School - University of Warwick [Coventry])
  • Ravi Shankar (IIT Delhi - Indian Institute of Technology Delhi)
  • Kim Hua Tan (Nottingham University Business School [Nottingham])

Abstract

Reading customer reviews before purchasing items online has become common practice; however, some companies use machine learning (ML) algorithms to generate false reviews in order to create positive brand images of their own products and negative images of competitors' offerings. Existing techniques use review content to identify fraudulent reviewers; however, spammers have become more sophisticated, learning from their mistakes and changing their tactics to evade detection. Thus, investigating how fraudulent accounts generate fake positive reviews for themselves or fake negative reviews for competitors, and establishing the need for ML classifiers to identify fraudulent reviews, is more important than ever. In this research, we present a novel feature engineering approach in which we (1) extract several "review-centric" and "reviewer-centric" features from a dataset; (2) combine the cumulative effects of feature distributions into a unified model that represents the overall behavior of fraudulent reviewers; (3) investigate the role of effective data pre-processing in improving detection accuracy; and (4) develop a probabilistic approach to detect fraudulent reviewers by learning a novel M-SMOTE model over a derived balanced dataset and feature distributions, which outperforms other ML models. Our findings contribute to the literature on digital platforms and fraudulent review detection, with significant managerial and theoretical implications.
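
To make steps (1) and (4) of the abstract more concrete, the following is a minimal Python sketch under stated assumptions. It is not the authors' M-SMOTE model, whose details the abstract does not give: the oversampling shown is plain SMOTE-style interpolation between minority-class neighbours, and the feature definitions, function names, and toy reviews are all hypothetical illustrations.

```python
import random
from collections import defaultdict


def review_features(text):
    """Hypothetical review-centric features: word count and exclamation rate."""
    n_words = len(text.split())
    return [float(n_words), text.count("!") / max(n_words, 1)]


def reviewer_features(reviews):
    """Hypothetical reviewer-centric features: review count and mean rating per user."""
    ratings = defaultdict(list)
    for r in reviews:
        ratings[r["user"]].append(r["rating"])
    return {u: [float(len(v)), sum(v) / len(v)] for u, v in ratings.items()}


def smote_like(minority, n_new, k=2, seed=0):
    """Plain SMOTE-style oversampling (an assumption, not the paper's M-SMOTE).

    Each synthetic point lies on the segment between a random minority
    sample and one of its k nearest minority neighbours.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        x = rng.choice(minority)
        # Rank the other minority points by squared Euclidean distance to x.
        neighbours = sorted(
            (p for p in minority if p is not x),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(x, p)),
        )[:k]
        nb = rng.choice(neighbours)
        gap = rng.random()  # interpolation weight in [0, 1)
        synthetic.append([a + gap * (b - a) for a, b in zip(x, nb)])
    return synthetic


# Toy, made-up data: label 1 = fraudulent, 0 = genuine.
reviews = [
    {"user": "u1", "rating": 5, "text": "Best product ever!!! Buy now!", "fake": 1},
    {"user": "u1", "rating": 5, "text": "Amazing!!! Five stars!", "fake": 1},
    {"user": "u2", "rating": 1, "text": "Terrible!!! Avoid this brand!", "fake": 1},
    {"user": "u3", "rating": 4, "text": "Works well, battery lasts two days.", "fake": 0},
    {"user": "u4", "rating": 3, "text": "Average build quality for the price.", "fake": 0},
    {"user": "u5", "rating": 2, "text": "Stopped charging after a month.", "fake": 0},
    {"user": "u6", "rating": 4, "text": "Good value, delivery was quick.", "fake": 0},
    {"user": "u7", "rating": 5, "text": "Exactly as described, fits nicely.", "fake": 0},
]

per_user = reviewer_features(reviews)
vectors = [review_features(r["text"]) + per_user[r["user"]] for r in reviews]
fake = [v for v, r in zip(vectors, reviews) if r["fake"] == 1]
genuine = [v for v, r in zip(vectors, reviews) if r["fake"] == 0]

# Oversample the minority (fraudulent) class until both classes are the same size.
synthetic = smote_like(fake, n_new=len(genuine) - len(fake))
balanced_fake = fake + synthetic
```

In this sketch the balanced set (`balanced_fake` plus `genuine`) would then be passed to whichever classifier is being evaluated; the choice of classifier and of real features is left open here because the abstract does not specify them.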

Suggested Citation

  • Ajay Kumar & Ram D. Gopal & Ravi Shankar & Kim Hua Tan, 2022. "Fraudulent review detection model focusing on emotional expressions and explicit aspects : investigating the potential of feature engineering," Post-Print hal-03630420, HAL.
  • Handle: RePEc:hal:journl:hal-03630420
    DOI: 10.1016/j.dss.2021.113728
    Note: View the original document on HAL open archive server: https://hal.science/hal-03630420v1

    Download full text from publisher

    File URL: https://hal.science/hal-03630420v1/document
    Download Restriction: no



    Citations

    Cited by:

    1. Carlos Galera-Zarco & Goulielmos Floros, 2024. "A deep learning approach to improve built asset operations and disaster management in critical events: an integrative simulation model for quicker decision making," Annals of Operations Research, Springer, vol. 339(1), pages 573-612, August.
    2. Jiguang Wang & Yilun Zhang & Xinjie Xing & Yuanzhu Zhan & Wai Kin Victor Chan & Sunil Tiwari, 2024. "A data-driven system for cooperative-bus route planning based on generative adversarial network and metric learning," Annals of Operations Research, Springer, vol. 339(1), pages 427-453, August.
    3. Jing Ma & Xiaoyu Guo & Xufeng Zhao, 2024. "Identifying purchase intention through deep learning: analyzing the Q&D text of an E-Commerce platform," Annals of Operations Research, Springer, vol. 339(1), pages 329-348, August.
    4. Sook Fern Yeo & Cheng Ling Tan & Ajay Kumar & Kim Hua Tan & Jee Kit Wong, 2022. "Investigating the impact of AI-powered technologies on Instagrammers’ purchase decisions in digitalization era–A study of the fashion and apparel industry," Post-Print hal-03628402, HAL.
    5. Yeo, Sook Fern & Tan, Cheng Ling & Kumar, Ajay & Tan, Kim Hua & Wong, Jee Kit, 2022. "Investigating the impact of AI-powered technologies on Instagrammers’ purchase decisions in digitalization era–A study of the fashion and apparel industry," Technological Forecasting and Social Change, Elsevier, vol. 177(C).
    6. Kumar, Ajay & Taylor, James W., 2024. "Feature importance in the age of explainable AI: Case study of detecting fake news & misinformation via a multi-modal framework," European Journal of Operational Research, Elsevier, vol. 317(2), pages 401-413.
    7. Han, Shuihua & Jia, Xinyun & Chen, Xinming & Gupta, Shivam & Kumar, Ajay & Lin, Zhibin, 2022. "Search well and be wise: A machine learning approach to search for a profitable location," Journal of Business Research, Elsevier, vol. 144(C), pages 416-427.
    8. Nan Yang & Nikolaos Korfiatis & Dimitris Zissis & Konstantina Spanaki, 2024. "Incorporating topic membership in review rating prediction from unstructured data: a gradient boosting approach," Annals of Operations Research, Springer, vol. 339(1), pages 631-662, August.
    9. Hajek, Petr & Hikkerova, Lubica & Sahut, Jean-Michel, 2023. "Fake review detection in e-Commerce platforms using aspect-based sentiment analysis," Journal of Business Research, Elsevier, vol. 167(C).
    10. Kshitij Sharma & Yogesh K. Dwivedi & Bhimaraya Metri, 2024. "Incorporating causality in energy consumption forecasting using deep neural networks," Annals of Operations Research, Springer, vol. 339(1), pages 537-572, August.
    11. Serge Nyawa & Dieudonné Tchuente & Samuel Fosso-Wamba, 2024. "COVID-19 vaccine hesitancy: a social media analysis using deep learning," Annals of Operations Research, Springer, vol. 339(1), pages 477-515, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhuang, Mengzhou & Cui, Geng & Peng, Ling, 2018. "Manufactured opinions: The effect of manipulating online product reviews," Journal of Business Research, Elsevier, vol. 87(C), pages 24-35.
    2. Hung-Pin Shih & Pei-Chen Sung, 2021. "Addressing the Review-Based Learning and Private Information Approaches to Foster Platform Continuance," Information Systems Frontiers, Springer, vol. 23(3), pages 649-661, June.
    3. Ku, Hsuan-Hsuan & Shang, Rong-An & Fu, Yi-Fan, 2021. "Social learning effects of complaint handling on social media: Self-construal as a moderator," Journal of Retailing and Consumer Services, Elsevier, vol. 59(C).
    4. Cheng Zhao & Chong Alex Wang, 2023. "A cross-site comparison of online review manipulation using Benford’s law," Electronic Commerce Research, Springer, vol. 23(1), pages 365-406, March.
    5. Xiaohui Zhang & Qianzhou Du & Zhongju Zhang, 2022. "A theory‐driven machine learning system for financial disinformation detection," Production and Operations Management, Production and Operations Management Society, vol. 31(8), pages 3160-3179, August.
    6. Cuixia Jiang & Jun Zhu & Qifa Xu, 2022. "Dissecting click farming on the Taobao platform in China via PU learning and weighted logistic regression," Electronic Commerce Research, Springer, vol. 22(1), pages 157-176, March.
    7. Ishita Chakraborty & Minkyung Kim & K. Sudhir, 2019. "Attribute Sentiment Scoring With Online Text Reviews : Accounting for Language Structure and Attribute Self-Selection," Cowles Foundation Discussion Papers 2176, Cowles Foundation for Research in Economics, Yale University.
    8. Harrison-Walker, L. Jean & Jiang, Ying, 2023. "Suspicion of online product reviews as fake: Cues and consequences," Journal of Business Research, Elsevier, vol. 160(C).
    9. Uttara M. Ananthakrishnan & Beibei Li & Michael D. Smith, 2020. "A Tangled Web: Should Online Review Portals Display Fraudulent Reviews?," Information Systems Research, INFORMS, vol. 31(3), pages 950-971, September.
    10. Banerjee, Snehasish & Chua, Alton Y.K., 2023. "Understanding online fake review production strategies," Journal of Business Research, Elsevier, vol. 156(C).
    11. Ben Jabeur, Sami & Ballouk, Hossein & Ben Arfi, Wissal & Sahut, Jean-Michel, 2023. "Artificial intelligence applications in fake review detection: Bibliometric analysis and future avenues for research," Journal of Business Research, Elsevier, vol. 158(C).
    12. Tim Kollmer & Andreas Eckhardt & Victoria Reibenspiess, 2022. "Explaining consumer suspicion: insights of a vignette study on online product reviews," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(3), pages 1221-1238, September.
    13. Guo, Qiaozhen & Chen, Ying-Ju & Huang, Wei, 2022. "Dynamic pricing of new experience products with dual-channel social learning and online review manipulations," Omega, Elsevier, vol. 109(C).
    14. Warut Khern-am-nuai & Karthik Kannan & Hossein Ghasemkhani, 2018. "Extrinsic versus Intrinsic Rewards for Contributing Reviews in an Online Platform," Information Systems Research, INFORMS, vol. 29(4), pages 871-892, December.
    15. Hui Zhao & Xiaoyuan Wang & Debing Ni & Kevin W. Li, 2023. "The Quality-Signaling Role of Manipulated Consumer Reviews," Group Decision and Negotiation, Springer, vol. 32(3), pages 503-536, June.
    16. Li, Yuanshuo & Zhang, Zili & Pedersen, Susanne & Liu, Xudong & Zhang, Ziqiong, 2023. "The influence of relative popularity on negative fake reviews: A case study on restaurant reviews," Journal of Business Research, Elsevier, vol. 162(C).
    17. Ngai, Eric W.T. & Wu, Yuanyuan, 2022. "Machine learning in marketing: A literature review, conceptual framework, and research agenda," Journal of Business Research, Elsevier, vol. 145(C), pages 35-48.
    18. Plotkina, Daria & Munzel, Andreas & Pallud, Jessie, 2020. "Illusions of truth—Experimental insights into human and algorithmic detections of fake online reviews," Journal of Business Research, Elsevier, vol. 109(C), pages 511-523.
    19. Koukova, Nevena T. & Wang, Rebecca Jen-Hui & Isaac, Mathew S., 2023. "“If you loved our product”: Do conditional review requests harm retailer loyalty?," Journal of Retailing, Elsevier, vol. 99(1), pages 85-101.
    20. Wang, Qiang & Zhang, Wen & Li, Jian & Ma, Zhenzhong, 2023. "Complements or confounders? A study of effects of target and non-target features on online fraudulent reviewer detection," Journal of Business Research, Elsevier, vol. 167(C).
