Fraudulent review detection model focusing on emotional expressions and explicit aspects : investigating the potential of feature engineering

My bibliography Save this paper

Fraudulent review detection model focusing on emotional expressions and explicit aspects : investigating the potential of feature engineering

Author

Listed:

Ajay Kumar
(EM - EMLyon Business School)
Ram D. Gopal
(WBS - Warwick Business School - University of Warwick [Coventry])
Ravi Shankar
(IIT Delhi - Indian Institute of Technology Delhi)
Kim Hua Tan
(Nottingham University Business School [Nottingham])

Registered:

Abstract

Reading customer reviews before purchasing items online has become a common practice; however, some companies use machine learning (ML) algorithms to generate false reviews in order to create positive brand images of their own products and negative images of competitors' offerings. Existing techniques use review content to identify fraudulent reviewers; however, spammers become more intelligent, started to learn from their mistakes, and changed their tactics in order to avoid detection techniques. Thus, investigating fraudulent accounts' behaviour of generating fake negative or positive reviews for competitors or themselves and the necessity of ML classifiers to identify fraudulent reviews, is more important than ever. In this research, we present a novel feature engineering approach in which we (1) extract several "review-centric" and "reviewer-centric" features from a dataset; (2) combine the cumulative effects of features distributions into a unified model that represents overall behavior of the fraudulent reviewers; (3) investigate the role of effective data pre-processing to improve detection accuracy; and (4) develop a probabilistic approach to detect fraudulent reviewers by learning a novel M-SMOTE model over a derived balanced dataset and feature distributions, which outperforms other ML models. Our study contributes to the literature on digital platforms and fraudulent review detection with significant managerial and theoretical implications through these novel findings.

Suggested Citation

Ajay Kumar & Ram D. Gopal & Ravi Shankar & Kim Hua Tan, 2022. "Fraudulent review detection model focusing on emotional expressions and explicit aspects : investigating the potential of feature engineering," Post-Print hal-03630420, HAL.

Handle: RePEc:hal:journl:hal-03630420
DOI: 10.1016/j.dss.2021.113728
Note: View the original document on HAL open archive server: https://hal.science/hal-03630420v1

Download full text from publisher

References listed on IDEAS

Ajay Kumar & Ravi Shankar & Alok Choudhary & Lakshman S. Thakur, 2016. "A big data MapReduce framework for fault diagnosis in cloud-based manufacturing," International Journal of Production Research, Taylor & Francis Journals, vol. 54(23), pages 7060-7073, December.
Mohamad Hazim & Nor Badrul Anuar & Mohd Faizal Ab Razak & Nor Aniza Abdullah, 2018. "Detecting opinion spams through supervised boosting approach," PLOS ONE, Public Library of Science, vol. 13(6), pages 1-23, June.
Chuanming Yu & Yuheng Zuo & Bolin Feng & Lu An & Baiyun Chen, 2019. "An individual-group-merchant relation model for identifying fake online reviews: an empirical study on a Chinese e-commerce platform," Information Technology and Management, Springer, vol. 20(3), pages 123-138, September.
Theodoros Lappas & Gaurav Sabnis & Georgios Valkanas, 2016. "The Impact of Fake Reviews on Online Visibility: A Vulnerability Assessment of the Hotel Industry," Information Systems Research, INFORMS, vol. 27(4), pages 940-961, December.
Michael Luca & Georgios Zervas, 2016. "Fake It Till You Make It: Reputation, Competition, and Yelp Review Fraud," Management Science, INFORMS, vol. 62(12), pages 3412-3427, December.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Carlos Galera-Zarco & Goulielmos Floros, 2024. "A deep learning approach to improve built asset operations and disaster management in critical events: an integrative simulation model for quicker decision making," Annals of Operations Research, Springer, vol. 339(1), pages 573-612, August.
Jiguang Wang & Yilun Zhang & Xinjie Xing & Yuanzhu Zhan & Wai Kin Victor Chan & Sunil Tiwari, 2024. "A data-driven system for cooperative-bus route planning based on generative adversarial network and metric learning," Annals of Operations Research, Springer, vol. 339(1), pages 427-453, August.
Yeo, Sook Fern & Tan, Cheng Ling & Kumar, Ajay & Tan, Kim Hua & Wong, Jee Kit, 2022. "Investigating the impact of AI-powered technologies on Instagrammers’ purchase decisions in digitalization era–A study of the fashion and apparel industry," Technological Forecasting and Social Change, Elsevier, vol. 177(C).
Kumar, Ajay & Taylor, James W., 2024. "Feature importance in the age of explainable AI: Case study of detecting fake news & misinformation via a multi-modal framework," European Journal of Operational Research, Elsevier, vol. 317(2), pages 401-413.
Han, Shuihua & Jia, Xinyun & Chen, Xinming & Gupta, Shivam & Kumar, Ajay & Lin, Zhibin, 2022. "Search well and be wise: A machine learning approach to search for a profitable location," Journal of Business Research, Elsevier, vol. 144(C), pages 416-427.
Linlin Jing & Wei Shan & Richard David Evans & Xiaoxiao Shi, 2024. "Getting to know my disease better: The influence of linguistic features of patients’ self-disclosure on physicians’ social support in online health consultation," Electronic Markets, Springer;IIM University of St. Gallen, vol. 34(1), pages 1-24, December.
Nan Yang & Nikolaos Korfiatis & Dimitris Zissis & Konstantina Spanaki, 2024. "Incorporating topic membership in review rating prediction from unstructured data: a gradient boosting approach," Annals of Operations Research, Springer, vol. 339(1), pages 631-662, August.
Jing Ma & Xiaoyu Guo & Xufeng Zhao, 2024. "Identifying purchase intention through deep learning: analyzing the Q &D text of an E-Commerce platform," Annals of Operations Research, Springer, vol. 339(1), pages 329-348, August.
Hajek, Petr & Hikkerova, Lubica & Sahut, Jean-Michel, 2023. "Fake review detection in e-Commerce platforms using aspect-based sentiment analysis," Journal of Business Research, Elsevier, vol. 167(C).
Kshitij Sharma & Yogesh K. Dwivedi & Bhimaraya Metri, 2024. "Incorporating causality in energy consumption forecasting using deep neural networks," Annals of Operations Research, Springer, vol. 339(1), pages 537-572, August.
Serge Nyawa & Dieudonné Tchuente & Samuel Fosso-Wamba, 2024. "COVID-19 vaccine hesitancy: a social media analysis using deep learning," Annals of Operations Research, Springer, vol. 339(1), pages 477-515, August.
Sook Fern Yeo & Cheng Ling Tan & Ajay Kumar & Kim Hua Tan & Jee Kit Wong, 2022. "Investigating the impact of AI-powered technologies on Instagrammers’ purchase decisions in digitalization era–A study of the fashion and apparel industry," Post-Print hal-03628402, HAL.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Zhuang, Mengzhou & Cui, Geng & Peng, Ling, 2018. "Manufactured opinions: The effect of manipulating online product reviews," Journal of Business Research, Elsevier, vol. 87(C), pages 24-35.
Hung-Pin Shih & Pei-Chen Sung, 2021. "Addressing the Review-Based Learning and Private Information Approaches to Foster Platform Continuance," Information Systems Frontiers, Springer, vol. 23(3), pages 649-661, June.
Ku, Hsuan-Hsuan & Shang, Rong-An & Fu, Yi-Fan, 2021. "Social learning effects of complaint handling on social media: Self-construal as a moderator," Journal of Retailing and Consumer Services, Elsevier, vol. 59(C).
Cheng Zhao & Chong Alex Wang, 2023. "A cross-site comparison of online review manipulation using Benford’s law," Electronic Commerce Research, Springer, vol. 23(1), pages 365-406, March.
Dimitrios Tsekouras & Dominik Gutt & Irina Heimbach, 2024. "The robo bias in conversational reviews: How the solicitation medium anthropomorphism affects product rating valence and review helpfulness," Journal of the Academy of Marketing Science, Springer, vol. 52(6), pages 1651-1672, November.
Xiaohui Zhang & Qianzhou Du & Zhongju Zhang, 2022. "A theory‐driven machine learning system for financial disinformation detection," Production and Operations Management, Production and Operations Management Society, vol. 31(8), pages 3160-3179, August.
Cuixia Jiang & Jun Zhu & Qifa Xu, 2022. "Dissecting click farming on the Taobao platform in China via PU learning and weighted logistic regression," Electronic Commerce Research, Springer, vol. 22(1), pages 157-176, March.
Ishita Chakraborty & Minkyung Kim & K. Sudhir, 2019. "Attribute Sentiment Scoring With Online Text Reviews : Accounting for Language Structure and Attribute Self-Selection," Cowles Foundation Discussion Papers 2176, Cowles Foundation for Research in Economics, Yale University.
Harrison-Walker, L. Jean & Jiang, Ying, 2023. "Suspicion of online product reviews as fake: Cues and consequences," Journal of Business Research, Elsevier, vol. 160(C).
Uttara M. Ananthakrishnan & Beibei Li & Michael D. Smith, 2020. "A Tangled Web: Should Online Review Portals Display Fraudulent Reviews?," Information Systems Research, INFORMS, vol. 31(3), pages 950-971, September.
Banerjee, Snehasish & Chua, Alton Y.K., 2023. "Understanding online fake review production strategies," Journal of Business Research, Elsevier, vol. 156(C).
Ben Jabeur, Sami & Ballouk, Hossein & Ben Arfi, Wissal & Sahut, Jean-Michel, 2023. "Artificial intelligence applications in fake review detection: Bibliometric analysis and future avenues for research," Journal of Business Research, Elsevier, vol. 158(C).
Tim Kollmer & Andreas Eckhardt & Victoria Reibenspiess, 2022. "Explaining consumer suspicion: insights of a vignette study on online product reviews," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(3), pages 1221-1238, September.
Guo, Qiaozhen & Chen, Ying-Ju & Huang, Wei, 2022. "Dynamic pricing of new experience products with dual-channel social learning and online review manipulations," Omega, Elsevier, vol. 109(C).
Warut Khern-am-nuai & Karthik Kannan & Hossein Ghasemkhani, 2018. "Extrinsic versus Intrinsic Rewards for Contributing Reviews in an Online Platform," Information Systems Research, INFORMS, vol. 29(4), pages 871-892, December.
Hui Zhao & Xiaoyuan Wang & Debing Ni & Kevin W. Li, 2023. "The Quality-Signaling Role of Manipulated Consumer Reviews," Group Decision and Negotiation, Springer, vol. 32(3), pages 503-536, June.
Li, Yuanshuo & Zhang, Zili & Pedersen, Susanne & Liu, Xudong & Zhang, Ziqiong, 2023. "The influence of relative popularity on negative fake reviews: A case study on restaurant reviews," Journal of Business Research, Elsevier, vol. 162(C).
Ngai, Eric W.T. & Wu, Yuanyuan, 2022. "Machine learning in marketing: A literature review, conceptual framework, and research agenda," Journal of Business Research, Elsevier, vol. 145(C), pages 35-48.
Plotkina, Daria & Munzel, Andreas & Pallud, Jessie, 2020. "Illusions of truth—Experimental insights into human and algorithmic detections of fake online reviews," Journal of Business Research, Elsevier, vol. 109(C), pages 511-523.
Koukova, Nevena T. & Wang, Rebecca Jen-Hui & Isaac, Mathew S., 2023. "“If you loved our product”: Do conditional review requests harm retailer loyalty?," Journal of Retailing, Elsevier, vol. 99(1), pages 85-101.

More about this item

Keywords

online reviews; Digital platforms; Review manipulation; Machine learning; Opinion spamming; Feature engineering;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-03630420. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Fraudulent review detection model focusing on emotional expressions and explicit aspects : investigating the potential of feature engineering

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data