IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v14y2022i8p4723-d794214.html
   My bibliography  Save this article

Climate Change Sentiment Analysis Using Lexicon, Machine Learning and Hybrid Approaches

Author

Listed:
  • Nabila Mohamad Sham

    (Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA (UiTM), Shah Alam 40450, Malaysia)

  • Azlinah Mohamed

    (Institute for Big Data Analytics and Artificial Intelligence, Universiti Teknologi MARA (UiTM), Shah Alam 40450, Malaysia)

Abstract

The emissions of greenhouse gases, such as carbon dioxide, into the biosphere have the consequence of warming up the planet, hence the existence of climate change. Sentiment analysis has been a popular subject and there has been a plethora of research conducted in this area in recent decades, typically on social media platforms such as Twitter, due to the proliferation of data generated today during discussions on climate change. However, there is not much research on the performances of different sentiment analysis approaches using lexicon, machine learning and hybrid methods, particularly within this domain-specific sentiment. This study aims to find the most effective sentiment analysis approach for climate change tweets and related domains by performing a comparative evaluation of various sentiment analysis approaches. In this context, seven lexicon-based approaches were used, namely SentiWordNet, TextBlob, VADER, SentiStrength, Hu and Liu, MPQA, and WKWSCI. Meanwhile, three machine learning classifiers were used, namely Support Vector Machine, Naïve Bayes, and Logistic Regression, by using two feature extraction techniques, which were Bag-of-Words and TF–IDF. Next, the hybridization between lexicon-based and machine learning-based approaches was performed. The results indicate that the hybrid method outperformed the other two approaches, with hybrid TextBlob and Logistic Regression achieving an F1-score of 75.3%; thus, this has been chosen as the most effective approach. This study also found that lemmatization improved the accuracy of machine learning and hybrid approaches by 1.6%. Meanwhile, the TF–IDF feature extraction technique was slightly better than BoW by increasing the accuracy of the Logistic Regression classifier by 0.6%. However, TF–IDF and BoW had an identical effect on SVM and NB. Future works will include investigating the suitability of deep learning approaches toward this domain-specific sentiment on social media platforms.

Suggested Citation

  • Nabila Mohamad Sham & Azlinah Mohamed, 2022. "Climate Change Sentiment Analysis Using Lexicon, Machine Learning and Hybrid Approaches," Sustainability, MDPI, vol. 14(8), pages 1-28, April.
  • Handle: RePEc:gam:jsusta:v:14:y:2022:i:8:p:4723-:d:794214
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/14/8/4723/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/14/8/4723/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Mike Thelwall & Kevan Buckley & Georgios Paltoglou & Di Cai & Arvid Kappas, 2010. "Sentiment strength detection in short informal text," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(12), pages 2544-2558, December.
    2. Mike Thelwall & Kevan Buckley & Georgios Paltoglou & Di Cai & Arvid Kappas, 2010. "Sentiment strength detection in short informal text," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 61(12), pages 2544-2558, December.
    3. Ning Xiang & Limao Wang & Shuai Zhong & Chen Zheng & Bo Wang & Qiushi Qu, 2021. "How Does the World View China’s Carbon Policy? A Sentiment Analysis on Twitter Data," Energies, MDPI, vol. 14(22), pages 1-17, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ján Mojžiš & Peter Krammer & Marcel Kvassay & Lenka Skovajsová & Ladislav Hluchý, 2022. "Towards Reliable Baselines for Document-Level Sentiment Analysis in the Czech and Slovak Languages," Future Internet, MDPI, vol. 14(10), pages 1-23, October.
    2. Eva L. Jenkins & Dickson Lukose & Linda Brennan & Annika Molenaar & Tracy A. McCaffrey, 2023. "Exploring Food Waste Conversations on Social Media: A Sentiment, Emotion, and Topic Analysis of Twitter Data," Sustainability, MDPI, vol. 15(18), pages 1-26, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ma, Jie & Tse, Ying Kei & Wang, Xiaojun & Zhang, Minhao, 2019. "Examining customer perception and behaviour through social media research – An empirical study of the United Airlines overbooking crisis," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 127(C), pages 192-205.
    2. Müller-Hansen, Finn & Lee, Yuan Ting & Callaghan, Max & Jankin, Slava & Minx, Jan C., 2022. "The German coal debate on Twitter: Reactions to a corporate policy process," Energy Policy, Elsevier, vol. 169(C).
    3. Lipizzi, Carlo & Iandoli, Luca & Ramirez Marquez, José Emmanuel, 2015. "Extracting and evaluating conversational patterns in social media: A socio-semantic analysis of customers’ reactions to the launch of new products using Twitter streams," International Journal of Information Management, Elsevier, vol. 35(4), pages 490-503.
    4. Martin Haselmayer & Marcelo Jenny, 2017. "Sentiment analysis of political communication: combining a dictionary approach with crowdcoding," Quality & Quantity: International Journal of Methodology, Springer, vol. 51(6), pages 2623-2646, November.
    5. Daesik Kim & Chung Joo Chung & Kihong Eom, 2022. "Measuring Online Public Opinion for Decision Making: Application of Deep Learning on Political Context," Sustainability, MDPI, vol. 14(7), pages 1-16, March.
    6. David M. Goldberg & Nohel Zaman & Arin Brahma & Mariano Aloiso, 2022. "Are mortgage loan closing delay risks predictable? A predictive analysis using text mining on discussion threads," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(3), pages 419-437, March.
    7. Gabriele Ranco & Ilaria Bordino & Giacomo Bormetti & Guido Caldarelli & Fabrizio Lillo & Michele Treccani, 2014. "Coupling news sentiment with web browsing data improves prediction of intra-day price dynamics," Papers 1412.3948, arXiv.org, revised Dec 2015.
    8. Tadić, Bosiljka & Mitrović Dankulov, Marija & Melnik, Roderick, 2023. "Evolving cycles and self-organised criticality in social dynamics," Chaos, Solitons & Fractals, Elsevier, vol. 171(C).
    9. Ping-Yu Hsu & Hong-Tsuen Lei & Shih-Hsiang Huang & Teng Hao Liao & Yao-Chung Lo & Chin-Chun Lo, 2019. "Effects of sentiment on recommendations in social network," Electronic Markets, Springer;IIM University of St. Gallen, vol. 29(2), pages 253-262, June.
    10. Cohen, Scott & Stienmetz, Jason & Hanna, Paul & Humbracht, Michael & Hopkins, Debbie, 2020. "Shadowcasting tourism knowledge through media: Self-driving sex cars?," Annals of Tourism Research, Elsevier, vol. 85(C).
    11. Zhang, Xuetong & Zhang, Weiguo, 2023. "Information asymmetry, sentiment interactions, and asset price," The North American Journal of Economics and Finance, Elsevier, vol. 67(C).
    12. Takahiro Yabe & P. Suresh C. Rao & Satish V. Ukkusuri, 2021. "Modeling the Influence of Online Social Media Information on Post-Disaster Mobility Decisions," Sustainability, MDPI, vol. 13(9), pages 1-13, May.
    13. Patricia P. Iglesias-Sánchez & Gustavo Fabián Vaccaro Witt & Francisco E. Cabrera & Carmen Jambrino-Maldonado, 2020. "The Contagion of Sentiments during the COVID-19 Pandemic Crisis: The Case of Isolation in Spain," IJERPH, MDPI, vol. 17(16), pages 1-10, August.
    14. Indy Wijngaards & Martijn Burger & Job van Exel, 2019. "The promise of open survey questions—The validation of text-based job satisfaction measures," PLOS ONE, Public Library of Science, vol. 14(12), pages 1-22, December.
    15. Junegak Joung & Ki-Hun Kim & Kwangsoo Kim, 2021. "Data-Driven Approach to Dual Service Failure Monitoring From Negative Online Reviews: Managerial Perspective," SAGE Open, , vol. 11(1), pages 21582440209, January.
    16. Wang, Fang & Du, Zhao & Wang, Shan, 2023. "Information multidimensionality in online customer reviews," Journal of Business Research, Elsevier, vol. 159(C).
    17. Frantisek Darena & Jonas Petrovsky & Jan Zizka & Jan Prichystal, 2016. "Analyzing the correlation between online texts and stock price movements at micro-level using machine learning," MENDELU Working Papers in Business and Economics 2016-67, Mendel University in Brno, Faculty of Business and Economics.
    18. Xu, Xun & Lee, Chieh, 2020. "Utilizing the platform economy effect through EWOM: Does the platform matter?," International Journal of Production Economics, Elsevier, vol. 227(C).
    19. Ema Kušen & Mark Strembeck, 2021. "“Evacuate everyone south of that line” Analyzing structural communication patterns during natural disasters," Journal of Computational Social Science, Springer, vol. 4(2), pages 531-565, November.
    20. Wen Zhang & Daniel R. Fesenmaier, 2018. "Assessing emotions in online stories: comparing self-report and text-based approaches," Information Technology & Tourism, Springer, vol. 20(1), pages 83-95, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:14:y:2022:i:8:p:4723-:d:794214. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.