IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i22p4647-d1280114.html
   My bibliography  Save this article

Sentiment Difficulty in Aspect-Based Sentiment Analysis

Author

Listed:
  • Adrian-Gabriel Chifu

    (Aix-Marseille Université, Université de Toulon, CNRS, LIS, 13007 Marseille, France)

  • Sébastien Fournier

    (Aix-Marseille Université, Université de Toulon, CNRS, LIS, 13007 Marseille, France)

Abstract

Subjectivity is a key aspect of natural language understanding, especially in the context of user-generated text and conversational systems based on large language models. Natural language sentences often contain subjective elements, such as opinions and emotions, that make them more nuanced and complex. The level of detail at which the study of the text is performed determines the possible applications of sentiment analysis. The analysis can be done at the document or paragraph level, or, even more granularly, at the aspect level. Many researchers have studied this topic extensively. The field of aspect-based sentiment analysis has numerous data sets and models. In this work, we initiate the discussion around the definition of sentence difficulty in this context of aspect-based sentiment analysis. To assess and quantify the difficulty of the aspect-based sentiment analysis, we conduct an experiment using three data sets: “Laptops”, “Restaurants”, and “MTSC” (Multi-Target-dependent Sentiment Classification), along with 21 learning models from scikit-learn. We also use two textual representations, TF-IDF (Terms frequency-inverse document frequency) and BERT (Bidirectional Encoder Representations from Transformers), to analyze the difficulty faced by these models in performing aspect-based sentiment analysis. Additionally, we compare the models with a fine-tuned version of BERT on the three data sets. We identify the most challenging sentences using a combination of classifiers in order to better understand them. We propose two strategies for defining sentence difficulty. The first strategy is binary and considers sentences as difficult when the classifiers are unable to correctly assign the sentiment polarity. The second strategy uses a six-level difficulty scale based on how many of the top five best-performing classifiers can correctly identify sentiment polarity. These sentences with assigned difficulty classes are then used to create predictive models for early difficulty detection. The purpose of estimating the difficulty of aspect-based sentiment analysis is to enhance performance while minimizing resource usage.

Suggested Citation

  • Adrian-Gabriel Chifu & Sébastien Fournier, 2023. "Sentiment Difficulty in Aspect-Based Sentiment Analysis," Mathematics, MDPI, vol. 11(22), pages 1-33, November.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:22:p:4647-:d:1280114
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/22/4647/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/22/4647/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Falk Scholer & Hugh E. Williams & Andrew Turpin, 2004. "Query association surrogates for Web search," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 55(7), pages 637-650, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Josiane Mothe, 2022. "Analytics Methods to Understand Information Retrieval Effectiveness—A Survey," Mathematics, MDPI, vol. 10(12), pages 1-25, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:22:p:4647-:d:1280114. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.