Linguistic features for sentence difficulty prediction in ABSA
- URL: http://arxiv.org/abs/2402.03163v1
- Date: Mon, 5 Feb 2024 16:31:03 GMT
- Title: Linguistic features for sentence difficulty prediction in ABSA
- Authors: Adrian-Gabriel Chifu and S\'ebastien Fournier
- Abstract summary: We study the impact of domain diversity and syntactic diversity on difficulty.
We employ two ways of defining sentence difficulty.
We also define 9 linguistic features that, combined, aim at estimating the difficulty at sentence level.
- Score: 0.3172761915061083
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: One of the challenges of natural language understanding is to deal with the
subjectivity of sentences, which may express opinions and emotions that add
layers of complexity and nuance. Sentiment analysis is a field that aims to
extract and analyze these subjective elements from text, and it can be applied
at different levels of granularity, such as document, paragraph, sentence, or
aspect. Aspect-based sentiment analysis is a well-studied topic with many
available data sets and models. However, there is no clear definition of what
makes a sentence difficult for aspect-based sentiment analysis. In this paper,
we explore this question by conducting an experiment with three data sets:
"Laptops", "Restaurants", and "MTSC" (Multi-Target-dependent Sentiment
Classification), and a merged version of these three datasets. We study the
impact of domain diversity and syntactic diversity on difficulty. We use a
combination of classifiers to identify the most difficult sentences and analyze
their characteristics. We employ two ways of defining sentence difficulty. The
first one is binary and labels a sentence as difficult if the classifiers fail
to correctly predict the sentiment polarity. The second one is a six-level
scale based on how many of the top five best-performing classifiers can
correctly predict the sentiment polarity. We also define 9 linguistic features
that, combined, aim at estimating the difficulty at sentence level.
Related papers
- Lexicon-Based Sentiment Analysis on Text Polarities with Evaluation of Classification Models [1.342834401139078]
This work uses a lexicon-based method to perform sentiment analysis and shows an evaluation of classification models trained over textual data.
The lexicon-based methods identify the intensity of emotion and subjectivity at word levels.
This work is based on a multi-class problem of text being labeled as positive, negative, or neutral.
arXiv Detail & Related papers (2024-09-19T15:31:12Z) - What is Sentiment Meant to Mean to Language Models? [0.0]
"sentiment" entails a wide variety of concepts depending on the domain and tools used.
"sentiment" has been used to mean emotion, opinions, market movements, or simply a general good-bad'' dimension.
arXiv Detail & Related papers (2024-05-03T19:37:37Z) - What Evidence Do Language Models Find Convincing? [94.90663008214918]
We build a dataset that pairs controversial queries with a series of real-world evidence documents that contain different facts.
We use this dataset to perform sensitivity and counterfactual analyses to explore which text features most affect LLM predictions.
Overall, we find that current models rely heavily on the relevance of a website to the query, while largely ignoring stylistic features that humans find important.
arXiv Detail & Related papers (2024-02-19T02:15:34Z) - A Soft Contrastive Learning-based Prompt Model for Few-shot Sentiment
Analysis [38.17825180485807]
We propose a Soft Contrastive learning-based Prompt model for few-shot sentiment analysis.
First, we design a sentiment-aware chain of thought prompt module to guide the model to predict the sentiment.
Then, we propose a soft contrastive learning algorithm to take the correlation of the labels into account.
arXiv Detail & Related papers (2023-12-16T15:17:28Z) - A Contrastive Cross-Channel Data Augmentation Framework for Aspect-based
Sentiment Analysis [91.83895509731144]
We propose a novel training framework to mitigate the multi-aspect challenge of sentiment analysis.
A source sentence is fed a domain-specific generator to obtain some synthetic sentences.
The generator generates aspect-specific sentences and a Polarity Augmentation (PAC) to generate polarity-inverted sentences.
Our framework can outperform those baselines without any augmentations by about 1% on accuracy and Macro-F1.
arXiv Detail & Related papers (2022-04-16T16:05:58Z) - Dialog speech sentiment classification for imbalanced datasets [7.84604505907019]
In this paper, we use single and bi-modal analysis of short dialog utterances and gain insights on the main factors that aid in sentiment detection.
We propose an architecture which uses a learning rate scheduler and different monitoring criteria and provides state-of-the-art results for the SWITCHBOARD imbalanced sentiment dataset.
arXiv Detail & Related papers (2021-09-15T11:43:04Z) - Competency Problems: On Finding and Removing Artifacts in Language Data [50.09608320112584]
We argue that for complex language understanding tasks, all simple feature correlations are spurious.
We theoretically analyze the difficulty of creating data for competency problems when human bias is taken into account.
arXiv Detail & Related papers (2021-04-17T21:34:10Z) - Narrative Incoherence Detection [76.43894977558811]
We propose the task of narrative incoherence detection as a new arena for inter-sentential semantic understanding.
Given a multi-sentence narrative, decide whether there exist any semantic discrepancies in the narrative flow.
arXiv Detail & Related papers (2020-12-21T07:18:08Z) - Geometry matters: Exploring language examples at the decision boundary [2.7249290070320034]
BERT, CNN and fasttext are susceptible to word substitutions in high difficulty examples.
On YelpReviewPolarity we observe a correlation coefficient of -0.4 between resilience to perturbations and the difficulty score.
Our approach is simple, architecture agnostic and can be used to study the fragilities of text classification models.
arXiv Detail & Related papers (2020-10-14T16:26:13Z) - Weakly-Supervised Aspect-Based Sentiment Analysis via Joint
Aspect-Sentiment Topic Embedding [71.2260967797055]
We propose a weakly-supervised approach for aspect-based sentiment analysis.
We learn sentiment, aspect> joint topic embeddings in the word embedding space.
We then use neural models to generalize the word-level discriminative information.
arXiv Detail & Related papers (2020-10-13T21:33:24Z) - A Deep Neural Framework for Contextual Affect Detection [51.378225388679425]
A short and simple text carrying no emotion can represent some strong emotions when reading along with its context.
We propose a Contextual Affect Detection framework which learns the inter-dependence of words in a sentence.
arXiv Detail & Related papers (2020-01-28T05:03:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.