A Weak Supervised Dataset of Fine-Grained Emotions in Portuguese
- URL: http://arxiv.org/abs/2108.07638v1
- Date: Tue, 17 Aug 2021 14:08:23 GMT
- Title: A Weak Supervised Dataset of Fine-Grained Emotions in Portuguese
- Authors: Diogo Cortiz, Jefferson O. Silva, Newton Calegari, Ana Lu\'isa
Freitas, Ana Ang\'elica Soares, Carolina Botelho, Gabriel Gaudencio R\^ego,
Waldir Sampaio, Paulo Sergio Boggio
- Abstract summary: This research describes an approach to create a lexical-based weak supervised corpus for fine-grained emotion in Portuguese.
Our results suggest lexical-based weak supervision as an appropriate strategy for initial work in low resources environment.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Affective Computing is the study of how computers can recognize, interpret
and simulate human affects. Sentiment Analysis is a common task in NLP related
to this topic, but it focuses only on emotion valence (positive, negative,
neutral). An emerging approach in NLP is Emotion Recognition, which relies on
fined-grained classification. This research describes an approach to create a
lexical-based weak supervised corpus for fine-grained emotion in Portuguese. We
evaluate our dataset by fine-tuning a transformer-based language model (BERT)
and validating it on a Golden Standard annotated validation set. Our results
(F1-score= .64) suggest lexical-based weak supervision as an appropriate
strategy for initial work in low resources environment.
Related papers
- SOUL: Towards Sentiment and Opinion Understanding of Language [96.74878032417054]
We propose a new task called Sentiment and Opinion Understanding of Language (SOUL)
SOUL aims to evaluate sentiment understanding through two subtasks: Review (RC) and Justification Generation (JG)
arXiv Detail & Related papers (2023-10-27T06:48:48Z) - Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language
Modelling [70.23876429382969]
We propose a benchmark that can evaluate intra-sentence discourse properties across a diverse set of NLP tasks.
Disco-Bench consists of 9 document-level testsets in the literature domain, which contain rich discourse phenomena.
For linguistic analysis, we also design a diagnostic test suite that can examine whether the target models learn discourse knowledge.
arXiv Detail & Related papers (2023-07-16T15:18:25Z) - REDAffectiveLM: Leveraging Affect Enriched Embedding and
Transformer-based Neural Language Model for Readers' Emotion Detection [3.6678641723285446]
We propose a novel approach for Readers' Emotion Detection from short-text documents using a deep learning model called REDAffectiveLM.
We leverage context-specific and affect enriched representations by using a transformer-based pre-trained language model in tandem with affect enriched Bi-LSTM+Attention.
arXiv Detail & Related papers (2023-01-21T19:28:25Z) - Sentiment-Aware Word and Sentence Level Pre-training for Sentiment
Analysis [64.70116276295609]
SentiWSP is a Sentiment-aware pre-trained language model with combined Word-level and Sentence-level Pre-training tasks.
SentiWSP achieves new state-of-the-art performance on various sentence-level and aspect-level sentiment classification benchmarks.
arXiv Detail & Related papers (2022-10-18T12:25:29Z) - Eliminating Sentiment Bias for Aspect-Level Sentiment Classification
with Unsupervised Opinion Extraction [47.09579655541417]
ALSC aims at identifying the sentiment polarity of a specified aspect in a sentence.
Recent works fine-tune pre-trained Transformer encoders for ALSC to extract an aspect-centric dependency tree.
We propose a span-based anti-bias aspect representation learning framework.
arXiv Detail & Related papers (2021-09-06T12:32:42Z) - Weakly-Supervised Aspect-Based Sentiment Analysis via Joint
Aspect-Sentiment Topic Embedding [71.2260967797055]
We propose a weakly-supervised approach for aspect-based sentiment analysis.
We learn sentiment, aspect> joint topic embeddings in the word embedding space.
We then use neural models to generalize the word-level discriminative information.
arXiv Detail & Related papers (2020-10-13T21:33:24Z) - Structured Self-Attention Weights Encode Semantics in Sentiment Analysis [13.474141732019099]
We show that self-attention scores encode semantics by considering sentiment analysis tasks.
We propose a simple and effective Attention Tracing (LAT) method to analyze structured attention weights.
Our results show that structured attention weights encode rich semantics in sentiment analysis, and match human interpretations of semantics.
arXiv Detail & Related papers (2020-10-10T06:49:25Z) - Language Models as Emotional Classifiers for Textual Conversations [3.04585143845864]
We present a novel methodology for classifying emotion in a conversation.
At the backbone of our proposed methodology is a pre-trained Language Model (LM)
We apply our proposed methodology on the IEMOCAP and Friends data sets.
arXiv Detail & Related papers (2020-08-27T20:04:30Z) - A computational model implementing subjectivity with the 'Room Theory'.
The case of detecting Emotion from Text [68.8204255655161]
This work introduces a new method to consider subjectivity and general context dependency in text analysis.
By using similarity measure between words, we are able to extract the relative relevance of the elements in the benchmark.
This method could be applied to all the cases where evaluating subjectivity is relevant to understand the relative value or meaning of a text.
arXiv Detail & Related papers (2020-05-12T21:26:04Z) - A SentiWordNet Strategy for Curriculum Learning in Sentiment Analysis [7.562843347215284]
Curriculum Learning (CL) is the idea that learning on a training set sequenced or ordered in a manner where samples range from easy to difficult.
In this paper, we apply the ideas of curriculum learning, driven by SentiWordNet in a sentiment analysis setting.
arXiv Detail & Related papers (2020-05-10T18:50:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.