Author's Sentiment Prediction
- URL: http://arxiv.org/abs/2011.06128v1
- Date: Thu, 12 Nov 2020 00:03:26 GMT
- Title: Author's Sentiment Prediction
- Authors: Mohaddeseh Bastan, Mahnaz Koupaee, Youngseo Son, Richard Sicoli, and
Niranjan Balasubramanian
- Abstract summary: PerSenT is a dataset of crowd-sourced annotations of the sentiment expressed by the authors towards the main entities in news articles.
The dataset also includes paragraph-level sentiment annotations to provide more fine-grained supervision for the task.
We release this dataset with 5.3k documents and 38k paragraphs covering 3.2k unique entities as a challenge in entity sentiment analysis.
- Score: 13.459029439420872
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We introduce PerSenT, a dataset of crowd-sourced annotations of the sentiment
expressed by the authors towards the main entities in news articles. The
dataset also includes paragraph-level sentiment annotations to provide more
fine-grained supervision for the task. Our benchmarks of multiple strong
baselines show that this is a difficult classification task. The results also
suggest that simply fine-tuning document-level representations from BERT isn't
adequate for this task. Making paragraph-level decisions and aggregating them
over the entire document is also ineffective. We present empirical and
qualitative analyses that illustrate the specific challenges posed by this
dataset. We release this dataset with 5.3k documents and 38k paragraphs
covering 3.2k unique entities as a challenge in entity sentiment analysis.
Related papers
- Rethinking the Evaluation of Pre-trained Text-and-Layout Models from an
Entity-Centric Perspective [15.222536348615087]
EC-FUNSD is an entity-centric benckmark designed for the evaluation of semantic entity recognition and entity linking on visually-rich documents.
This dataset contains diverse formats of document layouts and annotations of semantic-driven entities and their relations.
Experiment results demonstrate that state-of-the-art PTLMs exhibit overfitting tendencies on the prevailing benchmarks, as their performance sharply decrease when the dataset bias is removed.
arXiv Detail & Related papers (2024-02-04T07:33:45Z) - Text2Analysis: A Benchmark of Table Question Answering with Advanced
Data Analysis and Unclear Queries [67.0083902913112]
We develop the Text2Analysis benchmark, incorporating advanced analysis tasks.
We also develop five innovative and effective annotation methods.
We evaluate five state-of-the-art models using three different metrics.
arXiv Detail & Related papers (2023-12-21T08:50:41Z) - Document-Level Supervision for Multi-Aspect Sentiment Analysis Without
Fine-grained Labels [14.187425779776204]
We propose a VAE-based topic modeling approach that performs ABSA using document-level supervision.
Our approach allows for the detection of multiple aspects in a document, thereby allowing for the possibility of reasoning about how sentiment expressed through multiple aspects comes together to form an observable document-level sentiment.
arXiv Detail & Related papers (2023-10-10T18:53:21Z) - Entity-Level Sentiment Analysis (ELSA): An exploratory task survey [2.191505742658975]
This paper explores the task of identifying the overall sentiment expressed towards volitional entities in a document.
We show that sentiment in our dataset is expressed not only with an entity mention as target, but also towards targets with a sentiment-relevant relation to a volitional entity.
arXiv Detail & Related papers (2023-04-27T15:01:20Z) - Modeling Entities as Semantic Points for Visual Information Extraction
in the Wild [55.91783742370978]
We propose an alternative approach to precisely and robustly extract key information from document images.
We explicitly model entities as semantic points, i.e., center points of entities are enriched with semantic information describing the attributes and relationships of different entities.
The proposed method can achieve significantly enhanced performance on entity labeling and linking, compared with previous state-of-the-art models.
arXiv Detail & Related papers (2023-03-23T08:21:16Z) - Document-Level Relation Extraction with Sentences Importance Estimation
and Focusing [52.069206266557266]
Document-level relation extraction (DocRE) aims to determine the relation between two entities from a document of multiple sentences.
We propose a Sentence Estimation and Focusing (SIEF) framework for DocRE, where we design a sentence importance score and a sentence focusing loss.
Experimental results on two domains show that our SIEF not only improves overall performance, but also makes DocRE models more robust.
arXiv Detail & Related papers (2022-04-27T03:20:07Z) - Augmenting Document Representations for Dense Retrieval with
Interpolation and Perturbation [49.940525611640346]
Document Augmentation for dense Retrieval (DAR) framework augments the representations of documents with their Dense Augmentation and perturbations.
We validate the performance of DAR on retrieval tasks with two benchmark datasets, showing that the proposed DAR significantly outperforms relevant baselines on the dense retrieval of both the labeled and unlabeled documents.
arXiv Detail & Related papers (2022-03-15T09:07:38Z) - Competency Problems: On Finding and Removing Artifacts in Language Data [50.09608320112584]
We argue that for complex language understanding tasks, all simple feature correlations are spurious.
We theoretically analyze the difficulty of creating data for competency problems when human bias is taken into account.
arXiv Detail & Related papers (2021-04-17T21:34:10Z) - YASO: A New Benchmark for Targeted Sentiment Analysis [12.60266470026856]
We present YASO -- a new crowd-sourced TSA evaluation dataset.
The dataset contains 2,215 English sentences from movie, business and product reviews, and 7,415 terms and their sentiments annotated within these sentences.
Our analysis verifies the reliability of our annotations, and explores the characteristics of the collected data.
arXiv Detail & Related papers (2020-12-29T00:25:15Z) - Weakly-Supervised Aspect-Based Sentiment Analysis via Joint
Aspect-Sentiment Topic Embedding [71.2260967797055]
We propose a weakly-supervised approach for aspect-based sentiment analysis.
We learn sentiment, aspect> joint topic embeddings in the word embedding space.
We then use neural models to generalize the word-level discriminative information.
arXiv Detail & Related papers (2020-10-13T21:33:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.