E2MoCase: A Dataset for Emotional, Event and Moral Observations in News Articles on High-impact Legal Cases
- URL: http://arxiv.org/abs/2409.09001v1
- Date: Fri, 13 Sep 2024 17:31:09 GMT
- Title: E2MoCase: A Dataset for Emotional, Event and Moral Observations in News Articles on High-impact Legal Cases
- Authors: Candida M. Greco, Lorenzo Zangari, Davide Picca, Andrea Tagarelli,
- Abstract summary: E2MoCase is a novel dataset designed to facilitate the integrated analysis of emotions, moral values, and events within legal narratives and media coverage.
By leveraging advanced models for emotion detection, moral value identification, and event extraction, E2MoCase offers a multi-dimensional perspective on how legal cases are portrayed in news articles.
- Score: 2.435021773579434
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The way media reports on legal cases can significantly shape public opinion, often embedding subtle biases that influence societal views on justice and morality. Analyzing these biases requires a holistic approach that captures the emotional tone, moral framing, and specific events within the narratives. In this work we introduce E2MoCase, a novel dataset designed to facilitate the integrated analysis of emotions, moral values, and events within legal narratives and media coverage. By leveraging advanced models for emotion detection, moral value identification, and event extraction, E2MoCase offers a multi-dimensional perspective on how legal cases are portrayed in news articles.
Related papers
- Exploring and steering the moral compass of Large Language Models [55.2480439325792]
Large Language Models (LLMs) have become central to advancing automation and decision-making across various sectors.
This study proposes a comprehensive comparative analysis of the most advanced LLMs to assess their moral profiles.
arXiv Detail & Related papers (2024-05-27T16:49:22Z) - An Exploratory Case Study on Data Breach Journalism [0.19116784879310028]
This paper explores the novel topic of data breach journalism and data breach news through the case of databreaches.net.
Motivated by the issues in traditional crime news and crime journalism, the case is explored by the means of text mining.
arXiv Detail & Related papers (2024-05-02T16:31:16Z) - EMONA: Event-level Moral Opinions in News Articles [14.898581862558112]
This paper initiates a new task to understand moral opinions towards events in news articles.
We have created a new dataset, EMONA, and annotated event-level moral opinions in news articles.
arXiv Detail & Related papers (2024-04-02T07:57:19Z) - DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment [55.91429725404988]
We introduce DELTA, a discriminative model designed for legal case retrieval.
We leverage shallow decoders to create information bottlenecks, aiming to enhance the representation ability.
Our approach can outperform existing state-of-the-art methods in legal case retrieval.
arXiv Detail & Related papers (2024-03-27T10:40:14Z) - MOKA: Moral Knowledge Augmentation for Moral Event Extraction [7.8192232188516115]
News media often strive to minimize explicit moral language in news articles, yet most articles are dense with moral values as expressed through the reported events themselves.
To study this phenomenon, we annotate a new dataset, MORAL EVENTS, consisting of 5,494 structured event annotations on 474 news articles by diverse US media across the political spectrum.
We propose MOKA, a moral event extraction framework with MOral Knowledge Augmentation, which leverages knowledge derived from moral words and moral scenarios to produce structural representations of morality-bearing events.
arXiv Detail & Related papers (2023-11-16T10:04:49Z) - MUSER: A Multi-View Similar Case Retrieval Dataset [65.36779942237357]
Similar case retrieval (SCR) is a representative legal AI application that plays a pivotal role in promoting judicial fairness.
Existing SCR datasets only focus on the fact description section when judging the similarity between cases.
We present M, a similar case retrieval dataset based on multi-view similarity measurement and comprehensive legal element with sentence-level legal element annotations.
arXiv Detail & Related papers (2023-10-24T08:17:11Z) - SAILER: Structure-aware Pre-trained Language Model for Legal Case
Retrieval [75.05173891207214]
Legal case retrieval plays a core role in the intelligent legal system.
Most existing language models have difficulty understanding the long-distance dependencies between different structures.
We propose a new Structure-Aware pre-traIned language model for LEgal case Retrieval.
arXiv Detail & Related papers (2023-04-22T10:47:01Z) - Legal Element-oriented Modeling with Multi-view Contrastive Learning for
Legal Case Retrieval [3.909749182759558]
We propose an interaction-focused network for legal case retrieval with a multi-view contrastive learning objective.
Case-view contrastive learning minimizes the hidden space distance between relevant legal case representations.
We employ a legal element knowledge-aware indicator to detect legal elements of cases.
arXiv Detail & Related papers (2022-10-11T06:47:23Z) - Fine-grained Intent Classification in the Legal Domain [2.088409822555567]
We introduce a dataset of 93 legal documents, belonging to the case categories of either Murder, Land Dispute, Robbery, or Corruption.
We annotate fine-grained intents for each such phrase to enable a deeper understanding of the case for a reader.
We analyze the performance of several transformer-based models in automating the process of extracting intent phrases.
arXiv Detail & Related papers (2022-05-06T23:57:17Z) - What About the Precedent: An Information-Theoretic Analysis of Common
Law [64.49276556192073]
In common law, the outcome of a new case is determined mostly by precedent cases, rather than existing statutes.
We are the first to approach this question by comparing two longstanding jurisprudential views.
We find that the precedent's arguments share 0.38 nats of information with the case's outcome, whereas precedent's facts only share 0.18 nats of information.
arXiv Detail & Related papers (2021-04-25T11:20:09Z) - Modeling "Newsworthiness" for Lead-Generation Across Corpora [85.92467549469147]
We train models on automatically labeled corpora to predict whether each article was a front-page article.
We rank documents in unlabeled corpora on "newsworthiness"
A fine-tuned RoBERTa model achieves.93 AUC performance on heldout labeled documents, and.88 AUC on expert-validated unlabeled corpora.
arXiv Detail & Related papers (2021-04-19T21:48:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.