Aspect Classification for Legal Depositions
- URL: http://arxiv.org/abs/2009.04485v1
- Date: Wed, 9 Sep 2020 18:00:15 GMT
- Title: Aspect Classification for Legal Depositions
- Authors: Saurabh Chakravarty and Satvik Chekuri and Maanav Mehrotra and Edward
A. Fox
- Abstract summary: It is important to know not only about liability, but also about events, accidents, physical conditions, and treatments.
A legal deposition consists of various aspects that are discussed as part of the deponent testimony.
Our methods have achieved a classification F1 score of 0.83.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Attorneys and others have a strong interest in having a digital library with
suitable services (e.g., summarizing, searching, and browsing) to help them
work with large corpora of legal depositions. Their needs often involve
understanding the semantics of such documents. That depends in part on the role
of the deponent, e.g., plaintiff, defendant, law enforcement personnel, expert,
etc. In the case of tort litigation associated with property and casualty
insurance claims, such as relating to an injury, it is important to know not
only about liability, but also about events, accidents, physical conditions,
and treatments.
We hypothesize that a legal deposition consists of various aspects that are
discussed as part of the deponent testimony. Accordingly, we developed an
ontology of aspects in a legal deposition for accident and injury cases. Using
that, we have developed a classifier that can identify portions of text for
each of the aspects of interest. Doing so was complicated by the peculiarities
of this genre, e.g., that deposition transcripts generally consist of data in
the form of question-answer (QA) pairs. Accordingly, our automated system
starts with pre-processing, and then transforms the QA pairs into a canonical
form made up of declarative sentences. Classifying the declarative sentences
that are generated, according to the aspect, can then help with downstream
tasks such as summarization, segmentation, question-answering, and information
retrieval.
Our methods have achieved a classification F1 score of 0.83. Having the
aspects classified with a good accuracy will help in choosing QA pairs that can
be used as candidate summary sentences, and to generate an informative summary
for legal professionals or insurance claim agents. Our methodology could be
extended to legal depositions of other kinds, and to aid services like
searching.
Related papers
- To Aggregate or Not to Aggregate. That is the Question: A Case Study on Annotation Subjectivity in Span Prediction [44.5492443909544]
We use a corpus of problem descriptions written by laypeople in English that is annotated by practising lawyers.
Inherent subjectivity exists in our task because legal area categorisation is a complex task.
arXiv Detail & Related papers (2024-08-05T06:16:31Z) - DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment [55.91429725404988]
We introduce DELTA, a discriminative model designed for legal case retrieval.
We leverage shallow decoders to create information bottlenecks, aiming to enhance the representation ability.
Our approach can outperform existing state-of-the-art methods in legal case retrieval.
arXiv Detail & Related papers (2024-03-27T10:40:14Z) - MUSER: A Multi-View Similar Case Retrieval Dataset [65.36779942237357]
Similar case retrieval (SCR) is a representative legal AI application that plays a pivotal role in promoting judicial fairness.
Existing SCR datasets only focus on the fact description section when judging the similarity between cases.
We present M, a similar case retrieval dataset based on multi-view similarity measurement and comprehensive legal element with sentence-level legal element annotations.
arXiv Detail & Related papers (2023-10-24T08:17:11Z) - An Intent Taxonomy of Legal Case Retrieval [43.22489520922202]
Legal case retrieval is a special Information Retrieval(IR) task focusing on legal case documents.
We present a novel hierarchical intent taxonomy of legal case retrieval.
We reveal significant differences in user behavior and satisfaction under different search intents in legal case retrieval.
arXiv Detail & Related papers (2023-07-25T07:27:32Z) - SAILER: Structure-aware Pre-trained Language Model for Legal Case
Retrieval [75.05173891207214]
Legal case retrieval plays a core role in the intelligent legal system.
Most existing language models have difficulty understanding the long-distance dependencies between different structures.
We propose a new Structure-Aware pre-traIned language model for LEgal case Retrieval.
arXiv Detail & Related papers (2023-04-22T10:47:01Z) - PropSegmEnt: A Large-Scale Corpus for Proposition-Level Segmentation and
Entailment Recognition [63.51569687229681]
We argue for the need to recognize the textual entailment relation of each proposition in a sentence individually.
We propose PropSegmEnt, a corpus of over 45K propositions annotated by expert human raters.
Our dataset structure resembles the tasks of (1) segmenting sentences within a document to the set of propositions, and (2) classifying the entailment relation of each proposition with respect to a different yet topically-aligned document.
arXiv Detail & Related papers (2022-12-21T04:03:33Z) - Fine-grained Intent Classification in the Legal Domain [2.088409822555567]
We introduce a dataset of 93 legal documents, belonging to the case categories of either Murder, Land Dispute, Robbery, or Corruption.
We annotate fine-grained intents for each such phrase to enable a deeper understanding of the case for a reader.
We analyze the performance of several transformer-based models in automating the process of extracting intent phrases.
arXiv Detail & Related papers (2022-05-06T23:57:17Z) - GERE: Generative Evidence Retrieval for Fact Verification [57.78768817972026]
We propose GERE, the first system that retrieves evidences in a generative fashion.
The experimental results on the FEVER dataset show that GERE achieves significant improvements over the state-of-the-art baselines.
arXiv Detail & Related papers (2022-04-12T03:49:35Z) - Important Sentence Identification in Legal Cases Using Multi-Class
Classification [0.1499944454332829]
This research explores the usage of sentence embeddings for multi-class classification to identify important sentences in a legal case.
A task-specific loss function is defined in order to improve the accuracy restricted by the straightforward use of categorical cross entropy loss.
arXiv Detail & Related papers (2021-11-10T14:58:29Z) - Asking It All: Generating Contextualized Questions for any Semantic Role [56.724302729493594]
We introduce the task of role question generation, which is given a predicate mention and a passage.
We develop a two-stage model for this task, which first produces a context-independent question prototype for each role.
Our evaluation demonstrates that we generate diverse and well-formed questions for a large, broad-coverage of predicates and roles.
arXiv Detail & Related papers (2021-09-10T12:31:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.