Analysing the Resourcefulness of the Paragraph for Precedence Retrieval
- URL: http://arxiv.org/abs/2308.01203v1
- Date: Sat, 29 Jul 2023 08:55:38 GMT
- Title: Analysing the Resourcefulness of the Paragraph for Precedence Retrieval
- Authors: Bhoomeendra Singh Sisodiya, Narendra Babu Unnam, P. Krishna Reddy,
Apala Das, K.V.K. Santhy, V. Balakista Reddy
- Abstract summary: We analyzed the resourcefulness of paragraph-level information in capturing similarity among judgments for improving the performance of precedence retrieval.
We found that the paragraph-level methods could capture the similarity among the judgments with only a few paragraph interactions and exhibit more discriminating power over the baseline document-level method.
- Score: 0.1761604268733064
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Developing methods for extracting relevant legal information to aid legal
practitioners is an active research area. In this regard, research efforts are
being made by leveraging different kinds of information, such as meta-data,
citations, keywords, sentences, paragraphs, etc. Similar to any text document,
legal documents are composed of paragraphs. In this paper, we have analyzed the
resourcefulness of paragraph-level information in capturing similarity among
judgments for improving the performance of precedence retrieval. We found that
the paragraph-level methods could capture the similarity among the judgments
with only a few paragraph interactions and exhibit more discriminating power
over the baseline document-level method. Moreover, the comparison results on
two benchmark datasets for the precedence retrieval on the Indian supreme court
judgments task show that the paragraph-level methods exhibit comparable
performance with the state-of-the-art methods
Related papers
- Unified Multi-Modal Interleaved Document Representation for Information Retrieval [57.65409208879344]
We produce more comprehensive and nuanced document representations by holistically embedding documents interleaved with different modalities.
Specifically, we achieve this by leveraging the capability of recent vision-language models that enable the processing and integration of text, images, and tables into a unified format and representation.
arXiv Detail & Related papers (2024-10-03T17:49:09Z) - Contextual Document Embeddings [77.22328616983417]
We propose two complementary methods for contextualized document embeddings.
First, an alternative contrastive learning objective that explicitly incorporates the document neighbors into the intra-batch contextual loss.
Second, a new contextual architecture that explicitly encodes neighbor document information into the encoded representation.
arXiv Detail & Related papers (2024-10-03T14:33:34Z) - Judgment2vec: Apply Graph Analytics to Searching and Recommendation of Similar Judgments [0.0]
In court practice, legal professionals rely on their training to provide opinions that resolve cases.
Finding a similar case is challenging and often depends on experience, legal domain knowledge, and extensive labor hours.
This research aims to automate the analysis of judgment text similarity.
arXiv Detail & Related papers (2024-08-08T11:37:32Z) - Computing and Exploiting Document Structure to Improve Unsupervised
Extractive Summarization of Legal Case Decisions [7.99536002595393]
We propose an unsupervised graph-based ranking model that uses a reweighting algorithm to exploit document structure.
Results on the Canadian Legal Case Law dataset show that our proposed method outperforms several strong baselines.
arXiv Detail & Related papers (2022-11-06T22:20:42Z) - GERE: Generative Evidence Retrieval for Fact Verification [57.78768817972026]
We propose GERE, the first system that retrieves evidences in a generative fashion.
The experimental results on the FEVER dataset show that GERE achieves significant improvements over the state-of-the-art baselines.
arXiv Detail & Related papers (2022-04-12T03:49:35Z) - Legal Search in Case Law and Statute Law [12.697393184074457]
We describe a method to identify document pairwise relevance in the context of a typical legal document collection.
We review the usage of generalized language models, including supervised and unsupervised learning.
arXiv Detail & Related papers (2021-08-23T12:51:24Z) - VerbCL: A Dataset of Verbatim Quotes for Highlight Extraction in Case
Law [12.080138272647144]
This paper presents a new dataset that consists of the citation graph of court opinions.
We focus on the verbatim quotes, where the text of the original opinion is directly reused.
We introduce the task of highlight extraction as a single-document summarization task based on the citation graph.
arXiv Detail & Related papers (2021-08-23T12:41:41Z) - Comprehensive Studies for Arbitrary-shape Scene Text Detection [78.50639779134944]
We propose a unified framework for the bottom-up based scene text detection methods.
Under the unified framework, we ensure the consistent settings for non-core modules.
With the comprehensive investigations and elaborate analyses, it reveals the advantages and disadvantages of previous models.
arXiv Detail & Related papers (2021-07-25T13:18:55Z) - Unsupervised Document Embedding via Contrastive Augmentation [48.71917352110245]
We present a contrasting learning approach with data augmentation techniques to learn document representations in unsupervised manner.
Inspired by recent contrastive self-supervised learning algorithms used for image and pretraining, we hypothesize that high-quality document embedding should be invariant to diverse paraphrases.
Our method can decrease the classification error rate by up to 6.4% over the SOTA approaches on the document classification task, matching or even surpassing fully-supervised methods.
arXiv Detail & Related papers (2021-03-26T15:48:52Z) - Multilevel Text Alignment with Cross-Document Attention [59.76351805607481]
Existing alignment methods operate at a single, predefined level.
We propose a new learning approach that equips previously established hierarchical attention encoders for representing documents with a cross-document attention component.
arXiv Detail & Related papers (2020-10-03T02:52:28Z) - Methods for Computing Legal Document Similarity: A Comparative Study [9.007583099505954]
Finding similar legal documents is an important and challenging task in the domain of Legal Information Retrieval.
We propose two broad ways of measuring similarity between legal documents - analyzing the precedent citation network, and measuring similarity based on textual content similarity measures.
We explore two promising new similarity computation methods - one text-based and the other based on network embeddings, which have not been considered till now.
arXiv Detail & Related papers (2020-04-26T08:26:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.