Hier-SPCNet: A Legal Statute Hierarchy-based Heterogeneous Network for
Computing Legal Case Document Similarity
- URL: http://arxiv.org/abs/2007.03225v1
- Date: Tue, 7 Jul 2020 06:30:46 GMT
- Title: Hier-SPCNet: A Legal Statute Hierarchy-based Heterogeneous Network for
Computing Legal Case Document Similarity
- Authors: Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh
- Abstract summary: All prior network-based similarity methods considered a precedent citation network among case documents only (PCNet)
We propose to augment the PCNet with the hierarchy of legal statutes, to form a heterogeneous network Hier-SPCNet.
Experiments over a set of Indian Supreme Court case documents show that our proposed heterogeneous network enables significantly better document similarity estimation.
- Score: 9.007583099505954
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Computing similarity between two legal case documents is an important and
challenging task in Legal IR, for which text-based and network-based measures
have been proposed in literature. All prior network-based similarity methods
considered a precedent citation network among case documents only (PCNet).
However, this approach misses an important source of legal knowledge -- the
hierarchy of legal statutes that are applicable in a given legal jurisdiction
(e.g., country). We propose to augment the PCNet with the hierarchy of legal
statutes, to form a heterogeneous network Hier-SPCNet, having citation links
between case documents and statutes, as well as citation and hierarchy links
among the statutes. Experiments over a set of Indian Supreme Court case
documents show that our proposed heterogeneous network enables significantly
better document similarity estimation, as compared to existing approaches using
PCNet. We also show that the proposed network-based method can complement
text-based measures for better estimation of legal document similarity.
Related papers
- Contextual Document Embeddings [77.22328616983417]
We propose two complementary methods for contextualized document embeddings.
First, an alternative contrastive learning objective that explicitly incorporates the document neighbors into the intra-batch contextual loss.
Second, a new contextual architecture that explicitly encodes neighbor document information into the encoded representation.
arXiv Detail & Related papers (2024-10-03T14:33:34Z) - DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment [55.91429725404988]
We introduce DELTA, a discriminative model designed for legal case retrieval.
We leverage shallow decoders to create information bottlenecks, aiming to enhance the representation ability.
Our approach can outperform existing state-of-the-art methods in legal case retrieval.
arXiv Detail & Related papers (2024-03-27T10:40:14Z) - MUSER: A Multi-View Similar Case Retrieval Dataset [65.36779942237357]
Similar case retrieval (SCR) is a representative legal AI application that plays a pivotal role in promoting judicial fairness.
Existing SCR datasets only focus on the fact description section when judging the similarity between cases.
We present M, a similar case retrieval dataset based on multi-view similarity measurement and comprehensive legal element with sentence-level legal element annotations.
arXiv Detail & Related papers (2023-10-24T08:17:11Z) - SAILER: Structure-aware Pre-trained Language Model for Legal Case
Retrieval [75.05173891207214]
Legal case retrieval plays a core role in the intelligent legal system.
Most existing language models have difficulty understanding the long-distance dependencies between different structures.
We propose a new Structure-Aware pre-traIned language model for LEgal case Retrieval.
arXiv Detail & Related papers (2023-04-22T10:47:01Z) - Computing and Exploiting Document Structure to Improve Unsupervised
Extractive Summarization of Legal Case Decisions [7.99536002595393]
We propose an unsupervised graph-based ranking model that uses a reweighting algorithm to exploit document structure.
Results on the Canadian Legal Case Law dataset show that our proposed method outperforms several strong baselines.
arXiv Detail & Related papers (2022-11-06T22:20:42Z) - GERE: Generative Evidence Retrieval for Fact Verification [57.78768817972026]
We propose GERE, the first system that retrieves evidences in a generative fashion.
The experimental results on the FEVER dataset show that GERE achieves significant improvements over the state-of-the-art baselines.
arXiv Detail & Related papers (2022-04-12T03:49:35Z) - Specialized Document Embeddings for Aspect-based Similarity of Research
Papers [4.661692753666685]
We treat aspect-based similarity as a classical vector similarity problem in aspect-specific embedding spaces.
We represent a document not as a single generic embedding but as multiple specialized embeddings.
Our approach mitigates potential risks arising from implicit biases by making them explicit.
arXiv Detail & Related papers (2022-03-28T07:35:26Z) - Incorporating Domain Knowledge for Extractive Summarization of Legal
Case Documents [7.6340456946456605]
We propose an unsupervised summarization algorithm DELSumm for summarizing legal case documents.
Our proposed algorithm outperforms several supervised summarization models that are trained over thousands of document-summary pairs.
arXiv Detail & Related papers (2021-06-30T08:06:15Z) - Aspect-based Document Similarity for Research Papers [4.661692753666685]
We extend similarity with aspect information by performing a pairwise document classification task.
We evaluate our aspect-based document similarity for research papers.
Our results show SciBERT as the best performing system.
arXiv Detail & Related papers (2020-10-13T13:51:21Z) - Multilevel Text Alignment with Cross-Document Attention [59.76351805607481]
Existing alignment methods operate at a single, predefined level.
We propose a new learning approach that equips previously established hierarchical attention encoders for representing documents with a cross-document attention component.
arXiv Detail & Related papers (2020-10-03T02:52:28Z) - Methods for Computing Legal Document Similarity: A Comparative Study [9.007583099505954]
Finding similar legal documents is an important and challenging task in the domain of Legal Information Retrieval.
We propose two broad ways of measuring similarity between legal documents - analyzing the precedent citation network, and measuring similarity based on textual content similarity measures.
We explore two promising new similarity computation methods - one text-based and the other based on network embeddings, which have not been considered till now.
arXiv Detail & Related papers (2020-04-26T08:26:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.