LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal
Statute Identification from Indian Legal Documents
- URL: http://arxiv.org/abs/2112.14731v1
- Date: Wed, 29 Dec 2021 18:39:35 GMT
- Title: LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal
Statute Identification from Indian Legal Documents
- Authors: Shounak Paul, Pawan Goyal and Saptarshi Ghosh
- Abstract summary: Legal Statute Identification (LSI) aims to identify the legal statutes that are relevant to a given description of Facts or evidence of a legal case.
Existing methods only utilize the textual content of Facts and legal articles to guide such a task.
We take the first step towards utilising both the text and the legal citation network for the LSI task.
- Score: 10.059041122060686
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The task of Legal Statute Identification (LSI) aims to identify the legal
statutes that are relevant to a given description of Facts or evidence of a
legal case. Existing methods only utilize the textual content of Facts and
legal articles to guide such a task. However, the citation network among case
documents and legal statutes is a rich source of additional information, which
is not considered by existing models. In this work, we take the first step
towards utilising both the text and the legal citation network for the LSI
task. We curate a large novel dataset for this task, including Facts of cases
from several major Indian Courts of Law, and statutes from the Indian Penal
Code (IPC). Modeling the statutes and training documents as a heterogeneous
graph, our proposed model LeSICiN can learn rich textual and graphical
features, and can also tune itself to correlate these features. Thereafter, the
model can be used to inductively predict links between test documents (new
nodes whose graphical features are not available to the model) and statutes
(existing nodes). Extensive experiments on the dataset show that our model
comfortably outperforms several state-of-the-art baselines, by exploiting the
graphical structure along with textual features. The dataset and our codes are
available at https://github.com/Law-AI/LeSICiN.
Related papers
- DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment [55.91429725404988]
We introduce DELTA, a discriminative model designed for legal case retrieval.
We leverage shallow decoders to create information bottlenecks, aiming to enhance the representation ability.
Our approach can outperform existing state-of-the-art methods in legal case retrieval.
arXiv Detail & Related papers (2024-03-27T10:40:14Z) - MUSER: A Multi-View Similar Case Retrieval Dataset [65.36779942237357]
Similar case retrieval (SCR) is a representative legal AI application that plays a pivotal role in promoting judicial fairness.
Existing SCR datasets only focus on the fact description section when judging the similarity between cases.
We present M, a similar case retrieval dataset based on multi-view similarity measurement and comprehensive legal element with sentence-level legal element annotations.
arXiv Detail & Related papers (2023-10-24T08:17:11Z) - Constructing a Knowledge Graph for Vietnamese Legal Cases with
Heterogeneous Graphs [5.168558598888541]
This paper presents a knowledge graph construction method for legal case documents and related laws.
Our approach consists of three main steps: data crawling, information extraction, and knowledge graph deployment.
arXiv Detail & Related papers (2023-09-16T18:31:47Z) - CiteCaseLAW: Citation Worthiness Detection in Caselaw for Legal
Assistive Writing [44.75251805925605]
We introduce a labeled dataset of 178M sentences for citation-worthiness detection in the legal domain from the Caselaw Access Project (CAP)
The performance of various deep learning models was examined on this novel dataset.
The domain-specific pre-trained model tends to outperform other models, with an 88% F1-score for the citation-worthiness detection task.
arXiv Detail & Related papers (2023-05-03T04:20:56Z) - SAILER: Structure-aware Pre-trained Language Model for Legal Case
Retrieval [75.05173891207214]
Legal case retrieval plays a core role in the intelligent legal system.
Most existing language models have difficulty understanding the long-distance dependencies between different structures.
We propose a new Structure-Aware pre-traIned language model for LEgal case Retrieval.
arXiv Detail & Related papers (2023-04-22T10:47:01Z) - Foundation Models and Fair Use [96.04664748698103]
In the U.S. and other countries, copyrighted content may be used to build foundation models without incurring liability due to the fair use doctrine.
In this work, we survey the potential risks of developing and deploying foundation models based on copyrighted content.
We discuss technical mitigations that can help foundation models stay in line with fair use.
arXiv Detail & Related papers (2023-03-28T03:58:40Z) - Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural
Networks [3.5880535198436156]
We propose a novel graph-augmented dense statute retriever (G-DSR) model that incorporates the structure of legislation via a graph neural network to improve dense retrieval performance.
Experimental results show that our approach outperforms strong retrieval baselines on a real-world expert-annotated SAR dataset.
arXiv Detail & Related papers (2023-01-30T12:59:09Z) - Attentive Deep Neural Networks for Legal Document Retrieval [2.4350217735794337]
We study the use of attentive neural network-based text representation for statute law document retrieval.
We develop two hierarchical architectures with sparse attention to represent long sentences and articles, and we name them Attentive CNN and Paraformer.
Experimental results show that Attentive neural methods substantially outperform non-neural methods in terms of retrieval performance across datasets and languages.
arXiv Detail & Related papers (2022-12-13T01:37:27Z) - Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents [56.40163943394202]
We release the Longformer-based pre-trained language model, named as Lawformer, for Chinese legal long documents understanding.
We evaluate Lawformer on a variety of LegalAI tasks, including judgment prediction, similar case retrieval, legal reading comprehension, and legal question answering.
arXiv Detail & Related papers (2021-05-09T09:39:25Z) - Text-guided Legal Knowledge Graph Reasoning [11.089663225933412]
We propose a novel legal application of legal provision prediction (LPP), which aims to predict the related legal provisions of affairs.
We collect amounts of real-world legal provision data from the Guangdong government service website and construct a legal dataset called LegalLPP.
arXiv Detail & Related papers (2021-04-06T04:42:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.