DeepParliament: A Legal domain Benchmark & Dataset for Parliament Bills
Prediction
- URL: http://arxiv.org/abs/2211.15424v1
- Date: Tue, 15 Nov 2022 04:55:32 GMT
- Title: DeepParliament: A Legal domain Benchmark & Dataset for Parliament Bills
Prediction
- Authors: Ankit Pal
- Abstract summary: This paper introduces DeepParliament, a legal domain Benchmark dataset that gathers bill documents and metadata.
We propose two new benchmarks: Binary and Multi-Class Bill Status classification.
This work will be the first to present a Parliament bill prediction task.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper introduces DeepParliament, a legal domain Benchmark Dataset that
gathers bill documents and metadata and performs various bill status
classification tasks. The proposed dataset text covers a broad range of bills
from 1986 to the present and contains richer information on parliament bill
content. Data collection, detailed statistics and analyses are provided in the
paper. Moreover, we experimented with different types of models ranging from
RNN to pretrained and reported the results. We are proposing two new
benchmarks: Binary and Multi-Class Bill Status classification. Models developed
for bill documents and relevant supportive tasks may assist Members of
Parliament (MPs), presidents, and other legal practitioners. It will help
review or prioritise bills, thus speeding up the billing process, improving the
quality of decisions and reducing the time consumption in both houses.
Considering that the foundation of the country's democracy is Parliament and
state legislatures, we anticipate that our research will be an essential
addition to the Legal NLP community. This work will be the first to present a
Parliament bill prediction task. In order to improve the accessibility of legal
AI resources and promote reproducibility, we have made our code and dataset
publicly accessible at github.com/monk1337/DeepParliament
Related papers
- Framework of Voting Prediction of Parliament Members [9.050557698554694]
Voting Prediction Framework (VPF) is a data-driven framework designed to forecast parliamentary voting outcomes at the individual legislator level and for entire bills.<n>VPF consists of three key components: Data Collection, Parsing and Feature Integration, and Prediction Models.<n>To evaluate VPF, we analyzed over 5 million voting records from five countries - Canada, Israel, Tunisia, the United Kingdom and the USA.
arXiv Detail & Related papers (2025-05-18T20:26:55Z) - JuDGE: Benchmarking Judgment Document Generation for Chinese Legal System [12.256518096712334]
JuDGE (Judgment Document Generation Evaluation) is a novel benchmark for evaluating the performance of judgment document generation in the Chinese legal system.
We construct a comprehensive dataset consisting of factual descriptions from real legal cases, paired with their corresponding full judgment documents.
In collaboration with legal professionals, we establish a comprehensive automated evaluation framework to assess the quality of generated judgment documents.
arXiv Detail & Related papers (2025-03-18T13:48:18Z) - Leveraging Knowledge Graphs and LLMs to Support and Monitor Legislative Systems [0.0]
This work investigates how Legislative Knowledge Graphs and LLMs can synergize and support legislative processes.
To this aim, we develop Legis AI Platform, an interactive platform focused on Italian legislation that enhances the possibility of conducting legislative analysis.
arXiv Detail & Related papers (2024-09-20T06:21:03Z) - Precedent-Enhanced Legal Judgment Prediction with LLM and Domain-Model
Collaboration [52.57055162778548]
Legal Judgment Prediction (LJP) has become an increasingly crucial task in Legal AI.
Precedents are the previous legal cases with similar facts, which are the basis for the judgment of the subsequent case in national legal systems.
Recent advances in deep learning have enabled a variety of techniques to be used to solve the LJP task.
arXiv Detail & Related papers (2023-10-13T16:47:20Z) - Leveraging Large Language Models for Topic Classification in the Domain
of Public Affairs [65.9077733300329]
Large Language Models (LLMs) have the potential to greatly enhance the analysis of public affairs documents.
LLMs can be of great use to process domain-specific documents, such as those in the domain of public affairs.
arXiv Detail & Related papers (2023-06-05T13:35:01Z) - Attentive Deep Neural Networks for Legal Document Retrieval [2.4350217735794337]
We study the use of attentive neural network-based text representation for statute law document retrieval.
We develop two hierarchical architectures with sparse attention to represent long sentences and articles, and we name them Attentive CNN and Paraformer.
Experimental results show that Attentive neural methods substantially outperform non-neural methods in terms of retrieval performance across datasets and languages.
arXiv Detail & Related papers (2022-12-13T01:37:27Z) - A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and
Challenges [73.34944216896837]
Legal judgment prediction (LJP) applies Natural Language Processing (NLP) techniques to predict judgment results based on fact descriptions automatically.
We analyze 31 LJP datasets in 6 languages, present their construction process and define a classification method of LJP.
We show the state-of-art results for 8 representative datasets from different court cases and discuss the open challenges.
arXiv Detail & Related papers (2022-04-11T04:06:28Z) - Algorithmic Fairness Datasets: the Story so Far [68.45921483094705]
Data-driven algorithms are studied in diverse domains to support critical decisions, directly impacting people's well-being.
A growing community of researchers has been investigating the equity of existing algorithms and proposing novel ones, advancing the understanding of risks and opportunities of automated decision-making for historically disadvantaged populations.
Progress in fair Machine Learning hinges on data, which can be appropriately used only if adequately documented.
Unfortunately, the algorithmic fairness community suffers from a collective data documentation debt caused by a lack of information on specific resources (opacity) and scatteredness of available information (sparsity)
arXiv Detail & Related papers (2022-02-03T17:25:46Z) - Learning Bill Similarity with Annotated and Augmented Corpora of Bills [9.910141281434319]
We construct a human-labeled dataset of 4,721 bill-to-bill relationships at the subsection-level.
We generate synthetic data with varying degrees of similarity, mimicking the complex bill writing process.
We apply our trained model to infer section- and bill-level similarities.
arXiv Detail & Related papers (2021-09-14T08:50:06Z) - Prospects And Challenges In Digitization: The Case Study Of Federal
Parliament Of Nepal [0.0]
The performance of the parliament, secretariat, parliamentarians as well as employees working procedure should be fully digitalized.
The overall activities of parliament, committees, and its secretariats records are managed systematically through tailored software.
arXiv Detail & Related papers (2021-08-12T06:57:09Z) - Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents [56.40163943394202]
We release the Longformer-based pre-trained language model, named as Lawformer, for Chinese legal long documents understanding.
We evaluate Lawformer on a variety of LegalAI tasks, including judgment prediction, similar case retrieval, legal reading comprehension, and legal question answering.
arXiv Detail & Related papers (2021-05-09T09:39:25Z) - \textit{StateCensusLaws.org}: A Web Application for Consuming and
Annotating Legal Discourse Learning [89.77347919191774]
We create a web application to highlight the output of NLP models trained to parse and label discourse segments in law text.
We focus on state-level law that uses U.S. Census population numbers to allocate resources and organize government.
arXiv Detail & Related papers (2021-04-20T22:00:54Z) - Predicting and Analyzing Law-Making in Kenya [0.012691047660244334]
We developed and trained machine learning models on a combination of features extracted from the bills to predict the outcome.
We observed that the texts in a bill are not as relevant as the year and month the bill was introduced and the category the bill belongs to.
arXiv Detail & Related papers (2020-06-09T20:21:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.