Law Smells: Defining and Detecting Problematic Patterns in Legal
Drafting
- URL: http://arxiv.org/abs/2110.11984v1
- Date: Fri, 15 Oct 2021 06:37:13 GMT
- Title: Law Smells: Defining and Detecting Problematic Patterns in Legal
Drafting
- Authors: Corinna Coupette, Dirk Hartung, Janis Beckedorf, Maximilian B\"other,
Daniel Martin Katz
- Abstract summary: Law smells are patterns in legal texts that pose threats to the comprehensibility and maintainability of the law.
We develop a comprehensive law smell taxonomy, using text-based and graph-based methods.
Our work demonstrates how ideas from software engineering can be leveraged to assess and improve the quality of legal code.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Building on the computer science concept of code smells, we initiate the
study of law smells, i.e., patterns in legal texts that pose threats to the
comprehensibility and maintainability of the law. With five intuitive law
smells as running examples - namely, duplicated phrase, long element, large
reference tree, ambiguous syntax, and natural language obsession -, we develop
a comprehensive law smell taxonomy. This taxonomy classifies law smells by when
they can be detected, which aspects of law they relate to, and how they can be
discovered. We introduce text-based and graph-based methods to identify
instances of law smells, confirming their utility in practice using the United
States Code as a test case. Our work demonstrates how ideas from software
engineering can be leveraged to assess and improve the quality of legal code,
thus drawing attention to an understudied area in the intersection of law and
computer science and highlighting the potential of computational legal
drafting.
Related papers
- Specification languages for computational laws versus basic legal principles [0.0]
We speak of a textitcomputational law when that law is intended to be enforced by software through an automated decision-making process.
In this paper, we investigate how certain legal principles fare in both scenarios: computational law written in natural language or written in formal language.
arXiv Detail & Related papers (2025-03-12T07:39:27Z) - DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment [55.91429725404988]
We introduce DELTA, a discriminative model designed for legal case retrieval.
We leverage shallow decoders to create information bottlenecks, aiming to enhance the representation ability.
Our approach can outperform existing state-of-the-art methods in legal case retrieval.
arXiv Detail & Related papers (2024-03-27T10:40:14Z) - Constructing a Knowledge Graph for Vietnamese Legal Cases with
Heterogeneous Graphs [5.168558598888541]
This paper presents a knowledge graph construction method for legal case documents and related laws.
Our approach consists of three main steps: data crawling, information extraction, and knowledge graph deployment.
arXiv Detail & Related papers (2023-09-16T18:31:47Z) - SAILER: Structure-aware Pre-trained Language Model for Legal Case
Retrieval [75.05173891207214]
Legal case retrieval plays a core role in the intelligent legal system.
Most existing language models have difficulty understanding the long-distance dependencies between different structures.
We propose a new Structure-Aware pre-traIned language model for LEgal case Retrieval.
arXiv Detail & Related papers (2023-04-22T10:47:01Z) - Law to Binary Tree -- An Formal Interpretation of Legal Natural Language [3.1468624343533844]
We propose a new approach based on legal science, specifically legal taxonomy, for representing and reasoning with legal documents.
Our approach interprets the regulations in legal documents as binary trees, which facilitates legal reasoning systems to make decisions and resolve logical contradictions.
arXiv Detail & Related papers (2022-12-16T08:26:32Z) - Language Model Decoding as Likelihood-Utility Alignment [54.70547032876017]
We introduce a taxonomy that groups decoding strategies based on their implicit assumptions about how well the model's likelihood is aligned with the task-specific notion of utility.
Specifically, by analyzing the correlation between the likelihood and the utility of predictions across a diverse set of tasks, we provide the first empirical evidence supporting the proposed taxonomy.
arXiv Detail & Related papers (2022-10-13T17:55:51Z) - Compositional Law Parsing with Latent Random Functions [54.26307134687171]
We propose a deep latent variable model for Compositional LAw Parsing (CLAP)
CLAP achieves the human-like compositionality ability through an encoding-decoding architecture to represent concepts in the scene as latent variables.
Our experimental results demonstrate that CLAP outperforms the compared baseline methods in multiple visual tasks.
arXiv Detail & Related papers (2022-09-15T06:57:23Z) - Pile of Law: Learning Responsible Data Filtering from the Law and a
256GB Open-Source Legal Dataset [46.156169284961045]
We offer an approach to filtering grounded in law, which has directly addressed the tradeoffs in filtering material.
First, we gather and make available the Pile of Law, a 256GB dataset of open-source English-language legal and administrative data.
Second, we distill the legal norms that governments have developed to constrain the inclusion of toxic or private content into actionable lessons.
Third, we show how the Pile of Law offers researchers the opportunity to learn such filtering rules directly from the data.
arXiv Detail & Related papers (2022-07-01T06:25:15Z) - Unsupervised Law Article Mining based on Deep Pre-Trained Language
Representation Models with Application to the Italian Civil Code [3.9342247746757435]
This study proposes an advanced approach to law article prediction for the Italian legal system based on a BERT (Bidirectional Representations from Transformers) learning framework.
We define LamBERTa models by fine-tuning an Italian pre-trained BERT on the Italian civil code or its portions, for law article retrieval as a classification task.
We provide insights into the explainability and interpretability of our LamBERTa models, and we present an extensive experimental analysis over query sets of different type.
arXiv Detail & Related papers (2021-12-02T11:02:00Z) - The Legislative Recipe: Syntax for Machine-Readable Legislation [0.0]
This article attempts to unpack the notion of machine-readability.
It will reflect on logic syntax and symbolic language to assess the capacity and limits of representing legal knowledge.
arXiv Detail & Related papers (2021-08-19T13:40:35Z) - \textit{StateCensusLaws.org}: A Web Application for Consuming and
Annotating Legal Discourse Learning [89.77347919191774]
We create a web application to highlight the output of NLP models trained to parse and label discourse segments in law text.
We focus on state-level law that uses U.S. Census population numbers to allocate resources and organize government.
arXiv Detail & Related papers (2021-04-20T22:00:54Z) - A Dataset for Statutory Reasoning in Tax Law Entailment and Question
Answering [37.66486350122862]
This paper investigates the performance of natural language understanding approaches on statutory reasoning.
We introduce a dataset, together with a legal-domain text corpus.
We contrast this with a hand-constructed Prolog-based system, designed to fully solve the task.
arXiv Detail & Related papers (2020-05-11T16:54:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.