Named Entity Recognition in the Legal Domain using a Pointer Generator
Network
- URL: http://arxiv.org/abs/2012.09936v1
- Date: Thu, 17 Dec 2020 21:10:34 GMT
- Title: Named Entity Recognition in the Legal Domain using a Pointer Generator
Network
- Authors: Stavroula Skylaki, Ali Oskooei, Omar Bari, Nadja Herger, Zac Kriegman
(Thomson Reuters Labs)
- Abstract summary: We study the problem of legal NER with noisy text extracted from PDF files of filed court cases from US courts.
The exact location of the entities in the text is unknown and the entities may contain typos and/or OCR mistakes.
We formulate the NER task as a text-to-text sequence generation task and train a pointer generator network to generate the entities in the document rather than label them.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Named Entity Recognition (NER) is the task of identifying and classifying
named entities in unstructured text. In the legal domain, named entities of
interest may include the case parties, judges, names of courts, case numbers,
references to laws etc. We study the problem of legal NER with noisy text
extracted from PDF files of filed court cases from US courts. The "gold
standard" training data for NER systems provide annotation for each token of
the text with the corresponding entity or non-entity label. We work with only
partially complete training data, which differ from the gold standard NER data
in that the exact location of the entities in the text is unknown and the
entities may contain typos and/or OCR mistakes. To overcome the challenges of
our noisy training data, e.g. text extraction errors and/or typos and unknown
label indices, we formulate the NER task as a text-to-text sequence generation
task and train a pointer generator network to generate the entities in the
document rather than label them. We show that the pointer generator can be
effective for NER in the absence of gold standard data and outperforms the
common NER neural network architectures in long legal documents.
Related papers
- Learning Robust Named Entity Recognizers From Noisy Data With Retrieval Augmentation [67.89838237013078]
Named entity recognition (NER) models often struggle with noisy inputs.
We propose a more realistic setting in which only noisy text and its NER labels are available.
We employ a multi-view training framework that improves robust NER without retrieving text during inference.
arXiv Detail & Related papers (2024-07-26T07:30:41Z) - A Boundary Offset Prediction Network for Named Entity Recognition [9.885278527023532]
Named entity recognition (NER) is a fundamental task in natural language processing that aims to identify and classify named entities in text.
We propose a novel approach for NER, named the Boundary Offset Prediction Network (BOPN), which predicts the boundary offsets between candidate spans and their nearest entity spans.
Our method integrates entity type and span representations to generate type-aware boundary offsets instead of using entity types as detection targets.
arXiv Detail & Related papers (2023-10-23T05:04:07Z) - Named Entity Recognition via Machine Reading Comprehension: A Multi-Task
Learning Approach [50.12455129619845]
Named Entity Recognition (NER) aims to extract and classify entity mentions in the text into pre-defined types.
We propose to incorporate the label dependencies among entity types into a multi-task learning framework for better MRC-based NER.
arXiv Detail & Related papers (2023-09-20T03:15:05Z) - TransDocAnalyser: A Framework for Offline Semi-structured Handwritten
Document Analysis in the Legal Domain [3.5018563401895455]
We build the first semi-structured document analysis dataset in the legal domain.
This dataset combines a wide variety of handwritten text with printed text.
We propose an end-to-end framework for offline processing of handwritten semi-structured documents.
arXiv Detail & Related papers (2023-06-03T15:56:30Z) - GPT-NER: Named Entity Recognition via Large Language Models [58.609582116612934]
GPT-NER transforms the sequence labeling task to a generation task that can be easily adapted by Language Models.
We find that GPT-NER exhibits a greater ability in the low-resource and few-shot setups, when the amount of training data is extremely scarce.
This demonstrates the capabilities of GPT-NER in real-world NER applications where the number of labeled examples is limited.
arXiv Detail & Related papers (2023-04-20T16:17:26Z) - Disambiguation of Company names via Deep Recurrent Networks [101.90357454833845]
We propose a Siamese LSTM Network approach to extract -- via supervised learning -- an embedding of company name strings.
We analyse how an Active Learning approach to prioritise the samples to be labelled leads to a more efficient overall learning pipeline.
arXiv Detail & Related papers (2023-03-07T15:07:57Z) - Dynamic Named Entity Recognition [5.9401550252715865]
We introduce a new task: Dynamic Named Entity Recognition (DNER)
DNER provides a framework to better evaluate the ability of algorithms to extract entities by exploiting the context.
We evaluate baseline models and present experiments reflecting issues and research axes related to this novel task.
arXiv Detail & Related papers (2023-02-16T15:50:02Z) - E-NER -- An Annotated Named Entity Recognition Corpus of Legal Text [1.6221439565760059]
We describe a publicly available legal NER data set, called E-NER, based on legal company filings available from the US Securities and Exchange Commission's EDGAR data set.
Training a number of different NER algorithms on the general English CoNLL-2003 corpus but testing on our test collection confirmed significant degradations in accuracy.
arXiv Detail & Related papers (2022-12-19T09:03:32Z) - Optimizing Bi-Encoder for Named Entity Recognition via Contrastive
Learning [80.36076044023581]
We present an efficient bi-encoder framework for named entity recognition (NER)
We frame NER as a metric learning problem that maximizes the similarity between the vector representations of an entity mention and its type.
A major challenge to this bi-encoder formulation for NER lies in separating non-entity spans from entity mentions.
arXiv Detail & Related papers (2022-08-30T23:19:04Z) - Label Semantics for Few Shot Named Entity Recognition [68.01364012546402]
We study the problem of few shot learning for named entity recognition.
We leverage the semantic information in the names of the labels as a way of giving the model additional signal and enriched priors.
Our model learns to match the representations of named entities computed by the first encoder with label representations computed by the second encoder.
arXiv Detail & Related papers (2022-03-16T23:21:05Z) - DEXTER: Deep Encoding of External Knowledge for Named Entity Recognition
in Virtual Assistants [10.500933545429202]
In intelligent voice assistants, where NER is an important component, input to NER may be noisy because of user or speech recognition error.
We describe a NER system intended to address these problems.
We show that this technique improves related tasks, such as semantic parsing, with an improvement of up to 5% in error rate.
arXiv Detail & Related papers (2021-08-15T00:14:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.