Gradient Imitation Reinforcement Learning for General Low-Resource
Information Extraction
- URL: http://arxiv.org/abs/2211.06014v2
- Date: Mon, 14 Nov 2022 08:35:28 GMT
- Title: Gradient Imitation Reinforcement Learning for General Low-Resource
Information Extraction
- Authors: Xuming Hu, Shiao Meng, Chenwei Zhang, Xiangli Yang, Lijie Wen, Irwin
King, Philip S. Yu
- Abstract summary: We develop a Gradient Reinforcement Learning (GIRL) method to encourage pseudo-labeled data to imitate the gradient descent direction on labeled data.
We also leverage GIRL to solve all IE sub-tasks (named entity recognition, relation extraction, and event extraction) in low-resource settings.
- Score: 80.64518530825801
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Information Extraction (IE) aims to extract structured information from
heterogeneous sources. IE from natural language texts include sub-tasks such as
Named Entity Recognition (NER), Relation Extraction (RE), and Event Extraction
(EE). Most IE systems require comprehensive understandings of sentence
structure, implied semantics, and domain knowledge to perform well; thus, IE
tasks always need adequate external resources and annotations. However, it
takes time and effort to obtain more human annotations. Low-Resource
Information Extraction (LRIE) strives to use unsupervised data, reducing the
required resources and human annotation. In practice, existing systems either
utilize self-training schemes to generate pseudo labels that will cause the
gradual drift problem, or leverage consistency regularization methods which
inevitably possess confirmation bias. To alleviate confirmation bias due to the
lack of feedback loops in existing LRIE learning paradigms, we develop a
Gradient Imitation Reinforcement Learning (GIRL) method to encourage
pseudo-labeled data to imitate the gradient descent direction on labeled data,
which can force pseudo-labeled data to achieve better optimization capabilities
similar to labeled data. Based on how well the pseudo-labeled data imitates the
instructive gradient descent direction obtained from labeled data, we design a
reward to quantify the imitation process and bootstrap the optimization
capability of pseudo-labeled data through trial and error. In addition to
learning paradigms, GIRL is not limited to specific sub-tasks, and we leverage
GIRL to solve all IE sub-tasks (named entity recognition, relation extraction,
and event extraction) in low-resource settings (semi-supervised IE and few-shot
IE).
Related papers
- Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach [56.55633052479446]
Web-scale visual entity recognition presents significant challenges due to the lack of clean, large-scale training data.
We propose a novel methodology to curate such a dataset, leveraging a multimodal large language model (LLM) for label verification, metadata generation, and rationale explanation.
Experiments demonstrate that models trained on this automatically curated data achieve state-of-the-art performance on web-scale visual entity recognition tasks.
arXiv Detail & Related papers (2024-10-31T06:55:24Z) - XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners [71.8257151788923]
We propose a novel Explainable Active Learning framework (XAL) for low-resource text classification.
XAL encourages classifiers to justify their inferences and delve into unlabeled data for which they cannot provide reasonable explanations.
Experiments on six datasets show that XAL achieves consistent improvement over 9 strong baselines.
arXiv Detail & Related papers (2023-10-09T08:07:04Z) - STAR: Boosting Low-Resource Information Extraction by Structure-to-Text
Data Generation with Large Language Models [56.27786433792638]
STAR is a data generation method that leverages Large Language Models (LLMs) to synthesize data instances.
We design fine-grained step-by-step instructions to obtain the initial data instances.
Our experiments show that the data generated by STAR significantly improve the performance of low-resource event extraction and relation extraction tasks.
arXiv Detail & Related papers (2023-05-24T12:15:19Z) - Semi-Supervised Offline Reinforcement Learning with Action-Free
Trajectories [37.14064734165109]
Natural agents can learn from multiple data sources that differ in size, quality, and types of measurements.
We study this in the context of offline reinforcement learning (RL) by introducing a new, practically motivated semi-supervised setting.
arXiv Detail & Related papers (2022-10-12T18:22:23Z) - Gradient Imitation Reinforcement Learning for Low Resource Relation
Extraction [52.63803634033647]
Low-resource relation Extraction (LRE) aims to extract relation facts from limited labeled corpora when human annotation is scarce.
We develop a Gradient Imitation Reinforcement Learning method to encourage pseudo label data to imitate the gradient descent direction on labeled data.
We also propose a framework called GradLRE, which handles two major scenarios in low-resource relation extraction.
arXiv Detail & Related papers (2021-09-14T03:51:15Z) - DAGA: Data Augmentation with a Generation Approach for Low-resource
Tagging Tasks [88.62288327934499]
We propose a novel augmentation method with language models trained on the linearized labeled sentences.
Our method is applicable to both supervised and semi-supervised settings.
arXiv Detail & Related papers (2020-11-03T07:49:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.