Combining Feature and Instance Attribution to Detect Artifacts
- URL: http://arxiv.org/abs/2107.00323v1
- Date: Thu, 1 Jul 2021 09:26:13 GMT
- Title: Combining Feature and Instance Attribution to Detect Artifacts
- Authors: Pouya Pezeshkpour, Sarthak Jain, Sameer Singh and Byron C. Wallace
- Abstract summary: We propose methods to facilitate identification of training data artifacts.
We show that this proposed training-feature attribution approach can be used to uncover artifacts in training data.
We execute a small user study to evaluate whether these methods are useful to NLP researchers in practice.
- Score: 62.63504976810927
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Training the large deep neural networks that dominate NLP requires large
datasets. Many of these are collected automatically or via crowdsourcing, and
may exhibit systematic biases or annotation artifacts. By the latter, we mean
correlations between inputs and outputs that are spurious, insofar as they do
not represent a generally held causal relationship between features and
classes; models that exploit such correlations may appear to perform a given
task well, but fail on out-of-sample data. In this paper we propose methods to
facilitate identification of training data artifacts, using new hybrid
approaches that combine saliency maps (which highlight important input
features) with instance attribution methods (which retrieve training samples
influential to a given prediction). We show that this proposed training-feature
attribution approach can be used to uncover artifacts in training data, and use
it to identify previously unreported artifacts in a few standard NLP datasets.
We execute a small user study to evaluate whether these methods are useful to
NLP researchers in practice, with promising results. We make code for all
methods and experiments in this paper available.
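To make the hybrid idea concrete, below is a minimal PyTorch sketch of training-feature attribution under simplifying assumptions: instance attribution is approximated by a single-checkpoint, TracIn-style gradient dot product, and feature attribution by the gradient norm over token embeddings. The `model`, `embed_fn`, and dataset objects are placeholders standing in for a real classifier; this is not the authors' released code.
```python
import torch
import torch.nn.functional as F


def param_grad(model, input_ids, label):
    """Loss gradient w.r.t. model parameters for a single example, flattened."""
    logits = model(input_ids.unsqueeze(0))                # assumed: ids -> class logits
    loss = F.cross_entropy(logits, label.unsqueeze(0))
    grads = torch.autograd.grad(loss, [p for p in model.parameters() if p.requires_grad])
    return torch.cat([g.reshape(-1) for g in grads])


def token_saliency(model, embed_fn, input_ids, label):
    """Simple saliency map: L2 norm of the loss gradient w.r.t. each token embedding."""
    embeds = embed_fn(input_ids.unsqueeze(0)).detach().requires_grad_(True)
    logits = model(inputs_embeds=embeds)                  # assumed: model accepts embeddings
    loss = F.cross_entropy(logits, label.unsqueeze(0))
    (grad,) = torch.autograd.grad(loss, embeds)
    return grad.squeeze(0).norm(dim=-1)                   # one score per token


def training_feature_attribution(model, embed_fn, test_ids, test_label, train_set, k=5):
    """Rank training examples by gradient similarity to the test prediction,
    then return token saliency over the top-k most influential ones."""
    g_test = param_grad(model, test_ids, test_label)
    scores = [torch.dot(g_test, param_grad(model, ids, y)).item() for ids, y in train_set]
    top = sorted(range(len(scores)), key=scores.__getitem__, reverse=True)[:k]
    # Tokens that are repeatedly salient across influential training examples are
    # candidate annotation artifacts worth inspecting by hand.
    return [(i, scores[i], token_saliency(model, embed_fn, *train_set[i])) for i in top]
```
The abstract does not commit to one particular pairing, so the gradient dot product here is just one plausible instantiation of the instance-attribution side; influence-function or representation-similarity variants could be substituted in the same loop.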
Related papers
- C-ICL: Contrastive In-context Learning for Information Extraction [54.39470114243744]
c-ICL is a novel few-shot technique that leverages both correct and incorrect sample constructions to create in-context learning demonstrations.
Our experiments on various datasets indicate that c-ICL outperforms previous few-shot in-context learning methods.
arXiv Detail & Related papers (2024-02-17T11:28:08Z)
- Augmenting NLP data to counter Annotation Artifacts for NLI Tasks [0.0]
Large pre-trained NLP models achieve high performance on benchmark datasets but do not actually "solve" the underlying task.
We explore this phenomenon by first using contrast and adversarial examples to understand the limitations of the model's performance.
We then propose a data augmentation technique to fix this bias and measure its effectiveness.
arXiv Detail & Related papers (2023-02-09T15:34:53Z)
- Domain-Specific NER via Retrieving Correlated Samples [37.98414661072985]
In this paper, we suggest enhancing NER models with correlated samples.
To explicitly simulate the human reasoning process, we perform training-free entity type calibration by majority voting.
Empirical results on datasets of the above two domains show the efficacy of our methods.
arXiv Detail & Related papers (2022-08-27T12:25:24Z)
- CHALLENGER: Training with Attribution Maps [63.736435657236505]
We show that utilizing attribution maps for training neural networks can improve regularization of models and thus increase performance.
In particular, we show that our generic domain-independent approach yields state-of-the-art results in vision, natural language processing and on time series tasks.
arXiv Detail & Related papers (2022-05-30T13:34:46Z)
- Robust Task-Oriented Dialogue Generation with Contrastive Pre-training and Adversarial Filtering [17.7709632238066]
Data artifacts incentivize machine learning models to learn non-transferable generalizations.
We investigate whether popular datasets such as MultiWOZ contain such data artifacts.
We propose a contrastive learning based framework to encourage the model to ignore these cues and focus on learning generalisable patterns.
arXiv Detail & Related papers (2022-05-20T03:13:02Z)
- An Empirical Comparison of Instance Attribution Methods for NLP [62.63504976810927]
We evaluate the degree to which different instance attribution methods agree with respect to the importance of training samples.
We find that simple retrieval methods yield training instances that differ from those identified via gradient-based methods (a toy sketch contrasting the two rankings appears after this list).
arXiv Detail & Related papers (2021-04-09T01:03:17Z)
- Bootstrapping Relation Extractors using Syntactic Search by Examples [47.11932446745022]
We propose a process for bootstrapping training datasets which can be performed quickly by non-NLP-experts.
We take advantage of search engines over syntactic-graphs which expose a friendly by-example syntax.
We show that the resulting models are competitive with models trained on manually annotated data and on data obtained from distant supervision.
arXiv Detail & Related papers (2021-02-09T18:17:59Z)
- Relation-Guided Representation Learning [53.60351496449232]
We propose a new representation learning method that explicitly models and leverages sample relations.
Our framework well preserves the relations between samples.
By seeking to embed samples into a subspace, we show that our method can address the large-scale and out-of-sample problems.
arXiv Detail & Related papers (2020-07-11T10:57:45Z)
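The contrast sketch referenced in the instance-attribution comparison entry above follows; it is my own illustration, not code from that paper. It assumes `test_repr`/`train_reprs` are sentence-level representation vectors and `test_grad`/`train_grads` are flattened per-example loss gradients, and it measures how much the two rankings overlap for the same test example.
```python
import torch
import torch.nn.functional as F


def top_k_by_representation(test_repr, train_reprs, k=10):
    # "Simple retrieval": nearest neighbours by cosine similarity of representations.
    sims = F.cosine_similarity(test_repr.unsqueeze(0), train_reprs, dim=-1)
    return set(torch.topk(sims, k).indices.tolist())


def top_k_by_gradient(test_grad, train_grads, k=10):
    # Gradient-based attribution: dot product with the test example's loss gradient.
    scores = train_grads @ test_grad
    return set(torch.topk(scores, k).indices.tolist())


def top_k_overlap(test_repr, train_reprs, test_grad, train_grads, k=10):
    # Fraction of top-k training examples on which the two methods agree.
    a = top_k_by_representation(test_repr, train_reprs, k)
    b = top_k_by_gradient(test_grad, train_grads, k)
    return len(a & b) / k
```
A low overlap score for many test examples is the kind of disagreement the comparison paper reports between retrieval-based and gradient-based instance attribution.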
This list is automatically generated from the titles and abstracts of the papers in this site.