Predicting Out-of-Domain Generalization with Neighborhood Invariance
- URL: http://arxiv.org/abs/2207.02093v3
- Date: Mon, 17 Jul 2023 15:16:56 GMT
- Title: Predicting Out-of-Domain Generalization with Neighborhood Invariance
- Authors: Nathan Ng and Neha Hulkund and Kyunghyun Cho and Marzyeh Ghassemi
- Abstract summary: We propose a measure of a classifier's output invariance in a local transformation neighborhood.
Our measure is simple to calculate, does not depend on the test point's true label, and can be applied even in out-of-domain (OOD) settings.
In experiments on benchmarks in image classification, sentiment analysis, and natural language inference, we demonstrate a strong and robust correlation between our measure and actual OOD generalization.
- Score: 59.05399533508682
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Developing and deploying machine learning models safely depends on the
ability to characterize and compare their abilities to generalize to new
environments. Although recent work has proposed a variety of methods that can
directly predict or theoretically bound the generalization capacity of a model,
they rely on strong assumptions such as matching train/test distributions and
access to model gradients. In order to characterize generalization when these
assumptions are not satisfied, we propose neighborhood invariance, a measure of
a classifier's output invariance in a local transformation neighborhood.
Specifically, we sample a set of transformations and given an input test point,
calculate the invariance as the largest fraction of transformed points
classified into the same class. Crucially, our measure is simple to calculate,
does not depend on the test point's true label, makes no assumptions about the
data distribution or model, and can be applied even in out-of-domain (OOD)
settings where existing methods cannot, requiring only selecting a set of
appropriate data transformations. In experiments on robustness benchmarks in
image classification, sentiment analysis, and natural language inference, we
demonstrate a strong and robust correlation between our neighborhood invariance
measure and actual OOD generalization on over 4,600 models evaluated on over
100 unique train/test domain pairs.
Related papers
Err
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.