One Representation to Rule Them All: Identifying Out-of-Support Examples
in Few-shot Learning with Generic Representations
- URL: http://arxiv.org/abs/2106.01423v1
- Date: Wed, 2 Jun 2021 19:07:27 GMT
- Title: One Representation to Rule Them All: Identifying Out-of-Support Examples
in Few-shot Learning with Generic Representations
- Authors: Henry Kvinge, Scott Howland, Nico Courts, Lauren A. Phillips, John
Buckheit, Zachary New, Elliott Skomski, Jung H. Lee, Sandeep Tiwari, Jessica
Hibler, Courtney D. Corley, Nathan O. Hodas
- Abstract summary: We describe a new method of identifying 'out-of-support' (OOS) examples within the Prototypical Networks framework.
We show that our method outperforms other existing approaches in the literature.
- Score: 0.8076739603800089
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The field of few-shot learning has made remarkable strides in developing
powerful models that can operate in the small data regime. Nearly all of these
methods assume every unlabeled instance encountered will belong to a handful of
known classes for which one has examples. This can be problematic for
real-world use cases where one routinely finds 'none-of-the-above' examples. In
this paper we describe this challenge of identifying what we term
'out-of-support' (OOS) examples. We describe how this problem is subtly
different from out-of-distribution detection and describe a new method of
identifying OOS examples within the Prototypical Networks framework using a
fixed point which we call the generic representation. We show that our method
outperforms other existing approaches in the literature as well as other
approaches that we propose in this paper. Finally, we investigate how the use
of such a generic point affects the geometry of a model's feature space.
Related papers
- Impossibility Theorems for Feature Attribution [21.88229793890961]
We show that for moderately rich model classes, any feature attribution method can provably fail to improve on random guessing for inferring model behaviour.
Our results apply to common end-tasks such as characterizing local model behaviour, identifying spurious features, and algorithmic recourse.
arXiv Detail & Related papers (2022-12-22T17:03:57Z) - Generalization Properties of Retrieval-based Models [50.35325326050263]
Retrieval-based machine learning methods have enjoyed success on a wide range of problems.
Despite growing literature showcasing the promise of these models, the theoretical underpinning for such models remains underexplored.
We present a formal treatment of retrieval-based models to characterize their generalization ability.
arXiv Detail & Related papers (2022-10-06T00:33:01Z) - Not All Instances Contribute Equally: Instance-adaptive Class
Representation Learning for Few-Shot Visual Recognition [94.04041301504567]
Few-shot visual recognition refers to recognize novel visual concepts from a few labeled instances.
We propose a novel metric-based meta-learning framework termed instance-adaptive class representation learning network (ICRL-Net) for few-shot visual recognition.
arXiv Detail & Related papers (2022-09-07T10:00:18Z) - An Additive Instance-Wise Approach to Multi-class Model Interpretation [53.87578024052922]
Interpretable machine learning offers insights into what factors drive a certain prediction of a black-box system.
Existing methods mainly focus on selecting explanatory input features, which follow either locally additive or instance-wise approaches.
This work exploits the strengths of both methods and proposes a global framework for learning local explanations simultaneously for multiple target classes.
arXiv Detail & Related papers (2022-07-07T06:50:27Z) - Learning to Detect Instance-level Salient Objects Using Complementary
Image Labels [55.049347205603304]
We present the first weakly-supervised approach to the salient instance detection problem.
We propose a novel weakly-supervised network with three branches: a Saliency Detection Branch leveraging class consistency information to locate candidate objects; a Boundary Detection Branch exploiting class discrepancy information to delineate object boundaries; and a Centroid Detection Branch using subitizing information to detect salient instance centroids.
arXiv Detail & Related papers (2021-11-19T10:15:22Z) - Explaining Classes through Word Attribution [1.3898207083070262]
We propose a method for explaining classes using deep learning models and the Integrated Gradients feature attribution technique.
We demonstrate the approach on Web register (genre) classification using the XML-R model and the Corpus of Online Registers of English (CORE)
arXiv Detail & Related papers (2021-08-31T07:22:29Z) - Instance Similarity Learning for Unsupervised Feature Representation [83.31011038813459]
We propose an instance similarity learning (ISL) method for unsupervised feature representation.
We employ the Generative Adversarial Networks (GAN) to mine the underlying feature manifold.
Experiments on image classification demonstrate the superiority of our method compared with the state-of-the-art methods.
arXiv Detail & Related papers (2021-08-05T16:42:06Z) - Unsupervised Detection of Adversarial Examples with Model Explanations [0.6091702876917279]
We propose a simple yet effective method to detect adversarial examples using methods developed to explain the model's behavior.
Our evaluations with MNIST handwritten dataset show that our method is capable of detecting adversarial examples with high confidence.
arXiv Detail & Related papers (2021-07-22T06:54:18Z) - Contextualizing Enhances Gradient Based Meta Learning [7.009032627535598]
We show how to equip meta learning methods with contextualizers and show that their use can significantly boost performance on a range of few shot learning datasets.
Our approach is particularly apt for low-data environments where it is difficult to update parameters without overfitting.
arXiv Detail & Related papers (2020-07-17T04:01:56Z) - UniT: Unified Knowledge Transfer for Any-shot Object Detection and
Segmentation [52.487469544343305]
Methods for object detection and segmentation rely on large scale instance-level annotations for training.
We propose an intuitive and unified semi-supervised model that is applicable to a range of supervision.
arXiv Detail & Related papers (2020-06-12T22:45:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.