Explaining in Style: Training a GAN to explain a classifier in
StyleSpace
- URL: http://arxiv.org/abs/2104.13369v1
- Date: Tue, 27 Apr 2021 17:57:19 GMT
- Title: Explaining in Style: Training a GAN to explain a classifier in
StyleSpace
- Authors: Oran Lang, Yossi Gandelsman, Michal Yarom, Yoav Wald, Gal Elidan,
Avinatan Hassidim, William T. Freeman, Phillip Isola, Amir Globerson, Michal
Irani, Inbar Mosseri
- Abstract summary: We present StylEx, a method for training a generative model to explain the semantic attributes that underlie a classifier's decisions on an image.
Our results show that StylEx finds attributes that align well with semantic ones, generate meaningful image-specific explanations, and are human-interpretable.
- Score: 75.75927763429745
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Image classification models can depend on multiple different semantic
attributes of the image. An explanation of the decision of the classifier needs
to both discover and visualize these properties. Here we present StylEx, a
method for doing this, by training a generative model to specifically explain
multiple attributes that underlie classifier decisions. A natural source for
such attributes is the StyleSpace of StyleGAN, which is known to generate
semantically meaningful dimensions in the image. However, because standard GAN
training does not depend on the classifier, its StyleSpace may not capture the
attributes that are important for the classifier's decision, and some of its
dimensions may represent irrelevant attributes. To overcome this, we propose
a training procedure for a StyleGAN, which incorporates the classifier model,
in order to learn a classifier-specific StyleSpace. Explanatory attributes are
then selected from this space. These can be used to visualize the effect of
changing multiple attributes per image, thus providing image-specific
explanations. We apply StylEx to multiple domains, including animals, leaves,
faces and retinal images. For these, we show how an image can be modified in
different ways to change its classifier output. Our results show that the
method finds attributes that align well with semantic ones, generate meaningful
image-specific explanations, and are human-interpretable as measured in
user studies.
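To make the training idea concrete, the sketch below is a simplification (not the authors' released code): it adds a classifier-consistency term to a StyleGAN-style autoencoder so the learned StyleSpace retains classifier-relevant information, then perturbs a single style coordinate to measure its effect on the classifier output. The generator, encoder, classifier modules and the specific loss terms are assumptions for illustration; the full adversarial and perceptual losses of a real StyleGAN setup are omitted.

    import torch
    import torch.nn.functional as F

    def stylex_training_step(generator, encoder, classifier, x, optimizer):
        # Encode the image into a style code and reconstruct it.
        w = encoder(x)
        x_rec = generator(w)
        rec_loss = F.l1_loss(x_rec, x)
        # Require the reconstruction to preserve the classifier's output, so
        # the style code must keep classifier-relevant attributes.
        cls_loss = F.kl_div(
            F.log_softmax(classifier(x_rec), dim=-1),
            F.softmax(classifier(x), dim=-1),
            reduction="batchmean",
        )
        loss = rec_loss + cls_loss
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        return loss.item()

    def attribute_effect(generator, classifier, w, coord, delta=3.0):
        # Shift one StyleSpace coordinate and report how the class
        # probabilities move; coordinates with large effects are candidate
        # explanatory attributes.
        w_edit = w.clone()
        w_edit[:, coord] = w_edit[:, coord] + delta
        p_orig = F.softmax(classifier(generator(w)), dim=-1)
        p_edit = F.softmax(classifier(generator(w_edit)), dim=-1)
        return p_edit - p_orig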
Related papers
- Measuring Style Similarity in Diffusion Models [118.22433042873136]
We present a framework for understanding and extracting style descriptors from images.
Our framework comprises a new dataset curated using the insight that style is a subjective property of an image.
We also propose a method to extract style attribute descriptors that can be used to attribute the style of a generated image to the images used in the training dataset of a text-to-image model.
arXiv Detail & Related papers (2024-04-01T17:58:30Z) - Text Descriptions are Compressive and Invariant Representations for
Visual Learning [63.3464863723631]
We show that an alternative approach, in line with humans' understanding of multiple visual features per class, can provide compelling performance in the robust few-shot learning setting.
In particular, we introduce a novel method, SLR-AVD (Sparse Logistic Regression using Augmented Visual Descriptors).
This method first automatically generates multiple visual descriptions of each class via a large language model (LLM), then uses a VLM to translate these descriptions to a set of visual feature embeddings of each image, and finally uses sparse logistic regression to select a relevant subset of these features to classify each image.
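As a rough sketch of that pipeline (an assumption-laden illustration, not the authors' code), the snippet below scores each image against LLM-generated class descriptions with a CLIP-style vision-language model and then fits an L1-penalized logistic regression that keeps only the useful descriptors; encode_image and encode_text are hypothetical hooks for the VLM.

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    def description_features(images, descriptions, encode_image, encode_text):
        # Cosine similarity of every image to every generated description;
        # each column is one visual-descriptor feature.
        T = np.stack([encode_text(d) for d in descriptions])   # (D, dim)
        I = np.stack([encode_image(im) for im in images])      # (N, dim)
        T = T / np.linalg.norm(T, axis=1, keepdims=True)
        I = I / np.linalg.norm(I, axis=1, keepdims=True)
        return I @ T.T                                          # (N, D)

    def fit_sparse_descriptor_classifier(features, labels, C=1.0):
        # The L1 penalty zeroes out uninformative descriptions, leaving a
        # small, interpretable subset of visual features per class.
        clf = LogisticRegression(penalty="l1", solver="liblinear", C=C)
        clf.fit(features, labels)
        return clf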
arXiv Detail & Related papers (2023-07-10T03:06:45Z) - Everything is There in Latent Space: Attribute Editing and Attribute
Style Manipulation by StyleGAN Latent Space Exploration [39.18239951479647]
We present Few-shot Latent-based Attribute Manipulation and Editing (FLAME).
FLAME is a framework for performing highly controlled image editing by latent space manipulation.
It generates diverse attribute styles in a disentangled manner.
arXiv Detail & Related papers (2022-07-20T12:40:32Z) - Boosting Zero-shot Learning via Contrastive Optimization of Attribute
Representations [28.46906100680767]
We propose a new framework to boost Zero-shot learning (ZSL) by explicitly learning attribute prototypes beyond images.
A prototype generation module is designed to generate attribute prototypes from attribute semantics.
A hard example-based contrastive optimization scheme is introduced to reinforce attribute-level features in the embedding space.
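The summary suggests a standard contrastive formulation; the sketch below is only an illustration of such an attribute-level objective (not the paper's implementation): embeddings are pulled toward the prototype of their attribute and pushed away from the other prototypes with an InfoNCE-style loss.

    import torch
    import torch.nn.functional as F

    def attribute_contrastive_loss(features, prototypes, attr_labels, temperature=0.1):
        # features: (N, d) attribute-level embeddings of images
        # prototypes: (A, d) generated attribute prototypes
        # attr_labels: (N,) index of the attribute each embedding should match
        f = F.normalize(features, dim=-1)
        p = F.normalize(prototypes, dim=-1)
        logits = f @ p.t() / temperature   # similarity to every prototype
        # Cross-entropy over prototypes is InfoNCE with the matching
        # prototype as the positive and all other prototypes as negatives.
        return F.cross_entropy(logits, attr_labels)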
arXiv Detail & Related papers (2022-07-08T11:05:35Z) - Attribute Group Editing for Reliable Few-shot Image Generation [85.52840521454411]
We propose a new editing-based method, i.e., Attribute Group Editing (AGE), for few-shot image generation.
AGE examines the internal representation learned in GANs and identifies semantically meaningful directions.
arXiv Detail & Related papers (2022-03-16T06:54:09Z) - SMILE: Semantically-guided Multi-attribute Image and Layout Editing [154.69452301122175]
Attribute image manipulation has been a very active topic since the introduction of Generative Adversarial Networks (GANs).
We present a multimodal representation that handles all attributes, whether guided by random noise or by images, while using only the underlying domain information of the target domain.
Our method is capable of adding, removing or changing either fine-grained or coarse attributes by using an image as a reference or by exploring the style distribution space.
arXiv Detail & Related papers (2020-10-05T20:15:21Z) - Attribute Propagation Network for Graph Zero-shot Learning [57.68486382473194]
We introduce the attribute propagation network (APNet), which is composed of 1) a graph propagation model generating an attribute vector for each class and 2) a parameterized nearest neighbor (NN) classifier.
APNet achieves either compelling performance or new state-of-the-art results in experiments with two zero-shot learning settings and five benchmark datasets.
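Reading the two components literally, a minimal sketch might look as follows (an assumption about the design, not the released APNet code): class attribute vectors are propagated once over a row-normalized class graph, projected into the image-embedding space, and images are scored by their negative distance to each propagated class vector.

    import torch
    import torch.nn as nn

    class AttributePropagationSketch(nn.Module):
        def __init__(self, attr_dim, embed_dim):
            super().__init__()
            # Linear map that parameterizes the nearest-neighbor metric.
            self.proj = nn.Linear(attr_dim, embed_dim)

        def forward(self, adjacency, class_attrs, image_embeds):
            # One propagation step over the row-normalized class graph.
            deg = adjacency.sum(dim=1, keepdim=True).clamp(min=1.0)
            propagated = (adjacency / deg) @ class_attrs     # (C, attr_dim)
            class_vecs = self.proj(propagated)               # (C, embed_dim)
            # Negative Euclidean distance to each class acts as the logit
            # of a parameterized nearest-neighbor classifier.
            return -torch.cdist(image_embeds, class_vecs)    # (N, C)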
arXiv Detail & Related papers (2020-09-24T16:53:40Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
This site does not guarantee the quality of the information above and is not responsible for any consequences of its use.