HumanACGAN: conditional generative adversarial network with human-based
auxiliary classifier and its evaluation in phoneme perception
- URL: http://arxiv.org/abs/2102.04051v1
- Date: Mon, 8 Feb 2021 08:25:29 GMT
- Title: HumanACGAN: conditional generative adversarial network with human-based
auxiliary classifier and its evaluation in phoneme perception
- Authors: Yota Ueda, Kazuki Fujii, Yuki Saito, Shinnosuke Takamichi, Yukino
Baba, Hiroshi Saruwatari
- Abstract summary: We propose a conditional generative adversarial network (GAN) incorporating humans' perceptual evaluations.
A deep neural network (DNN)-based generator of a GAN can represent a real-data distribution accurately but can never represent a human-acceptable distribution.
This paper proposes the HumanACGAN, a theoretical extension of the HumanGAN, to deal with conditional human-acceptable distributions.
- Score: 52.76447516087089
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: We propose a conditional generative adversarial network (GAN) incorporating
humans' perceptual evaluations. A deep neural network (DNN)-based generator of
a GAN can represent a real-data distribution accurately but can never represent
a human-acceptable distribution, which are ranges of data in which humans
accept the naturalness regardless of whether the data are real or not. A
HumanGAN was proposed to model the human-acceptable distribution. A DNN-based
generator is trained using a human-based discriminator, i.e., humans'
perceptual evaluations, instead of the GAN's DNN-based discriminator. However,
the HumanGAN cannot represent conditional distributions. This paper proposes
the HumanACGAN, a theoretical extension of the HumanGAN, to deal with
conditional human-acceptable distributions. Our HumanACGAN trains a DNN-based
conditional generator by regarding humans as not only a discriminator but also
an auxiliary classifier. The generator is trained by deceiving the human-based
discriminator that scores the unconditioned naturalness and the human-based
classifier that scores the class-conditioned perceptual acceptability. The
training can be executed using the backpropagation algorithm involving humans'
perceptual evaluations. Our experimental results in phoneme perception
demonstrate that our HumanACGAN can successfully train this conditional
generator.
Related papers
- It HAS to be Subjective: Human Annotator Simulation via Zero-shot
Density Estimation [15.8765167340819]
Human annotator simulation (HAS) serves as a cost-effective substitute for human evaluation such as data annotation and system assessment.
Human perception and behaviour during human evaluation exhibit inherent variability due to diverse cognitive processes and subjective interpretations.
This paper introduces a novel meta-learning framework that treats HAS as a zero-shot density estimation problem.
arXiv Detail & Related papers (2023-09-30T20:54:59Z) - Collapse by Conditioning: Training Class-conditional GANs with Limited
Data [109.30895503994687]
We propose a training strategy for conditional GANs (cGANs) that effectively prevents the observed mode-collapse by leveraging unconditional learning.
Our training strategy starts with an unconditional GAN and gradually injects conditional information into the generator and the objective function.
The proposed method for training cGANs with limited data results not only in stable training but also in generating high-quality images.
arXiv Detail & Related papers (2022-01-17T18:59:23Z) - An Adversarial Human Pose Estimation Network Injected with Graph
Structure [75.08618278188209]
In this paper, we design a novel generative adversarial network (GAN) to improve the localization accuracy of visible joints when some joints are invisible.
The network consists of two simple but efficient modules, Cascade Feature Network (CFN) and Graph Structure Network (GSN)
arXiv Detail & Related papers (2021-03-29T12:07:08Z) - Disentangled Sequence Clustering for Human Intention Inference [40.46123013107865]
Disentangled Sequence Clustering Variational Autoencoder (DiSCVAE)
Disentangled Sequence Clustering Variational Autoencoder (DiSCVAE)
arXiv Detail & Related papers (2021-01-23T13:39:34Z) - Guiding GANs: How to control non-conditional pre-trained GANs for
conditional image generation [69.10717733870575]
We present a novel method for guiding generic non-conditional GANs to behave as conditional GANs.
Our approach adds into the mix an encoder network to generate the high-dimensional random input that are fed to the generator network of a non-conditional GAN.
arXiv Detail & Related papers (2021-01-04T14:03:32Z) - Host-Pathongen Co-evolution Inspired Algorithm Enables Robust GAN
Training [0.0]
Generative adversarial networks (GANs) are pairs of artificial neural networks that are trained one against each other.
GANs have allowed for the generation of impressive imitations of real-life films, images and texts, whose fakeness is barely noticeable to humans.
We propose a more robust algorithm for GANs training. We empirically show the increased stability and a better ability to generate high-quality images while using less computational power.
arXiv Detail & Related papers (2020-05-22T09:54:06Z) - Discriminator Contrastive Divergence: Semi-Amortized Generative Modeling
by Exploring Energy of the Discriminator [85.68825725223873]
Generative Adversarial Networks (GANs) have shown great promise in modeling high dimensional data.
We introduce the Discriminator Contrastive Divergence, which is well motivated by the property of WGAN's discriminator.
We demonstrate the benefits of significant improved generation on both synthetic data and several real-world image generation benchmarks.
arXiv Detail & Related papers (2020-04-05T01:50:16Z) - Your GAN is Secretly an Energy-based Model and You Should use
Discriminator Driven Latent Sampling [106.68533003806276]
We show that sampling in latent space can be achieved by sampling in latent space according to an energy-based model induced by the sum of the latent prior log-density and the discriminator output score.
We show that Discriminator Driven Latent Sampling(DDLS) is highly efficient compared to previous methods which work in the high-dimensional pixel space.
arXiv Detail & Related papers (2020-03-12T23:33:50Z) - xAI-GAN: Enhancing Generative Adversarial Networks via Explainable AI
Systems [16.360144499713524]
Generative Adversarial Networks (GANs) are a revolutionary class of Deep Neural Networks (DNNs) that have been successfully used to generate realistic images, music, text, and other data.
We propose a new class of GAN that leverages recent advances in explainable AI (xAI) systems to provide a "richer" form of corrective feedback from discriminators to generators.
We observe xAI-GANs provide an improvement of up to 23.18% in the quality of generated images on both MNIST and FMNIST datasets over standard GANs.
arXiv Detail & Related papers (2020-02-24T18:38:13Z) - GIM: Gaussian Isolation Machines [40.7916016364212]
In many cases, neural network classifiers are exposed to input data that is outside of their training distribution data.
We present a novel hybrid (generative-discriminative) classifier aimed at solving the problem arising when OOD data is encountered.
The proposed GIM's novelty lies in its discriminative performance and generative capabilities, a combination of characteristics not usually seen in a single classifier.
arXiv Detail & Related papers (2020-02-06T09:51:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.