Mitigating Inappropriateness in Image Generation: Can there be Value in
  Reflecting the World's Ugliness?
        - URL: http://arxiv.org/abs/2305.18398v1
- Date: Sun, 28 May 2023 13:35:50 GMT
- Title: Mitigating Inappropriateness in Image Generation: Can there be Value in
  Reflecting the World's Ugliness?
- Authors: Manuel Brack, Felix Friedrich, Patrick Schramowski, Kristian Kersting
- Abstract summary: We demonstrate inappropriate degeneration on a large-scale for various generative text-to-image models.
We use models' representations of the world's ugliness to align them with human preferences.
- Score: 18.701950647429
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   Text-conditioned image generation models have recently achieved astonishing
results in image quality and text alignment and are consequently employed in a
fast-growing number of applications. Since they are highly data-driven, relying
on billion-sized datasets randomly scraped from the web, they also reproduce
inappropriate human behavior. Specifically, we demonstrate inappropriate
degeneration on a large-scale for various generative text-to-image models, thus
motivating the need for monitoring and moderating them at deployment. To this
end, we evaluate mitigation strategies at inference to suppress the generation
of inappropriate content. Our findings show that we can use models'
representations of the world's ugliness to align them with human preferences.
 
      
        Related papers
        - Improving face generation quality and prompt following with synthetic   captions [57.47448046728439]
 We introduce a training-free pipeline designed to generate accurate appearance descriptions from images of people.
We then use these synthetic captions to fine-tune a text-to-image diffusion model.
Our results demonstrate that this approach significantly improves the model's ability to generate high-quality, realistic human faces.
 arXiv  Detail & Related papers  (2024-05-17T15:50:53Z)
- A Taxonomy of the Biases of the Images created by Generative Artificial   Intelligence [2.0257616108612373]
 Generative artificial intelligence models show an amazing performance creating unique content automatically just by being given a prompt by the user.
We analyze in detail how the generated content by these models can be strongly biased with respect to a plethora of variables.
We discuss the social, political and economical implications of these biases and possible ways to mitigate them.
 arXiv  Detail & Related papers  (2024-05-02T22:01:28Z)
- ImagenHub: Standardizing the evaluation of conditional image generation
  models [48.51117156168]
 This paper proposes ImagenHub, which is a one-stop library to standardize the inference and evaluation of all conditional image generation models.
We design two human evaluation scores, i.e. Semantic Consistency and Perceptual Quality, along with comprehensive guidelines to evaluate generated images.
Our human evaluation achieves a high inter-worker agreement of Krippendorff's alpha on 76% models with a value higher than 0.4.
 arXiv  Detail & Related papers  (2023-10-02T19:41:42Z)
- ITI-GEN: Inclusive Text-to-Image Generation [56.72212367905351]
 This study investigates inclusive text-to-image generative models that generate images based on human-written prompts.
We show that, for some attributes, images can represent concepts more expressively than text.
We propose a novel approach, ITI-GEN, that leverages readily available reference images for Inclusive Text-to-Image GENeration.
 arXiv  Detail & Related papers  (2023-09-11T15:54:30Z)
- RenAIssance: A Survey into AI Text-to-Image Generation in the Era of
  Large Model [93.8067369210696]
 Text-to-image generation (TTI) refers to the usage of models that could process text input and generate high fidelity images based on text descriptions.
 Diffusion models are one prominent type of generative model used for the generation of images through the systematic introduction of noises with repeating steps.
In the era of large models, scaling up model size and the integration with large language models have further improved the performance of TTI models.
 arXiv  Detail & Related papers  (2023-09-02T03:27:20Z)
- Fair Diffusion: Instructing Text-to-Image Generation Models on Fairness [15.059419033330126]
 We present a novel strategy, called Fair Diffusion, to attenuate biases after the deployment of generative text-to-image models.
Specifically, we demonstrate shifting a bias, based on human instructions, in any direction yielding arbitrarily new proportions for, e.g., identity groups.
This introduced control enables instructing generative image models on fairness, with no data filtering and additional training required.
 arXiv  Detail & Related papers  (2023-02-07T18:25:28Z)
- DreamArtist++: Controllable One-Shot Text-to-Image Generation via   Positive-Negative Adapter [63.622879199281705]
 Some example-based image generation approaches have been proposed, emphi.e. generating new concepts based on absorbing the salient features of a few input references.
We propose a simple yet effective framework, namely DreamArtist, which adopts a novel positive-negative prompt-tuning learning strategy on the pre-trained diffusion model.
We have conducted extensive experiments and evaluated the proposed method from image similarity (fidelity) and diversity, generation controllability, and style cloning.
 arXiv  Detail & Related papers  (2022-11-21T10:37:56Z)
- Will Large-scale Generative Models Corrupt Future Datasets? [5.593352892211305]
 Large-scale text-to-image generative models can generate high-quality and realistic images from users' prompts.
This paper empirically answers this question by simulating contamination.
We conclude that generated images negatively affect downstream performance, while the significance depends on tasks and the amount of generated images.
 arXiv  Detail & Related papers  (2022-11-15T12:25:33Z)
- Safe Latent Diffusion: Mitigating Inappropriate Degeneration in
  Diffusion Models [18.701950647429]
 Text-conditioned image generation models suffer from degenerated and biased human behavior.
We present safe latent diffusion (SLD) to help combat these undesired side effects.
We show that SLD removes and suppresses inappropriate image parts during the diffusion process.
 arXiv  Detail & Related papers  (2022-11-09T18:54:25Z)
- Re-Imagen: Retrieval-Augmented Text-to-Image Generator [58.60472701831404]
 Retrieval-Augmented Text-to-Image Generator (Re-Imagen)
Retrieval-Augmented Text-to-Image Generator (Re-Imagen)
 arXiv  Detail & Related papers  (2022-09-29T00:57:28Z)
- InvGAN: Invertible GANs [88.58338626299837]
 InvGAN, short for Invertible GAN, successfully embeds real images to the latent space of a high quality generative model.
This allows us to perform image inpainting, merging, and online data augmentation.
 arXiv  Detail & Related papers  (2021-12-08T21:39:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.