Perceptual Artifacts Localization for Image Synthesis Tasks
- URL: http://arxiv.org/abs/2310.05590v1
- Date: Mon, 9 Oct 2023 10:22:08 GMT
- Title: Perceptual Artifacts Localization for Image Synthesis Tasks
- Authors: Lingzhi Zhang, Zhengjie Xu, Connelly Barnes, Yuqian Zhou, Qing Liu, He
  Zhang, Sohrab Amirghodsi, Zhe Lin, Eli Shechtman, Jianbo Shi
- Abstract summary: We introduce a novel dataset comprising 10,168 generated images, each annotated with per-pixel perceptual artifact labels.
A segmentation model, trained on our proposed dataset, effectively localizes artifacts across a range of tasks.
We propose an innovative zoom-in inpainting pipeline that seamlessly rectifies perceptual artifacts in the generated images.
- Score: 59.638307505334076
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent advancements in deep generative models have facilitated the creation
of photo-realistic images across various tasks. However, these generated images
often exhibit perceptual artifacts in specific regions, necessitating manual
correction. In this study, we present a comprehensive empirical examination of
Perceptual Artifacts Localization (PAL) spanning diverse image synthesis
endeavors. We introduce a novel dataset comprising 10,168 generated images,
each annotated with per-pixel perceptual artifact labels across ten synthesis
tasks. A segmentation model, trained on our proposed dataset, effectively
localizes artifacts across a range of tasks. Additionally, we illustrate its
proficiency in adapting to previously unseen models using minimal training
samples. We further propose an innovative zoom-in inpainting pipeline that
seamlessly rectifies perceptual artifacts in the generated images. Through our
experimental analyses, we elucidate several practical downstream applications,
such as automated artifact rectification, non-referential image quality
evaluation, and abnormal region detection in images. The dataset and code are
released.
 
      
Related papers
- Semi-Automated Quality Assurance in Digital Pathology: Tile Classification Approach [0.0]
 Quality assurance is a critical but underexplored area in digital pathology.
Artifacts have been shown to negatively impact the performance of AI diagnostic models.
 arXiv  Detail & Related papers  (2025-06-12T17:30:34Z)
- Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation [15.442558725312976]
 We introduce FakeVLM, a specialized large multimodal model for both general synthetic image and DeepFake detection tasks.
FakeVLM excels in distinguishing real from fake images and provides clear, natural language explanations for image artifacts.
We present FakeClue, a comprehensive dataset containing over 100,000 images across seven categories, annotated with fine-grained artifact clues in natural language.
 arXiv  Detail & Related papers  (2025-03-19T05:14:44Z)
- A Large-scale AI-generated Image Inpainting Benchmark [11.216906046169683]
 We propose a methodology for creating high-quality inpainting datasets and apply it to create DiQuID.
DiQuID comprises over 95,000 inpainted images generated from 78,000 original images sourced from MS-COCO, RAISE, and OpenImages.
We provide comprehensive benchmarking results using state-of-the-art forgery detection methods, demonstrating the dataset's effectiveness in evaluating and improving detection algorithms.
 arXiv  Detail & Related papers  (2025-02-10T15:56:28Z)
- Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment [40.112548587906005]
 We present Refine-by-Align, a first-of-its-kind model that employs a diffusion-based framework to address this challenge.
We show that our pipeline greatly pushes the boundary of fine details in the image synthesis models.
 arXiv  Detail & Related papers  (2024-11-30T01:26:04Z)
- Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities [88.398085358514]
 Contrastive Deepfake Embeddings (CoDE) is a novel embedding space specifically designed for deepfake detection.
CoDE is trained via contrastive learning by additionally enforcing global-local similarities.
 arXiv  Detail & Related papers  (2024-07-29T18:00:10Z)
- SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model [15.616316848126642]
 We develop a comprehensive artifact taxonomy and construct a dataset of synthetic images with artifact annotations for fine-tuning a Vision-Language Model (VLM).
The fine-tuned VLM shows a superior ability to identify artifacts and outperforms the baseline by 25.66%.
 arXiv  Detail & Related papers  (2024-02-28T05:54:02Z)
- Rethinking the Up-Sampling Operations in CNN-based Generative Network
  for Generalizable Deepfake Detection [86.97062579515833]
 We introduce the concept of Neighboring Pixel Relationships(NPR) as a means to capture and characterize the generalized structural artifacts stemming from up-sampling operations.
A comprehensive analysis is conducted on an open-world dataset, comprising samples generated by 28 distinct generative models.
This analysis culminates in the establishment of a novel state-of-the-art performance, showcasing a remarkable 11.6% improvement over existing methods.
 arXiv  Detail & Related papers  (2023-12-16T14:27:06Z)
- Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images [60.34381768479834]
 Recent advancements in diffusion models have enabled the generation of realistic deepfakes from textual prompts in natural language.
We pioneer a systematic study on the detection of deepfakes generated by state-of-the-art diffusion models.
 arXiv  Detail & Related papers  (2023-04-02T10:25:09Z)
- Ensembling with Deep Generative Views [72.70801582346344]
 Generative models can synthesize "views" of artificial images that mimic real-world variations, such as changes in color or pose.
Here, we investigate whether such views can be applied to real images to benefit downstream analysis tasks such as image classification.
We use StyleGAN2 as the source of generative augmentations and investigate this setup on classification tasks involving facial attributes, cat faces, and cars.
 arXiv  Detail & Related papers  (2021-04-29T17:58:35Z)
- Image Completion via Inference in Deep Generative Models [16.99337751292915]
 We consider image completion from the perspective of amortized inference in an image generative model.
We demonstrate superior sample quality and diversity compared to prior art on the CIFAR-10 and FFHQ-256 datasets.
 arXiv  Detail & Related papers  (2021-02-24T02:59:43Z)
- Graph Neural Networks for Unsupervised Domain Adaptation of
  Histopathological Image Analytics [22.04114134677181]
 We present a novel method for the unsupervised domain adaptation for histological image analysis.
It is based on a backbone for embedding images into a feature space, and a graph neural layer for propagating the supervision signals of images with labels.
In experiments, our method achieves state-of-the-art performance on four public datasets.
 arXiv  Detail & Related papers  (2020-08-21T04:53:44Z)
- Intrinsic Autoencoders for Joint Neural Rendering and Intrinsic Image
  Decomposition [67.9464567157846]
 We propose an autoencoder for joint generation of realistic images from synthetic 3D models while simultaneously decomposing real images into their intrinsic shape and appearance properties.
Our experiments confirm that a joint treatment of rendering and decomposition is indeed beneficial and that our approach outperforms state-of-the-art image-to-image translation baselines both qualitatively and quantitatively.
 arXiv  Detail & Related papers  (2020-06-29T12:53:58Z)