Review Neural Networks about Image Transformation Based on IGC Learning
Framework with Annotated Information
- URL: http://arxiv.org/abs/2206.10155v1
- Date: Tue, 21 Jun 2022 07:27:47 GMT
- Title: Review Neural Networks about Image Transformation Based on IGC Learning
Framework with Annotated Information
- Authors: Yuanjie Yan, Suorong Yang, Yan Wang, Jian Zhao, Furao Shen
- Abstract summary: In Computer Vision (CV), many problems can be regarded as the image transformation task, e.g., semantic segmentation and style transfer.
Some surveys only review the research on style transfer or image-to-image translation, all of which are just a branch of image transformation.
This paper proposes a novel learning framework including Independent learning, Guided learning, and Cooperative learning.
- Score: 13.317099281011515
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Image transformation, a class of vision and graphics problems whose goal is
to learn the mapping between an input image and an output image, develops
rapidly in the context of deep neural networks. In Computer Vision (CV), many
problems can be regarded as the image transformation task, e.g., semantic
segmentation and style transfer. These works have different topics and
motivations, making the image transformation task flourishing. Some surveys
only review the research on style transfer or image-to-image translation, all
of which are just a branch of image transformation. However, none of the
surveys summarize those works together in a unified framework to our best
knowledge. This paper proposes a novel learning framework including Independent
learning, Guided learning, and Cooperative learning, called the IGC learning
framework. The image transformation we discuss mainly involves the general
image-to-image translation and style transfer about deep neural networks. From
the perspective of this framework, we review those subtasks and give a unified
interpretation of various scenarios. We categorize related subtasks about the
image transformation according to similar development trends. Furthermore,
experiments have been performed to verify the effectiveness of IGC learning.
Finally, new research directions and open problems are discussed for future
research.
Related papers
- Deep Image Deblurring: A Survey [165.32391279761006]
Deblurring is a classic problem in low-level computer vision, which aims to recover a sharp image from a blurred input image.
Recent advances in deep learning have led to significant progress in solving this problem.
arXiv Detail & Related papers (2022-01-26T01:31:30Z) - Integrating Visuospatial, Linguistic and Commonsense Structure into
Story Visualization [81.26077816854449]
We first explore the use of constituency parse trees for encoding structured input.
Second, we augment the structured input with commonsense information and study the impact of this external knowledge on the generation of visual story.
Third, we incorporate visual structure via bounding boxes and dense captioning to provide feedback about the characters/objects in generated images.
arXiv Detail & Related papers (2021-10-21T00:16:02Z) - Factors of Influence for Transfer Learning across Diverse Appearance
Domains and Task Types [50.1843146606122]
A simple form of transfer learning is common in current state-of-the-art computer vision models.
Previous systematic studies of transfer learning have been limited and the circumstances in which it is expected to work are not fully understood.
In this paper we carry out an extensive experimental exploration of transfer learning across vastly different image domains.
arXiv Detail & Related papers (2021-03-24T16:24:20Z) - Unsupervised Image Transformation Learning via Generative Adversarial
Networks [40.84518581293321]
We study the image transformation problem by learning the underlying transformations from a collection of images using Generative Adversarial Networks (GANs)
We propose an unsupervised learning framework, termed as TrGAN, to project images onto a transformation space that is shared by the generator and the discriminator.
arXiv Detail & Related papers (2021-03-13T17:08:19Z) - Comparative evaluation of CNN architectures for Image Caption Generation [1.2183405753834562]
We have evaluated 17 different Convolutional Neural Networks on two popular Image Caption Generation frameworks.
We observe that model complexity of Convolutional Neural Network, as measured by number of parameters, and the accuracy of the model on Object Recognition task does not necessarily co-relate with its efficacy on feature extraction for Image Caption Generation task.
arXiv Detail & Related papers (2021-02-23T05:43:54Z) - Cross-View Image Synthesis with Deformable Convolution and Attention
Mechanism [29.528402825356398]
We propose to use Generative Adversarial Networks(GANs) based on a deformable convolution and attention mechanism to solve the problem of cross-view image synthesis.
It is difficult to understand and transform scenes appearance and semantic information from another view, thus we use deformed convolution in the U-net network to improve the network's ability to extract features of objects at different scales.
arXiv Detail & Related papers (2020-07-20T03:08:36Z) - Structural-analogy from a Single Image Pair [118.61885732829117]
In this paper, we explore the capabilities of neural networks to understand image structure given only a single pair of images, A and B.
We generate an image that keeps the appearance and style of B, but has a structural arrangement that corresponds to A.
Our method can be used to generate high quality imagery in other conditional generation tasks utilizing images A and B only.
arXiv Detail & Related papers (2020-04-05T14:51:10Z) - Learning Transformation-Aware Embeddings for Image Forensics [15.484408315588569]
Image Provenance Analysis aims at discovering relationships among different manipulated image versions that share content.
One of the main sub-problems for provenance analysis that has not yet been addressed directly is the edit ordering of images that share full content or are near-duplicates.
This paper introduces a novel deep learning-based approach to provide a plausible ordering to images that have been generated from a single image through transformations.
arXiv Detail & Related papers (2020-01-13T22:01:24Z) - Fine-grained Image-to-Image Transformation towards Visual Recognition [102.51124181873101]
We aim at transforming an image with a fine-grained category to synthesize new images that preserve the identity of the input image.
We adopt a model based on generative adversarial networks to disentangle the identity related and unrelated factors of an image.
Experiments on the CompCars and Multi-PIE datasets demonstrate that our model preserves the identity of the generated images much better than the state-of-the-art image-to-image transformation models.
arXiv Detail & Related papers (2020-01-12T05:26:47Z) - Supervised and Unsupervised Learning of Parameterized Color Enhancement [112.88623543850224]
We tackle the problem of color enhancement as an image translation task using both supervised and unsupervised learning.
We achieve state-of-the-art results compared to both supervised (paired data) and unsupervised (unpaired data) image enhancement methods on the MIT-Adobe FiveK benchmark.
We show the generalization capability of our method, by applying it on photos from the early 20th century and to dark video frames.
arXiv Detail & Related papers (2019-12-30T13:57:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.