Transforming and Projecting Images into Class-conditional Generative
Networks
- URL: http://arxiv.org/abs/2005.01703v2
- Date: Thu, 27 Aug 2020 18:10:52 GMT
- Title: Transforming and Projecting Images into Class-conditional Generative
Networks
- Authors: Minyoung Huh, Richard Zhang, Jun-Yan Zhu, Sylvain Paris, Aaron
Hertzmann
- Abstract summary: We present a method for projecting an input image into the space of a class-conditional generative neural network.
Specifically, we demonstrate that one can solve for image translation, scale, and global color transformation.
We show the effectiveness of our method on real images and further demonstrate how the corresponding projections lead to better editability of these images.
- Score: 44.79971598515697
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a method for projecting an input image into the space of a
class-conditional generative neural network. We propose a method that optimizes
for transformation to counteract the model biases in generative neural
networks. Specifically, we demonstrate that one can solve for image
translation, scale, and global color transformation, during the projection
optimization to address the object-center bias and color bias of a Generative
Adversarial Network. This projection process poses a difficult optimization
problem, and purely gradient-based optimizations fail to find good solutions.
We describe a hybrid optimization strategy that finds good projections by
estimating transformations and class parameters. We show the effectiveness of
our method on real images and further demonstrate how the corresponding
projections lead to better editability of these images.
Related papers
- Hyper-VolTran: Fast and Generalizable One-Shot Image to 3D Object
Structure via HyperNetworks [53.67497327319569]
We introduce a novel neural rendering technique to solve image-to-3D from a single view.
Our approach employs the signed distance function as the surface representation and incorporates generalizable priors through geometry-encoding volumes and HyperNetworks.
Our experiments show the advantages of our proposed approach with consistent results and rapid generation.
arXiv Detail & Related papers (2023-12-24T08:42:37Z) - Optimize and Reduce: A Top-Down Approach for Image Vectorization [12.998637003026273]
We propose Optimize & Reduce (O&R), a top-down approach to vectorization that is both fast and domain-agnostic.
O&R aims to attain a compact representation of input images by iteratively optimizing B'ezier curve parameters.
We demonstrate that our method is domain agnostic and outperforms existing works in both reconstruction and perceptual quality for a fixed number of shapes.
arXiv Detail & Related papers (2023-12-18T16:41:03Z) - Adaptive Image Registration: A Hybrid Approach Integrating Deep Learning
and Optimization Functions for Enhanced Precision [13.242184146186974]
We propose a single framework for image registration based on deep neural networks and optimization.
We show improvements of up to 1.6% in test data, while maintaining the same inference time, and a substantial 1.0% points performance gain in deformation field smoothness.
arXiv Detail & Related papers (2023-11-27T02:48:06Z) - Improving Diffusion-based Image Translation using Asymmetric Gradient
Guidance [51.188396199083336]
We present an approach that guides the reverse process of diffusion sampling by applying asymmetric gradient guidance.
Our model's adaptability allows it to be implemented with both image-fusion and latent-dif models.
Experiments show that our method outperforms various state-of-the-art models in image translation tasks.
arXiv Detail & Related papers (2023-06-07T12:56:56Z) - Convolutional Analysis Operator Learning by End-To-End Training of
Iterative Neural Networks [3.6280929178575994]
We show how convolutional sparsifying filters can be efficiently learned by end-to-end training of iterative neural networks.
We evaluated our approach on a non-Cartesian 2D cardiac cine MRI example and show that the obtained filters are better suitable for the corresponding reconstruction algorithm than the ones obtained by decoupled pre-training.
arXiv Detail & Related papers (2022-03-04T07:32:16Z) - A training-free recursive multiresolution framework for diffeomorphic
deformable image registration [6.929709872589039]
We propose a novel diffeomorphic training-free approach for deformable image registration.
The proposed architecture is simple in design. The moving image is warped successively at each resolution and finally aligned to the fixed image.
The entire system is end-to-end and optimized for each pair of images from scratch.
arXiv Detail & Related papers (2022-02-01T15:17:17Z) - Adaptive Image Transformations for Transfer-based Adversarial Attack [73.74904401540743]
We propose a novel architecture, called Adaptive Image Transformation Learner (AITL)
Our elaborately designed learner adaptively selects the most effective combination of image transformations specific to the input image.
Our method significantly improves the attack success rates on both normally trained models and defense models under various settings.
arXiv Detail & Related papers (2021-11-27T08:15:44Z) - Style Intervention: How to Achieve Spatial Disentanglement with
Style-based Generators? [100.60938767993088]
We propose a lightweight optimization-based algorithm which could adapt to arbitrary input images and render natural translation effects under flexible objectives.
We verify the performance of the proposed framework in facial attribute editing on high-resolution images, where both photo-realism and consistency are required.
arXiv Detail & Related papers (2020-11-19T07:37:31Z) - A Flexible Framework for Designing Trainable Priors with Adaptive
Smoothing and Game Encoding [57.1077544780653]
We introduce a general framework for designing and training neural network layers whose forward passes can be interpreted as solving non-smooth convex optimization problems.
We focus on convex games, solved by local agents represented by the nodes of a graph and interacting through regularization functions.
This approach is appealing for solving imaging problems, as it allows the use of classical image priors within deep models that are trainable end to end.
arXiv Detail & Related papers (2020-06-26T08:34:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.