Joint Estimation of Image Representations and their Lie Invariants
- URL: http://arxiv.org/abs/2012.02903v2
- Date: Tue, 8 Dec 2020 13:28:42 GMT
- Title: Joint Estimation of Image Representations and their Lie Invariants
- Authors: Christine Allen-Blanchette and Kostas Daniilidis
- Abstract summary: Images encode both the state of the world and its content.
The automatic extraction of this information is challenging because of the high-dimensionality and entangled encoding inherent to the image representation.
This article introduces two theoretical approaches aimed at the resolution of these challenges.
- Score: 57.3768308075675
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Images encode both the state of the world and its content. The former is
useful for tasks such as planning and control, and the latter for
classification. The automatic extraction of this information is challenging
because of the high-dimensionality and entangled encoding inherent to the image
representation. This article introduces two theoretical approaches aimed at the
resolution of these challenges. The approaches allow for the interpolation and
extrapolation of images from an image sequence by joint estimation of the image
representation and the generators of the sequence dynamics. In the first
approach, the image representations are learned using probabilistic PCA
\cite{tipping1999probabilistic}. The linear-Gaussian conditional distributions
allow for a closed form analytical description of the latent distributions but
assumes the underlying image manifold is a linear subspace. In the second
approach, the image representations are learned using probabilistic nonlinear
PCA which relieves the linear manifold assumption at the cost of requiring a
variational approximation of the latent distributions. In both approaches, the
underlying dynamics of the image sequence are modelled explicitly to
disentangle them from the image representations. The dynamics themselves are
modelled with Lie group structure which enforces the desirable properties of
smoothness and composability of inter-image transformations.
Related papers
- A nonlinear elasticity model in computer vision [0.0]
The purpose of this paper is to analyze a nonlinear elasticity model previously introduced by the authors for comparing two images.
The existence of transformations is proved among derivatives of $-valued pairs of gradient vector-valued intensity maps.
The question is as to whether for images related by a linear mapping the uniquer is given by that.
arXiv Detail & Related papers (2024-08-30T12:27:22Z) - Bayesian Unsupervised Disentanglement of Anatomy and Geometry for Deep Groupwise Image Registration [50.62725807357586]
This article presents a general Bayesian learning framework for multi-modal groupwise image registration.
We propose a novel hierarchical variational auto-encoding architecture to realise the inference procedure of the latent variables.
Experiments were conducted to validate the proposed framework, including four different datasets from cardiac, brain, and abdominal medical images.
arXiv Detail & Related papers (2024-01-04T08:46:39Z) - Image Inpainting via Tractable Steering of Diffusion Models [54.13818673257381]
This paper proposes to exploit the ability of Tractable Probabilistic Models (TPMs) to exactly and efficiently compute the constrained posterior.
Specifically, this paper adopts a class of expressive TPMs termed Probabilistic Circuits (PCs)
We show that our approach can consistently improve the overall quality and semantic coherence of inpainted images with only 10% additional computational overhead.
arXiv Detail & Related papers (2023-11-28T21:14:02Z) - Real-World Image Variation by Aligning Diffusion Inversion Chain [53.772004619296794]
A domain gap exists between generated images and real-world images, which poses a challenge in generating high-quality variations of real-world images.
We propose a novel inference pipeline called Real-world Image Variation by ALignment (RIVAL)
Our pipeline enhances the generation quality of image variations by aligning the image generation process to the source image's inversion chain.
arXiv Detail & Related papers (2023-05-30T04:09:47Z) - Object-Centric Relational Representations for Image Generation [18.069747511100132]
This paper explores a novel method to condition image generation, based on object-centric relational representations.
We show that such architectural biases entail properties that facilitate the manipulation and conditioning of the generative process.
We also propose a novel benchmark for image generation consisting of a synthetic dataset of images paired with their relational representation.
arXiv Detail & Related papers (2023-03-26T11:17:17Z) - Tensor Component Analysis for Interpreting the Latent Space of GANs [41.020230946351816]
This paper addresses the problem of finding interpretable directions in the latent space of pre-trained Generative Adversarial Networks (GANs)
Our scheme allows for both linear edits corresponding to the individual modes of the tensor, and non-linear ones that model the multiplicative interactions between them.
We show experimentally that we can utilise the former to better separate style- from geometry-based transformations, and the latter to generate an extended set of possible transformations.
arXiv Detail & Related papers (2021-11-23T09:14:39Z) - NeurInt : Learning to Interpolate through Neural ODEs [18.104328632453676]
We propose a novel generative model that learns a distribution of trajectories between two images.
We demonstrate our approach's effectiveness in generating images improved quality as well as its ability to learn a diverse distribution over smooth trajectories for any pair of real source and target images.
arXiv Detail & Related papers (2021-11-07T16:31:18Z) - Invariant Deep Compressible Covariance Pooling for Aerial Scene
Categorization [80.55951673479237]
We propose a novel invariant deep compressible covariance pooling (IDCCP) to solve nuisance variations in aerial scene categorization.
We conduct extensive experiments on the publicly released aerial scene image data sets and demonstrate the superiority of this method compared with state-of-the-art methods.
arXiv Detail & Related papers (2020-11-11T11:13:07Z) - Deep Variational Network Toward Blind Image Restoration [60.45350399661175]
Blind image restoration is a common yet challenging problem in computer vision.
We propose a novel blind image restoration method, aiming to integrate both the advantages of them.
Experiments on two typical blind IR tasks, namely image denoising and super-resolution, demonstrate that the proposed method achieves superior performance over current state-of-the-arts.
arXiv Detail & Related papers (2020-08-25T03:30:53Z) - A Generalized Asymmetric Dual-front Model for Active Contours and Image
Segmentation [16.651587651843055]
Voronoi diagram-based dual-front active contour models are known as a powerful and efficient way for addressing the image segmentation and domain partitioning problems.
In this paper, we introduce a type of asymmetric quadratic metrics dual-front model.
The proposed dual-front model can be applied for image segmentation in conjunction with various region-based homogeneity terms.
arXiv Detail & Related papers (2020-06-14T08:24:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.