Fast computation of mutual information in the frequency domain with
applications to global multimodal image alignment
- URL: http://arxiv.org/abs/2106.14699v1
- Date: Mon, 28 Jun 2021 13:27:05 GMT
- Title: Fast computation of mutual information in the frequency domain with
applications to global multimodal image alignment
- Authors: Johan \"Ofverstedt, Joakim Lindblad, Nata\v{s}a Sladoje
- Abstract summary: Information-theoretic concept of mutual information (MI) is widely used as a similarity measure to guide multimodal alignment processes.
We propose an efficient algorithm for computed MI for all discrete spatial displacements.
We evaluate the efficacy of the proposed method on three benchmark datasets.
- Score: 3.584984184069584
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multimodal image alignment is the process of finding spatial correspondences
between images formed by different imaging techniques or under different
conditions, to facilitate heterogeneous data fusion and correlative analysis.
The information-theoretic concept of mutual information (MI) is widely used as
a similarity measure to guide multimodal alignment processes, where most works
have focused on local maximization of MI that typically works well only for
small displacements; this points to a need for global maximization of MI, which
has previously been computationally infeasible due to the high run-time
complexity of existing algorithms. We propose an efficient algorithm for
computing MI for all discrete displacements (formalized as the cross-mutual
information function (CMIF)), which is based on cross-correlation computed in
the frequency domain. We show that the algorithm is equivalent to a direct
method while asymptotically superior in terms of run-time. Furthermore, we
propose a method for multimodal image alignment for transformation models with
few degrees of freedom (e.g. rigid) based on the proposed CMIF-algorithm. We
evaluate the efficacy of the proposed method on three distinct benchmark
datasets, of aerial images, cytological images, and histological images, and we
observe excellent success-rates (in recovering known rigid transformations),
overall outperforming alternative methods, including local optimization of MI
as well as several recent deep learning-based approaches. We also evaluate the
run-times of a GPU implementation of the proposed algorithm and observe
speed-ups from 100 to more than 10,000 times for realistic image sizes compared
to a GPU implementation of a direct method. Code is shared as open-source at
\url{github.com/MIDA-group/globalign}.
Related papers
- Fast and Scalable Semi-Supervised Learning for Multi-View Subspace Clustering [13.638434337947302]
FSSMSC is a novel solution to the high computational complexity commonly found in existing approaches.
The method generates a consensus anchor graph across all views, representing each data point as a sparse linear combination of chosen landmarks.
The effectiveness and efficiency of FSSMSC are validated through extensive experiments on multiple benchmark datasets of varying scales.
arXiv Detail & Related papers (2024-08-11T06:54:00Z) - Multiway Point Cloud Mosaicking with Diffusion and Global Optimization [74.3802812773891]
We introduce a novel framework for multiway point cloud mosaicking (named Wednesday)
At the core of our approach is ODIN, a learned pairwise registration algorithm that identifies overlaps and refines attention scores.
Tested on four diverse, large-scale datasets, our method state-of-the-art pairwise and rotation registration results by a large margin on all benchmarks.
arXiv Detail & Related papers (2024-03-30T17:29:13Z) - QN-Mixer: A Quasi-Newton MLP-Mixer Model for Sparse-View CT Reconstruction [0.0]
We introduce QN-Mixer, an algorithm based on the quasi-Newton approach.
Incept-Mixer is an efficient neural architecture that serves as a non-local regularization term.
Our approach intelligently downsamples information, significantly reducing computational requirements.
arXiv Detail & Related papers (2024-02-28T00:20:25Z) - MB-RACS: Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network [65.1004435124796]
We propose a Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network (MB-RACS) framework.
Our experiments demonstrate that the proposed MB-RACS method surpasses current leading methods.
arXiv Detail & Related papers (2024-01-19T04:40:20Z) - Late Fusion Multi-view Clustering via Global and Local Alignment
Maximization [61.89218392703043]
Multi-view clustering (MVC) optimally integrates complementary information from different views to improve clustering performance.
Most of existing approaches directly fuse multiple pre-specified similarities to learn an optimal similarity matrix for clustering.
We propose late fusion MVC via alignment to address these issues.
arXiv Detail & Related papers (2022-08-02T01:49:31Z) - Dual-Flow Transformation Network for Deformable Image Registration with
Region Consistency Constraint [95.30864269428808]
Current deep learning (DL)-based image registration approaches learn the spatial transformation from one image to another by leveraging a convolutional neural network.
We present a novel dual-flow transformation network with region consistency constraint which maximizes the similarity of ROIs within a pair of images.
Experiments on four public 3D MRI datasets show that the proposed method achieves the best registration performance in accuracy and generalization.
arXiv Detail & Related papers (2021-12-04T05:30:44Z) - Image-specific Convolutional Kernel Modulation for Single Image
Super-resolution [85.09413241502209]
In this issue, we propose a novel image-specific convolutional modulation kernel (IKM)
We exploit the global contextual information of image or feature to generate an attention weight for adaptively modulating the convolutional kernels.
Experiments on single image super-resolution show that the proposed methods achieve superior performances over state-of-the-art methods.
arXiv Detail & Related papers (2021-11-16T11:05:10Z) - Cross-Sim-NGF: FFT-Based Global Rigid Multimodal Alignment of Image
Volumes using Normalized Gradient Fields [3.584984184069584]
Multimodal image alignment involves finding spatial correspondences between volumes varying in appearance and structure.
We propose a global optimization method for rigid multimodal 3D image alignment, based on a novel efficient algorithm for computing similarity of normalized fields (NGF) in the frequency domain.
arXiv Detail & Related papers (2021-10-19T15:28:37Z) - An Optimal Control Framework for Joint-channel Parallel MRI
Reconstruction without Coil Sensitivities [5.536263246814308]
We develop a novel calibration-free fast parallel MRI (pMRI) reconstruction method incorporate with discrete-time optimal control framework.
We propose to recover both magnitude and phase information by taking advantage of structured multiplayer convolutional networks in image and Fourier spaces.
arXiv Detail & Related papers (2021-09-20T06:42:42Z) - LocalTrans: A Multiscale Local Transformer Network for Cross-Resolution
Homography Estimation [52.63874513999119]
Cross-resolution image alignment is a key problem in multiscale giga photography.
Existing deep homography methods neglecting the explicit formulation of correspondences between them, which leads to degraded accuracy in cross-resolution challenges.
We propose a local transformer network embedded within a multiscale structure to explicitly learn correspondences between the multimodal inputs.
arXiv Detail & Related papers (2021-06-08T02:51:45Z) - Distributed Optimization, Averaging via ADMM, and Network Topology [0.0]
We study the connection between network topology and convergence rates for different algorithms on a real world problem of sensor localization.
We also show interesting connections between ADMM and lifted Markov chains besides providing an explicitly characterization of its convergence.
arXiv Detail & Related papers (2020-09-05T21:44:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.