Bayesian Unsupervised Disentanglement of Anatomy and Geometry for Deep Groupwise Image Registration
- URL: http://arxiv.org/abs/2401.02141v2
- Date: Fri, 04 Oct 2024 13:29:14 GMT
- Title: Bayesian Unsupervised Disentanglement of Anatomy and Geometry for Deep Groupwise Image Registration
- Authors: Xinzhe Luo, Xin Wang, Linda Shapiro, Chun Yuan, Jianfeng Feng, Xiahai Zhuang,
- Abstract summary: This article presents a general Bayesian learning framework for multi-modal groupwise image registration.
We propose a novel hierarchical variational auto-encoding architecture to realise the inference procedure of the latent variables.
Experiments were conducted to validate the proposed framework, including four different datasets from cardiac, brain, and abdominal medical images.
- Score: 50.62725807357586
- License:
- Abstract: This article presents a general Bayesian learning framework for multi-modal groupwise image registration. The method builds on probabilistic modelling of the image generative process, where the underlying common anatomy and geometric variations of the observed images are explicitly disentangled as latent variables. Therefore, groupwise image registration is achieved via hierarchical Bayesian inference. We propose a novel hierarchical variational auto-encoding architecture to realise the inference procedure of the latent variables, where the registration parameters can be explicitly estimated in a mathematically interpretable fashion. Remarkably, this new paradigm learns groupwise image registration in an unsupervised closed-loop self-reconstruction process, sparing the burden of designing complex image-based similarity measures. The computationally efficient disentangled network architecture is also inherently scalable and flexible, allowing for groupwise registration on large-scale image groups with variable sizes. Furthermore, the inferred structural representations from multi-modal images via disentanglement learning are capable of capturing the latent anatomy of the observations with visual semantics. Extensive experiments were conducted to validate the proposed framework, including four different datasets from cardiac, brain, and abdominal medical images. The results have demonstrated the superiority of our method over conventional similarity-based approaches in terms of accuracy, efficiency, scalability, and interpretability.
Related papers
- Joint segmentation and discontinuity-preserving deformable registration:
Application to cardiac cine-MR images [74.99415008543276]
Most deep learning-based registration methods assume that the deformation fields are smooth and continuous everywhere in the image domain.
We propose a novel discontinuity-preserving image registration method to tackle this challenge, which ensures globally discontinuous and locally smooth deformation fields.
A co-attention block is proposed in the segmentation component of the network to learn the structural correlations in the input images.
We evaluate our method on the task of intra-subject-temporal image registration using large-scale cinematic cardiac magnetic resonance image sequences.
arXiv Detail & Related papers (2022-11-24T23:45:01Z) - $\mathcal{X}$-Metric: An N-Dimensional Information-Theoretic Framework
for Groupwise Registration and Deep Combined Computing [14.36896617430302]
This paper presents a generic probabilistic framework for estimating the statistical dependency and finding the anatomical correspondences among medical images.
The method builds on a novel formulation of the $N$-dimensional joint intensity distribution by representing the common anatomy as latent variables.
Experiments were conducted to demonstrate the versatility and applicability of our model, including multimodal groupwise registration, motion correction for dynamic contrast enhanced magnetic resonance images, and deep combined computing for multimodal medical images.
arXiv Detail & Related papers (2022-11-03T07:39:10Z) - Unsupervised Multimodal Change Detection Based on Structural
Relationship Graph Representation Learning [40.631724905575034]
Unsupervised multimodal change detection is a practical and challenging topic that can play an important role in time-sensitive emergency applications.
We take advantage of two types of modality-independent structural relationships in multimodal images.
We present a structural relationship graph representation learning framework for measuring the similarity of the two structural relationships.
arXiv Detail & Related papers (2022-10-03T13:55:08Z) - Bayesian intrinsic groupwise registration via explicit hierarchical
disentanglement [18.374535632681884]
We propose a general framework which formulates groupwise registration as a procedure of hierarchical Bayesian inference.
Here, we propose a novel variational posterior and network architecture that facilitate joint learning of the common structural representation.
Results have demonstrated the efficacy of our framework in realizing multimodal groupwise registration in an end-to-end fashion.
arXiv Detail & Related papers (2022-06-06T06:13:24Z) - Deep Co-Attention Network for Multi-View Subspace Learning [73.3450258002607]
We propose a deep co-attention network for multi-view subspace learning.
It aims to extract both the common information and the complementary information in an adversarial setting.
In particular, it uses a novel cross reconstruction loss and leverages the label information to guide the construction of the latent representation.
arXiv Detail & Related papers (2021-02-15T18:46:44Z) - SIR: Self-supervised Image Rectification via Seeing the Same Scene from
Multiple Different Lenses [82.56853587380168]
We propose a novel self-supervised image rectification (SIR) method based on an important insight that the rectified results of distorted images of the same scene from different lens should be the same.
We leverage a differentiable warping module to generate the rectified images and re-distorted images from the distortion parameters.
Our method achieves comparable or even better performance than the supervised baseline method and representative state-of-the-art methods.
arXiv Detail & Related papers (2020-11-30T08:23:25Z) - Orientation-Disentangled Unsupervised Representation Learning for
Computational Pathology [6.468635277309852]
We propose to extend the Variational Auto-Encoder framework by leveraging the group structure of rotation-equivariant convolutional networks.
We show that the trained models efficiently disentangle the inherent orientation information of single-cell images.
arXiv Detail & Related papers (2020-08-26T16:57:45Z) - Learning Deformable Image Registration from Optimization: Perspective,
Modules, Bilevel Training and Beyond [62.730497582218284]
We develop a new deep learning based framework to optimize a diffeomorphic model via multi-scale propagation.
We conduct two groups of image registration experiments on 3D volume datasets including image-to-atlas registration on brain MRI data and image-to-image registration on liver CT data.
arXiv Detail & Related papers (2020-04-30T03:23:45Z) - Pathological Retinal Region Segmentation From OCT Images Using Geometric
Relation Based Augmentation [84.7571086566595]
We propose improvements over previous GAN-based medical image synthesis methods by jointly encoding the intrinsic relationship of geometry and shape.
The proposed method outperforms state-of-the-art segmentation methods on the public RETOUCH dataset having images captured from different acquisition procedures.
arXiv Detail & Related papers (2020-03-31T11:50:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.