Attentive Symmetric Autoencoder for Brain MRI Segmentation
- URL: http://arxiv.org/abs/2209.08887v1
- Date: Mon, 19 Sep 2022 09:43:19 GMT
- Title: Attentive Symmetric Autoencoder for Brain MRI Segmentation
- Authors: Junjia Huang, Haofeng Li, Guanbin Li, Xiang Wan
- Abstract summary: We propose a novel Attentive Symmetric Auto-encoder based on Vision Transformer (ViT) for 3D brain MRI segmentation tasks.
In the pre-training stage, the proposed auto-encoder pays more attention to reconstruct the informative patches according to the gradient metrics.
Experimental results show that our proposed attentive symmetric auto-encoder outperforms the state-of-the-art self-supervised learning methods and medical image segmentation models.
- Score: 56.02577247523737
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Self-supervised learning methods based on image patch reconstruction have
witnessed great success in training auto-encoders, whose pre-trained weights
can be transferred to fine-tune other downstream tasks of image understanding.
However, existing methods seldom study the various importance of reconstructed
patches and the symmetry of anatomical structures, when they are applied to 3D
medical images. In this paper we propose a novel Attentive Symmetric
Auto-encoder (ASA) based on Vision Transformer (ViT) for 3D brain MRI
segmentation tasks. We conjecture that forcing the auto-encoder to recover
informative image regions can harvest more discriminative representations, than
to recover smooth image patches. Then we adopt a gradient based metric to
estimate the importance of each image patch. In the pre-training stage, the
proposed auto-encoder pays more attention to reconstruct the informative
patches according to the gradient metrics. Moreover, we resort to the prior of
brain structures and develop a Symmetric Position Encoding (SPE) method to
better exploit the correlations between long-range but spatially symmetric
regions to obtain effective features. Experimental results show that our
proposed attentive symmetric auto-encoder outperforms the state-of-the-art
self-supervised learning methods and medical image segmentation models on three
brain MRI segmentation benchmarks.
Related papers
- Autoregressive Sequence Modeling for 3D Medical Image Representation [48.706230961589924]
We introduce a pioneering method for learning 3D medical image representations through an autoregressive sequence pre-training framework.
Our approach various 3D medical images based on spatial, contrast, and semantic correlations, treating them as interconnected visual tokens within a token sequence.
arXiv Detail & Related papers (2024-09-13T10:19:10Z) - Mask-Guided Attention U-Net for Enhanced Neonatal Brain Extraction and Image Preprocessing [0.9674145073701153]
We introduce MGA-Net, a novel mask-guided attention neural network.
It is designed to extract the brain from other structures and reconstruct high-quality brain images.
We extensively validated the proposed MGA-Net on diverse datasets from varied clinical settings and neonatal age groups.
arXiv Detail & Related papers (2024-06-25T16:48:18Z) - NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation [55.51412454263856]
This paper proposes to directly modulate the generation process of diffusion models using fMRI signals.
By training with about 67,000 fMRI-image pairs from various individuals, our model enjoys superior fMRI-to-image decoding capacity.
arXiv Detail & Related papers (2024-03-27T02:42:52Z) - fMRI-PTE: A Large-scale fMRI Pretrained Transformer Encoder for
Multi-Subject Brain Activity Decoding [54.17776744076334]
We propose fMRI-PTE, an innovative auto-encoder approach for fMRI pre-training.
Our approach involves transforming fMRI signals into unified 2D representations, ensuring consistency in dimensions and preserving brain activity patterns.
Our contributions encompass introducing fMRI-PTE, innovative data transformation, efficient training, a novel learning strategy, and the universal applicability of our approach.
arXiv Detail & Related papers (2023-11-01T07:24:22Z) - Disruptive Autoencoders: Leveraging Low-level features for 3D Medical
Image Pre-training [51.16994853817024]
This work focuses on designing an effective pre-training framework for 3D radiology images.
We introduce Disruptive Autoencoders, a pre-training framework that attempts to reconstruct the original image from disruptions created by a combination of local masking and low-level perturbations.
The proposed pre-training framework is tested across multiple downstream tasks and achieves state-of-the-art performance.
arXiv Detail & Related papers (2023-07-31T17:59:42Z) - 3D Brainformer: 3D Fusion Transformer for Brain Tumor Segmentation [6.127298607534532]
Deep learning has recently emerged to improve brain tumor segmentation.
Transformers have been leveraged to address the limitations of convolutional networks.
We propose a 3D Transformer-based segmentation approach.
arXiv Detail & Related papers (2023-04-28T02:11:29Z) - HybridMIM: A Hybrid Masked Image Modeling Framework for 3D Medical Image
Segmentation [29.15746532186427]
HybridMIM is a novel hybrid self-supervised learning method based on masked image modeling for 3D medical image segmentation.
We learn the semantic information of medical images at three levels, including:1) partial region prediction to reconstruct key contents of the 3D image, which largely reduces the pre-training time burden.
The proposed framework is versatile to support both CNN and transformer as encoder backbones, and also enables to pre-train decoders for image segmentation.
arXiv Detail & Related papers (2023-03-18T04:43:12Z) - Atrous Residual Interconnected Encoder to Attention Decoder Framework
for Vertebrae Segmentation via 3D Volumetric CT Images [1.8146155083014204]
This paper proposes a novel algorithm for automated vertebrae segmentation via 3D volumetric spine CT images.
The proposed model is based on the structure of encoder to decoder, using layer normalization to optimize mini-batch training performance.
The experimental results show that our model achieves competitive performance compared with other state-of-the-art medical semantic segmentation methods.
arXiv Detail & Related papers (2021-04-08T12:09:16Z) - Improved Slice-wise Tumour Detection in Brain MRIs by Computing
Dissimilarities between Latent Representations [68.8204255655161]
Anomaly detection for Magnetic Resonance Images (MRIs) can be solved with unsupervised methods.
We have proposed a slice-wise semi-supervised method for tumour detection based on the computation of a dissimilarity function in the latent space of a Variational AutoEncoder.
We show that by training the models on higher resolution images and by improving the quality of the reconstructions, we obtain results which are comparable with different baselines.
arXiv Detail & Related papers (2020-07-24T14:02:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.