TransBTSV2: Wider Instead of Deeper Transformer for Medical Image
Segmentation
- URL: http://arxiv.org/abs/2201.12785v1
- Date: Sun, 30 Jan 2022 11:00:34 GMT
- Title: TransBTSV2: Wider Instead of Deeper Transformer for Medical Image
Segmentation
- Authors: Jiangyun Li, Wenxuan Wang, Chen Chen, Tianxiang Zhang, Sen Zha, Hong
Yu, Jing Wang
- Abstract summary: We exploit Transformer in 3D CNN for 3D medical image segmentation.
We propose a novel network named TransBTSV2 based on the encoder-decoder structure.
As a hybrid CNN-Transformer architecture, TransBTSV2 can achieve accurate segmentation of medical images without any pre-training.
- Score: 12.85662034471981
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Transformer, benefiting from global (long-range) information modeling using
self-attention mechanism, has been successful in natural language processing
and computer vision recently. Convolutional Neural Networks, capable of
capturing local features, are unable to model explicit long-distance
dependencies from global feature space. However, both local and global features
are crucial for dense prediction tasks, especially for 3D medical image
segmentation. In this paper, we exploit Transformer in 3D CNN for 3D medical
image volumetric segmentation and propose a novel network named TransBTSV2
based on the encoder-decoder structure. Different from our original TransBTS,
the proposed TransBTSV2 is not limited to brain tumor segmentation (BTS) but
focuses on general medical image segmentation, providing a strong and efficient
3D baseline for volumetric segmentation of medical images. As a hybrid
CNN-Transformer architecture, TransBTSV2 can achieve accurate segmentation of
medical images without any pre-training. With the proposed insight to redesign
the internal structure of Transformer and the introduced Deformable Bottleneck
Module, a highly efficient architecture is achieved with superior performance.
Extensive experimental results on four medical image datasets (BraTS 2019,
BraTS 2020, LiTS 2017 and KiTS 2019) demonstrate that TransBTSV2 achieves
comparable or better results as compared to the state-of-the-art methods for
the segmentation of brain tumor, liver tumor as well as kidney tumor. Code is
available at https://github.com/Wenxuan-1119/TransBTS.
Related papers
- 3D TransUNet: Advancing Medical Image Segmentation through Vision
Transformers [40.21263511313524]
Medical image segmentation plays a crucial role in advancing healthcare systems for disease diagnosis and treatment planning.
The u-shaped architecture, popularly known as U-Net, has proven highly successful for various medical image segmentation tasks.
To address these limitations, researchers have turned to Transformers, renowned for their global self-attention mechanisms.
arXiv Detail & Related papers (2023-10-11T18:07:19Z) - 3DSAM-adapter: Holistic adaptation of SAM from 2D to 3D for promptable tumor segmentation [52.699139151447945]
We propose a novel adaptation method for transferring the segment anything model (SAM) from 2D to 3D for promptable medical image segmentation.
Our model can outperform domain state-of-the-art medical image segmentation models on 3 out of 4 tasks, specifically by 8.25%, 29.87%, and 10.11% for kidney tumor, pancreas tumor, colon cancer segmentation, and achieve similar performance for liver tumor segmentation.
arXiv Detail & Related papers (2023-06-23T12:09:52Z) - MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet [55.16833099336073]
We propose to self-distill a Transformer-based UNet for medical image segmentation.
It simultaneously learns global semantic information and local spatial-detailed features.
Our MISSU achieves the best performance over previous state-of-the-art methods.
arXiv Detail & Related papers (2022-06-02T07:38:53Z) - PHTrans: Parallelly Aggregating Global and Local Representations for
Medical Image Segmentation [7.140322699310487]
We propose a novel hybrid architecture for medical image segmentation called PHTrans.
PHTrans parallelly hybridizes Transformer and CNN in main building blocks to produce hierarchical representations from global and local features.
arXiv Detail & Related papers (2022-03-09T08:06:56Z) - Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation [63.46694853953092]
Swin-Unet is an Unet-like pure Transformer for medical image segmentation.
tokenized image patches are fed into the Transformer-based U-shaped decoder-Decoder architecture.
arXiv Detail & Related papers (2021-05-12T09:30:26Z) - TransBTS: Multimodal Brain Tumor Segmentation Using Transformer [9.296315610803985]
We propose a novel network named TransBTS based on the encoder-decoder structure.
To capture the local 3D context information, the encoder first utilizes 3D CNN to extract the volumetric feature maps.
Meanwhile, the feature maps are reformed elaborately for tokens that are fed into Transformer for global feature modeling.
arXiv Detail & Related papers (2021-03-07T19:12:14Z) - CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image
Segmentation [95.51455777713092]
Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation.
We propose a novel framework that efficiently bridges a bf Convolutional neural network and a bf Transformer bf (CoTr) for accurate 3D medical image segmentation.
arXiv Detail & Related papers (2021-03-04T13:34:22Z) - Medical Transformer: Gated Axial-Attention for Medical Image
Segmentation [73.98974074534497]
We study the feasibility of using Transformer-based network architectures for medical image segmentation tasks.
We propose a Gated Axial-Attention model which extends the existing architectures by introducing an additional control mechanism in the self-attention module.
To train the model effectively on medical images, we propose a Local-Global training strategy (LoGo) which further improves the performance.
arXiv Detail & Related papers (2021-02-21T18:35:14Z) - TransUNet: Transformers Make Strong Encoders for Medical Image
Segmentation [78.01570371790669]
Medical image segmentation is an essential prerequisite for developing healthcare systems.
On various medical image segmentation tasks, the u-shaped architecture, also known as U-Net, has become the de-facto standard.
We propose TransUNet, which merits both Transformers and U-Net, as a strong alternative for medical image segmentation.
arXiv Detail & Related papers (2021-02-08T16:10:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.