Advances in Medical Image Analysis with Vision Transformers: A
Comprehensive Review
- URL: http://arxiv.org/abs/2301.03505v3
- Date: Sun, 5 Nov 2023 07:45:28 GMT
- Title: Advances in Medical Image Analysis with Vision Transformers: A
Comprehensive Review
- Authors: Reza Azad, Amirhossein Kazerouni, Moein Heidari, Ehsan Khodapanah
Aghdam, Amirali Molaei, Yiwei Jia, Abin Jose, Rijo Roy, Dorit Merhof
- Abstract summary: We provide an encyclopedic review of the applications of Transformers in medical imaging.
Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks.
- Score: 6.953789750981636
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The remarkable performance of the Transformer architecture in natural
language processing has recently also triggered broad interest in Computer
Vision. Among other merits, Transformers are witnessed as capable of learning
long-range dependencies and spatial correlations, which is a clear advantage
over convolutional neural networks (CNNs), which have been the de facto
standard in Computer Vision problems so far. Thus, Transformers have become an
integral part of modern medical image analysis. In this review, we provide an
encyclopedic review of the applications of Transformers in medical imaging.
Specifically, we present a systematic and thorough review of relevant recent
Transformer literature for different medical image analysis tasks, including
classification, segmentation, detection, registration, synthesis, and clinical
report generation. For each of these applications, we investigate the novelty,
strengths and weaknesses of the different proposed strategies and develop
taxonomies highlighting key properties and contributions. Further, if
applicable, we outline current benchmarks on different datasets. Finally, we
summarize key challenges and discuss different future research directions. In
addition, we have provided cited papers with their corresponding
implementations in https://github.com/mindflow-institue/Awesome-Transformer.
Related papers
- Transformers-based architectures for stroke segmentation: A review [0.6554326244334866]
Stroke remains a significant global health concern, necessitating precise and efficient diagnostic tools for timely intervention and improved patient outcomes.
Transformers, initially designed for natural language processing, have exhibited remarkable capabilities in various computer vision applications, including medical image analysis.
This review aims to provide an in-depth exploration of the cutting-edge Transformer-based architectures applied in the context of stroke segmentation.
arXiv Detail & Related papers (2024-03-27T14:42:08Z) - A Comprehensive Survey on Applications of Transformers for Deep Learning
Tasks [60.38369406877899]
Transformer is a deep neural network that employs a self-attention mechanism to comprehend the contextual relationships within sequential data.
transformer models excel in handling long dependencies between input sequence elements and enable parallel processing.
Our survey encompasses the identification of the top five application domains for transformer-based models.
arXiv Detail & Related papers (2023-06-11T23:13:51Z) - Vision Transformers in Medical Imaging: A Review [0.0]
Transformer, a model comprising attention-based encoder-decoder architecture, have gained prevalence in the field of natural language processing (NLP)
In this paper, we attempt to provide a comprehensive and recent review on the application of transformers in medical imaging by; describing the transformer model comparing it with a diversity of convolutional neural networks (CNNs)
arXiv Detail & Related papers (2022-11-18T05:52:37Z) - Transforming medical imaging with Transformers? A comparative review of
key properties, current progresses, and future perspectives [21.164122592628388]
Transformer, the latest technological advance of deep learning, has gained prevalence in natural language processing or computer vision.
We offer a comprehensive review of the state-of-the-art Transformer-based approaches for medical imaging.
arXiv Detail & Related papers (2022-06-02T16:38:31Z) - Transformers in Medical Image Analysis: A Review [46.71636151229035]
Our paper presents both a position paper and a primer, promoting awareness and application of Transformers in the field of medical image analysis.
Specifically, we first overview the core concepts of the attention mechanism built into Transformers and other basic components.
Second, we give a new taxonomy of various Transformer architectures tailored for medical image applications and discuss their limitations.
arXiv Detail & Related papers (2022-02-24T16:04:03Z) - Transformers in Medical Imaging: A Survey [88.03790310594533]
Transformers have been successfully applied to several computer vision problems, achieving state-of-the-art results.
Medical imaging has also witnessed growing interest for Transformers that can capture global context compared to CNNs with local receptive fields.
We provide a review of the applications of Transformers in medical imaging covering various aspects, ranging from recently proposed architectural designs to unsolved issues.
arXiv Detail & Related papers (2022-01-24T18:50:18Z) - Medical Transformer: Gated Axial-Attention for Medical Image
Segmentation [73.98974074534497]
We study the feasibility of using Transformer-based network architectures for medical image segmentation tasks.
We propose a Gated Axial-Attention model which extends the existing architectures by introducing an additional control mechanism in the self-attention module.
To train the model effectively on medical images, we propose a Local-Global training strategy (LoGo) which further improves the performance.
arXiv Detail & Related papers (2021-02-21T18:35:14Z) - Transformers in Vision: A Survey [101.07348618962111]
Transformers enable modeling long dependencies between input sequence elements and support parallel processing of sequence.
Transformers require minimal inductive biases for their design and are naturally suited as set-functions.
This survey aims to provide a comprehensive overview of the Transformer models in the computer vision discipline.
arXiv Detail & Related papers (2021-01-04T18:57:24Z) - A Survey on Visual Transformer [126.56860258176324]
Transformer is a type of deep neural network mainly based on the self-attention mechanism.
In this paper, we review these vision transformer models by categorizing them in different tasks and analyzing their advantages and disadvantages.
arXiv Detail & Related papers (2020-12-23T09:37:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.